-
The Llama 3 Herd of Models
Authors:
Abhimanyu Dubey,
Abhinav Jauhri,
Abhinav Pandey,
Abhishek Kadian,
Ahmad Al-Dahle,
Aiesha Letman,
Akhil Mathur,
Alan Schelten,
Amy Yang,
Angela Fan,
Anirudh Goyal,
Anthony Hartshorn,
Aobo Yang,
Archi Mitra,
Archie Sravankumar,
Artem Korenev,
Arthur Hinsvark,
Arun Rao,
Aston Zhang,
Aurelien Rodriguez,
Austen Gregerson,
Ava Spataru,
Baptiste Roziere,
Bethany Biron,
Binh Tang
, et al. (510 additional authors not shown)
Abstract:
Modern artificial intelligence (AI) systems are powered by foundation models. This paper presents a new set of foundation models, called Llama 3. It is a herd of language models that natively support multilinguality, coding, reasoning, and tool usage. Our largest model is a dense Transformer with 405B parameters and a context window of up to 128K tokens. This paper presents an extensive empirical evaluation of Llama 3. We find that Llama 3 delivers comparable quality to leading language models such as GPT-4 on a plethora of tasks. We publicly release Llama 3, including pre-trained and post-trained versions of the 405B parameter language model and our Llama Guard 3 model for input and output safety. The paper also presents the results of experiments in which we integrate image, video, and speech capabilities into Llama 3 via a compositional approach. We observe this approach performs competitively with the state-of-the-art on image, video, and speech recognition tasks. The resulting models are not yet being broadly released as they are still under development.
Submitted 15 August, 2024; v1 submitted 31 July, 2024;
originally announced July 2024.
-
SpinQuant: LLM quantization with learned rotations
Authors:
Zechun Liu,
Changsheng Zhao,
Igor Fedorov,
Bilge Soran,
Dhruv Choudhary,
Raghuraman Krishnamoorthi,
Vikas Chandra,
Yuandong Tian,
Tijmen Blankevoort
Abstract:
Post-training quantization (PTQ) techniques applied to weights, activations, and the KV cache greatly reduce memory usage, latency, and power consumption of Large Language Models (LLMs), but may lead to large quantization errors when outliers are present. Recent findings suggest that rotating activation or weight matrices helps remove outliers and benefits quantization. In this work, we identify a collection of applicable rotation parameterizations that lead to identical outputs in full-precision Transformer architectures, and find that some random rotations lead to much better quantization than others, with a difference of up to 13 points in downstream zero-shot reasoning performance. As a result, we propose SpinQuant, which optimizes (or learns) the rotation matrices with Cayley optimization on a small validation set. With 4-bit quantization of weights, activations, and the KV cache, SpinQuant narrows the accuracy gap on zero-shot reasoning tasks with full precision to merely 2.9 points on the LLaMA-2 7B model, surpassing LLM-QAT by 19.1 points and SmoothQuant by 25.0 points. SpinQuant also outperforms concurrent work QuaRot, which applies random rotations to remove outliers. In particular, for LLaMA-2 7B/LLaMA-3 8B models that are hard to quantize, SpinQuant reduces the gap to full precision by 30.2%/34.1% relative to QuaRot.
Submitted 28 May, 2024; v1 submitted 25 May, 2024;
originally announced May 2024.
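The rotation invariance that the SpinQuant abstract relies on can be illustrated on a single linear layer. The sketch below is not the paper's method: it uses a random orthogonal matrix rather than a learned one, and the layer size, the outlier channel, and the helper names `random_orthogonal` and `peak_to_rms` are assumptions made for illustration. It shows that inserting an orthogonal matrix and its transpose leaves the full-precision output unchanged, and it computes a peak-to-RMS ratio so the effect on activation outliers can be inspected.

```python
import numpy as np

rng = np.random.default_rng(0)

def random_orthogonal(n, rng):
    # QR of a Gaussian matrix yields an orthogonal factor Q (Q @ Q.T == I).
    q, r = np.linalg.qr(rng.standard_normal((n, n)))
    # Flipping column signs keeps Q orthogonal and removes the sign ambiguity of QR.
    return q * np.sign(np.diag(r))

def peak_to_rms(t):
    # Crude outlier measure: largest magnitude relative to the RMS value.
    return np.abs(t).max() / np.sqrt(np.mean(t ** 2))

d = 64                               # hypothetical hidden size
x = rng.standard_normal((8, d))      # hypothetical activations
x[:, 3] *= 50.0                      # hypothetical outlier channel
w = rng.standard_normal((d, d))      # hypothetical weight matrix
R = random_orthogonal(d, rng)

y_ref = x @ w                        # original layer
y_rot = (x @ R) @ (R.T @ w)          # rotated activations, counter-rotated weights

print("max output difference:", np.abs(y_ref - y_rot).max())
print("peak/RMS before rotation:", peak_to_rms(x))
print("peak/RMS after rotation: ", peak_to_rms(x @ R))
```

Because `R @ R.T` is the identity, the two outputs agree up to floating-point error; only the tensors that would be quantized (`x @ R` and `R.T @ w`) change.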
-
Knowledge Graph Reasoning Based on Attention GCN
Authors:
Meera Gupta,
Ravi Khanna,
Divya Choudhary,
Nandini Rao
Abstract:
We propose a novel technique to enhance Knowledge Graph Reasoning by combining Graph Convolution Neural Network (GCN) with the Attention Mechanism. This approach utilizes the Attention Mechanism to examine the relationships between entities and their neighboring nodes, which helps to develop detailed feature vectors for each entity. The GCN uses shared parameters to effectively represent the characteristics of adjacent entities. We first learn the similarity of entities for node representation learning. By integrating the attributes of the entities and their interactions, this method generates extensive implicit feature vectors for each entity, improving performance in tasks including entity classification and link prediction, outperforming traditional neural network models. To conclude, this work provides crucial methodological support for a range of applications, such as search engines, question-answering systems, recommendation systems, and data integration tasks.
Submitted 27 January, 2024; v1 submitted 2 December, 2023;
originally announced December 2023.
-
Microscaling Data Formats for Deep Learning
Authors:
Bita Darvish Rouhani,
Ritchie Zhao,
Ankit More,
Mathew Hall,
Alireza Khodamoradi,
Summer Deng,
Dhruv Choudhary,
Marius Cornea,
Eric Dellinger,
Kristof Denolf,
Stosic Dusan,
Venmugil Elango,
Maximilian Golub,
Alexander Heinecke,
Phil James-Roxby,
Dharmesh Jani,
Gaurav Kolhe,
Martin Langhammer,
Ada Li,
Levi Melnick,
Maral Mesmakhosroshahi,
Andres Rodriguez,
Michael Schulte,
Rasoul Shafipour,
Lei Shao
, et al. (8 additional authors not shown)
Abstract:
Narrow bit-width data formats are key to reducing the computational and storage costs of modern deep learning applications. This paper evaluates Microscaling (MX) data formats that combine a per-block scaling factor with narrow floating-point and integer types for individual elements. MX formats balance the competing needs of hardware efficiency, model accuracy, and user friction. Empirical results on over two dozen benchmarks demonstrate practicality of MX data formats as a drop-in replacement for baseline FP32 for AI inference and training with low user friction. We also show the first instance of training generative language models at sub-8-bit weights, activations, and gradients with minimal accuracy loss and no modifications to the training recipe.
Submitted 19 October, 2023; v1 submitted 16 October, 2023;
originally announced October 2023.
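The idea of combining a per-block scaling factor with narrow element types, as described in the Microscaling abstract, can be sketched as follows. This is a simplified illustration, not the MX formats themselves: the block size of 32, the power-of-two shared scale, the 8-bit integer elements, and the function names `block_quantize` and `block_dequantize` are all assumptions that the abstract does not state.

```python
import numpy as np

def block_quantize(x, block=32, bits=8):
    """Quantize a 1-D array in blocks that each share one power-of-two scale."""
    assert bits <= 8, "elements are stored as int8 in this sketch"
    x = np.asarray(x, dtype=np.float64)
    assert x.size % block == 0, "length must be a multiple of the block size"
    blocks = x.reshape(-1, block)
    qmax = 2 ** (bits - 1) - 1
    amax = np.abs(blocks).max(axis=1, keepdims=True)
    safe = np.where(amax > 0, amax, 1.0)           # avoid log2(0) for all-zero blocks
    scale = 2.0 ** np.ceil(np.log2(safe / qmax))   # smallest power of two covering the block
    q = np.clip(np.round(blocks / scale), -qmax, qmax).astype(np.int8)
    return q, scale

def block_dequantize(q, scale):
    """Reconstruct floating-point values from elements and per-block scales."""
    return (q.astype(np.float64) * scale).reshape(-1)

rng = np.random.default_rng(0)
x = rng.standard_normal(128) * np.repeat([0.01, 1.0, 10.0, 100.0], 32)
q, scale = block_quantize(x)
x_hat = block_dequantize(q, scale)
err = np.abs(x_hat - x).reshape(-1, 32)
print("per-block scales:", scale.ravel())
print("per-block max error:", err.max(axis=1))
```

Since each block carries its own scale, a block of small values is not forced onto the coarse grid needed by a block of large values; in this sketch the rounding error within a block is bounded by half of that block's scale.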
-
$μ$TAS: Design and implementation of Time Aware Shaper on SmartNICs to achieve bounded latency
Authors:
Joydeep Pal,
Deepak Choudhary,
Nithish Krishnabharathi Gnani,
Chandramani Singh,
T. V. Prabhakar
Abstract:
Time-Aware Shaper (TAS) is a time-triggered scheduling mechanism that ensures bounded latency for time-critical Scheduled Traffic (ST) flows. The Linux kernel implementation (a.k.a. TAPRIO) has limited capabilities due to varying CPU workloads and thus does not offer a tight latency bound for the ST flows. Also, currently only higher cycle times are possible. Other software implementations are limited to simulation studies without physical implementation. In this paper, we present $μ$TAS, a MicroC-based hardware implementation of TAS on a programmable SmartNIC. $μ$TAS takes advantage of the parallel-processing architecture of the SmartNIC to configure the scheduling behaviour of its queues at runtime. To demonstrate the effectiveness of $μ$TAS, we built a Time-Sensitive Networking (TSN) testbed from scratch. This consists of multiple end-hosts capable of generating ST and Best Effort (BE) flows and TSN switches equipped with SmartNICs running $μ$TAS. Time synchronization is maintained between the switches and hosts. Our experiments demonstrate that the ST flows experience a bounded latency of the order of tens of microseconds.
Submitted 11 October, 2023;
originally announced October 2023.
-
EdgeP4: A P4-Programmable Edge Intelligent Ethernet Switch for Tactile Cyber-Physical Systems
Authors:
Nithish Krishnabharathi Gnani,
Joydeep Pal,
Deepak Choudhary,
Himanshu Verma,
Soumya Kanta Rana,
Kaushal Mhapsekar,
T. V. Prabhakar,
Chandramani Singh
Abstract:
Tactile Internet based operations, e.g., telesurgery, rely on end-to-end closed-loop control for accuracy and corrections. The feedback and control are subject to network latency and loss. We design two edge intelligence algorithms hosted at P4-programmable end switches. These algorithms locally compute and command corrective signals, thereby sparing the feedback signals from traversing the network to the other end and saving on control-loop latency and network load. We implement these algorithms entirely on the data plane on Netronome Agilio SmartNICs using P4. Our first algorithm, $\textit{pose correction}$, is placed at the edge switch connected to an industrial robot gripping a tool. The round trip between transmitting force sensor array readings to the edge switch and receiving correct tip coordinates at the robot is shown to be less than $100~μs$. The second algorithm, $\textit{tremor suppression}$, is placed at the edge switch connected to the human operator. It suppresses physiological tremors of amplitudes smaller than $100~μm$, which not only improves the application's performance but also reduces the network load by up to $99.9\%$. Our solution allows edge intelligence modules to seamlessly switch between the algorithms based on the tasks being executed at the end hosts.
Submitted 19 September, 2023;
originally announced September 2023.
-
Ground-based monitoring of the variability of visible Solar spectral lines for improved understanding of solar and stellar magnetism and dynamics
Authors:
S. Criscuoli,
L. Bertello,
D. P. Choudhary,
M. DeLand,
G. Kopp,
A. Kowalski,
S. Marchenko,
K. Reardon,
A. A. Pevtsov,
D. Tilipman
Abstract:
Long-term high-cadence measurements of stellar spectral variability are fundamental to better understand stellar atmospheric properties and stellar magnetism. These, in turn, are fundamental for the detectability of exoplanets as well as the characterization of their atmospheres and habitability. The Sun, viewed as a star via disk-integrated observations, offers a means of exploring such measurements while also offering the spatially resolved observations that are necessary to discern the causes of observed spectral variations. High-spectral resolution observations of the solar spectrum are fundamental for a variety of Earth-system studies, including climate influences, renewable energies, and biology. The Integrated Sunlight Spectrometer at SOLIS has been acquiring daily high-spectral resolution Sun-as-a-star measurements since 2006. More recently, a few ground-based telescopes with the capability of monitoring the solar visible spectrum at high spectral resolution have been deployed (e.g. PEPSI, HARPS, NEID). However, the main scientific goal of these instruments is to detect exoplanets, and solar observations are acquired mainly as a reference. Consequently, their technical requirements are not ideal to monitor solar variations with high photometric stability, especially over solar-cycle temporal scales. The goal of this white paper is to emphasize the scientific return and explore the technical requirements of a network of ground-based spectrographs devoted to long-term monitoring of disk-integrated solar-spectral variability with high spectral resolution and high photometric stability, in conjunction with disk-resolved observations in selected spectral lines, to complement planet-hunter measurements and stellar-variability studies. The proposed network of instruments offers the opportunity for a larger variety of multidisciplinary studies.
Submitted 11 May, 2023;
originally announced May 2023.
-
Understanding Sun-as-a-star variability of solar Balmer lines
Authors:
Serena Criscuoli,
Sergey Marchenko,
Matthew DeLand,
Debi Choudhary,
Greg Kopp
Abstract:
Precise, high-cadence, long-term records of stellar spectral variability at different temporal scales lead to better understanding of a wide variety of phenomena including stellar atmospheres and dynamos, convective motions, and rotational periods. Here, we investigate the variability of solar Balmer lines (H-$α$, -$β$, -$γ$, -$δ$) observed by space-borne radiometers (OSIRIS, SCIAMACHY, OMI, and GOME-2), combining these precise, long-term observations with high-resolution data from the ground-based NSO/ISS spectrograph. We relate the detected variability to the appearance of magnetic features on the solar disk. We find that on solar-rotational timescales (about 1 month), the Balmer line activity indices (defined as line-core to line-wing ratios) closely follow variations in the total solar irradiance (which is predominantly photospheric), thus frequently (specifically, during passages of sunspot groups) deviating from the behavior of activity indices that track chromospheric activity levels. On longer timescales, the correlation with chromospheric indices increases, with periods of low- or even anti-correlation found at intermediate timescales. Comparison of these observations with estimates from semi-empirical irradiance reconstructions helps quantify the contributions of different magnetic and quiet features. We conclude that both the lower sensitivity to network and, in part, the higher sensitivity to filaments and prominences may result in complex, time-dependent relationships between Balmer and other chromospheric indices observed for the Sun and solar-like stars. The fact that core and wings contribute in a similar manner to the variability, together with current knowledge of Balmer-line formation in stellar atmospheres, supports the notion that Balmer-line core-to-wing ratio indices behave more like photospheric than chromospheric indices.
Submitted 9 May, 2023;
originally announced May 2023.
-
FlexShard: Flexible Sharding for Industry-Scale Sequence Recommendation Models
Authors:
Geet Sethi,
Pallab Bhattacharya,
Dhruv Choudhary,
Carole-Jean Wu,
Christos Kozyrakis
Abstract:
Sequence-based deep learning recommendation models (DLRMs) are an emerging class of DLRMs showing great improvements over their prior sum-pooling based counterparts at capturing users' long term interests. These improvements come at immense system cost however, with sequence-based DLRMs requiring substantial amounts of data to be dynamically materialized and communicated by each accelerator during a single iteration. To address this rapidly growing bottleneck, we present FlexShard, a new tiered sequence embedding table sharding algorithm which operates at a per-row granularity by exploiting the insight that not every row is equal. Through precise replication of embedding rows based on their underlying probability distribution, along with the introduction of a new sharding strategy adapted to the heterogeneous, skewed performance of real-world cluster network topologies, FlexShard is able to significantly reduce communication demand while using no additional memory compared to the prior state-of-the-art. When evaluated on production-scale sequence DLRMs, FlexShard was able to reduce overall global all-to-all communication traffic by over 85%, resulting in end-to-end training communication latency improvements of almost 6x over the prior state-of-the-art approach.
Submitted 7 January, 2023;
originally announced January 2023.
-
Sustained heating of the chromosphere and transition region over a sunspot light bridge
Authors:
Rohan E. Louis,
Shibu K. Mathew,
A. Raja Bayanna,
Christian Beck,
Debi P. Choudhary
Abstract:
Sunspot light bridges (LBs) exhibit a wide range of short-lived phenomena in the chromosphere and transition region. In contrast, we use here data from the Multi-Application Solar Telescope (MAST), the Interface Region Imaging Spectrograph (IRIS), Hinode, the Atmospheric Imaging Assembly (AIA), and the Helioseismic and Magnetic Imager (HMI) to analyze the sustained heating over days in an LB in a regular sunspot. Chromospheric temperatures were retrieved from the MAST Ca II and IRIS Mg II lines by nonlocal thermodynamic equilibrium inversions. Line widths, Doppler shifts, and intensities were derived from the IRIS lines using Gaussian fits. Coronal temperatures were estimated through the differential emission measure, while the coronal magnetic field was obtained from an extrapolation of the HMI vector field. At the photosphere, the LB exhibits a granular morphology with field strengths of about 400 G and no significant electric currents. The sunspot does not fragment, and the LB remains stable for several days. The chromospheric temperature, IRIS line intensities and widths, and AIA 171 Å and 211 Å intensities are all enhanced in the LB with temperatures from 8000 K to 2.5 MK. Photospheric plasma motions remain small, while the chromosphere and transition region indicate predominantly red-shifts of 5-20 km/s with occasional supersonic downflows exceeding 100 km/s. The excess thermal energy over the LB is about 3.2x10^26 erg and matches the radiative losses. It could be supplied by magnetic flux loss of the sunspot (7.5x10^27 erg), kinetic energy from the increase in the LB width (4x10^28 erg), or freefall of mass along the coronal loops (6.3x10^26 erg).
Submitted 2 January, 2023;
originally announced January 2023.
-
RecD: Deduplication for End-to-End Deep Learning Recommendation Model Training Infrastructure
Authors:
Mark Zhao,
Dhruv Choudhary,
Devashish Tyagi,
Ajay Somani,
Max Kaplan,
Sung-Han Lin,
Sarunya Pumma,
Jongsoo Park,
Aarti Basant,
Niket Agarwal,
Carole-Jean Wu,
Christos Kozyrakis
Abstract:
We present RecD (Recommendation Deduplication), a suite of end-to-end infrastructure optimizations across the Deep Learning Recommendation Model (DLRM) training pipeline. RecD addresses immense storage, preprocessing, and training overheads caused by feature duplication inherent in industry-scale DLRM training datasets. Feature duplication arises because DLRM datasets are generated from interactions. While each user session can generate multiple training samples, many features' values do not change across these samples. We demonstrate how RecD exploits this property, end-to-end, across a deployed training pipeline. RecD optimizes data generation pipelines to decrease dataset storage and preprocessing resource demands and to maximize duplication within a training batch. RecD introduces a new tensor format, InverseKeyedJaggedTensors (IKJTs), to deduplicate feature values in each batch. We show how DLRM model architectures can leverage IKJTs to drastically increase training throughput. RecD improves the training and preprocessing throughput and storage efficiency by up to 2.48x, 1.79x, and 3.71x, respectively, in an industry-scale DLRM training system.
Submitted 1 May, 2023; v1 submitted 9 November, 2022;
originally announced November 2022.
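The deduplication idea in the RecD abstract, storing each distinct feature value once per batch and keeping an index that maps samples back to it, can be sketched with NumPy. This is only an illustration of the general principle, not the InverseKeyedJaggedTensor format: the toy batch, the fixed-width feature rows, and the function names `dedup_batch` and `expensive_transform` are assumptions.

```python
import numpy as np

def dedup_batch(values):
    """Return the distinct rows of a (batch, dim) array and, for every
    sample, the index of its row among the distinct rows."""
    values = np.asarray(values)
    unique_rows, inverse = np.unique(values, axis=0, return_inverse=True)
    # reshape(-1) keeps the index 1-D regardless of the NumPy version.
    return unique_rows, inverse.reshape(-1)

def expensive_transform(rows):
    # Stand-in for per-feature work such as preprocessing or an embedding lookup.
    return rows.sum(axis=1)

# Hypothetical batch: samples 0, 1, and 3 come from the same session
# and therefore carry identical values for this feature.
batch = np.array([[1, 2, 3],
                  [1, 2, 3],
                  [7, 8, 9],
                  [1, 2, 3]])

unique_rows, inverse = dedup_batch(batch)
per_unique = expensive_transform(unique_rows)   # computed once per distinct row
per_sample = per_unique[inverse]                # expanded back to batch order

print("distinct rows:", len(unique_rows), "of", len(batch))
print("per-sample result:", per_sample)
```

The transform runs on two rows instead of four here; the saving grows with the amount of duplication inside a batch, which is why the abstract also mentions arranging data so that duplication within a batch is maximized.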
-
Future Gradient Descent for Adapting the Temporal Shifting Data Distribution in Online Recommendation Systems
Authors:
Mao Ye,
Ruichen Jiang,
Haoxiang Wang,
Dhruv Choudhary,
Xiaocong Du,
Bhargav Bhushanam,
Aryan Mokhtari,
Arun Kejariwal,
Qiang Liu
Abstract:
One of the key challenges of learning an online recommendation model is the temporal domain shift, which causes a mismatch between the training and testing data distributions and hence domain generalization error. To overcome this, we propose to learn a meta future gradient generator that forecasts the gradient information of the future data distribution for training so that the recommendation model can be trained as if we were able to look ahead at the future of its deployment. Compared with Batch Update, a widely used paradigm, our theory suggests that the proposed algorithm achieves smaller temporal domain generalization error measured by a gradient variation term in a local regret. We demonstrate the empirical advantage by comparing with various representative baselines.
Submitted 2 September, 2022;
originally announced September 2022.
-
AutoShard: Automated Embedding Table Sharding for Recommender Systems
Authors:
Daochen Zha,
Louis Feng,
Bhargav Bhushanam,
Dhruv Choudhary,
Jade Nie,
Yuandong Tian,
Jay Chae,
Yinbin Ma,
Arun Kejariwal,
Xia Hu
Abstract:
Embedding learning is an important technique in deep recommendation models to map categorical features to dense vectors. However, the embedding tables often demand an extremely large number of parameters, which become the storage and efficiency bottlenecks. Distributed training solutions have been adopted to partition the embedding tables across multiple devices. However, the embedding tables can easily lead to imbalances if not carefully partitioned. This is a significant design challenge of distributed systems named embedding table sharding, i.e., how we should partition the embedding tables to balance the costs across devices, which is a non-trivial task because 1) it is hard to efficiently and precisely measure the cost, and 2) the partition problem is known to be NP-hard. In this work, we introduce our novel practice at Meta, namely AutoShard, which uses a neural cost model to directly predict the multi-table costs and leverages deep reinforcement learning to solve the partition problem. Experimental results on an open-sourced large-scale synthetic dataset and Meta's production dataset demonstrate the superiority of AutoShard over the heuristics. Moreover, the learned policy of AutoShard can transfer to sharding tasks with various numbers of tables and different ratios of the unseen tables without any fine-tuning. Furthermore, AutoShard can efficiently shard hundreds of tables in seconds. The effectiveness, transferability, and efficiency of AutoShard make it desirable for production use. Our algorithms have been deployed in Meta's production environment. A prototype is available at https://github.com/daochenzha/autoshard
Submitted 12 August, 2022;
originally announced August 2022.
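For context on the partition problem described in the AutoShard abstract, the following is a minimal sketch of a greedy balancing heuristic of the kind such work is typically compared against. It is not AutoShard and not taken from the paper: it assumes each table has a known scalar cost and that costs on a device simply add up, which the abstract explicitly notes is hard to guarantee in practice. The function name `greedy_shard` and the example costs are hypothetical.

```python
import heapq

def greedy_shard(costs, num_devices):
    """Assign tables to devices, largest cost first, always choosing the
    device with the smallest accumulated cost so far."""
    heap = [(0, dev) for dev in range(num_devices)]   # (current load, device id)
    heapq.heapify(heap)
    assignment = [[] for _ in range(num_devices)]
    # Visit tables in order of decreasing cost (ties keep their original order).
    for table_id in sorted(range(len(costs)), key=lambda i: -costs[i]):
        load, dev = heapq.heappop(heap)
        assignment[dev].append(table_id)
        heapq.heappush(heap, (load + costs[table_id], dev))
    loads = [sum(costs[t] for t in tables) for tables in assignment]
    return assignment, loads

# Hypothetical per-table costs (arbitrary units) and two devices.
table_costs = [5, 3, 3, 2, 2, 1]
assignment, loads = greedy_shard(table_costs, num_devices=2)
print("assignment:", assignment)   # table ids per device
print("loads:", loads)
```

A heuristic like this is only as good as its cost estimates and its additivity assumption, which is the gap the abstract says a learned multi-table cost model is meant to address.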
-
If it Bleeds, it Leads: A Computational Approach to Covering Crime in Los Angeles
Authors:
Alexander Spangher,
Divya Choudhary
Abstract:
Developing and improving computational approaches to covering news can increase journalistic output and improve the way stories are covered. In this work we approach the problem of covering crime stories in Los Angeles. We present a machine-in-the-loop system that covers individual crimes by (1) learning the prototypical coverage archetypes from classical news articles on crime to learn their structure and (2) using output from the Los Angeles Police Department to generate "lede paragraphs", the first structural unit of crime articles. We introduce a probabilistic graphical model for learning article structure and a rule-based system for generating ledes. We hope our work can lead to systems that use these components together to form the skeletons of news articles covering crime.
This work was done for a class project in Jonathan May's Advanced Natural Language Processing Course, Fall, 2019.
Submitted 14 June, 2022;
originally announced June 2022.
-
Positive Unlabeled Contrastive Learning
Authors:
Anish Acharya,
Sujay Sanghavi,
Li Jing,
Bhargav Bhushanam,
Dhruv Choudhary,
Michael Rabbat,
Inderjit Dhillon
Abstract:
Self-supervised pretraining on unlabeled data followed by supervised fine-tuning on labeled data is a popular paradigm for learning from limited labeled examples. We extend this paradigm to the classical positive unlabeled (PU) setting, where the task is to learn a binary classifier given only a few labeled positive samples, and (often) a large amount of unlabeled samples (which could be positive or negative).
We first propose a simple extension of standard infoNCE family of contrastive losses, to the PU setting; and show that this learns superior representations, as compared to existing unsupervised and supervised approaches. We then develop a simple methodology to pseudo-label the unlabeled samples using a new PU-specific clustering scheme; these pseudo-labels can then be used to train the final (positive vs. negative) classifier. Our method handily outperforms state-of-the-art PU methods over several standard PU benchmark datasets, while not requiring a-priori knowledge of any class prior (which is a common assumption in other PU methods). We also provide a simple theoretical analysis that motivates our methods.
Submitted 28 March, 2024; v1 submitted 1 June, 2022;
originally announced June 2022.
-
The magnetic topology of the inverse Evershed flow
Authors:
A. Prasad,
M. Ranganathan,
C. Beck,
D. P. Choudhary,
Q. Hu
Abstract:
The inverse Evershed flow (IEF) is a mass motion towards sunspots at chromospheric heights. We combined high-resolution observations of NOAA 12418 from the Dunn Solar Telescope and vector magnetic field measurements from the Helioseismic and Magnetic Imager (HMI) to determine the driver of the IEF. We derived chromospheric line-of-sight (LOS) velocities from spectra of H$α$ and Ca II IR. The HMI data were used in a non-force-free magnetic field extrapolation to track closed field lines near the sunspot in the active region. We determined their length and height, located their inner and outer foot points, and derived flow velocities along them. The magnetic field lines related to the IEF reach on average a height of 3 Mm over a length of 13 Mm. The inner (outer) foot points are located at 1.2 (1.9) sunspot radii. The average field strength difference $ΔB$ between inner and outer foot points is +400 G. The temperature difference $ΔT$ is anti-correlated with $ΔB$ with an average value of -100 K. The pressure difference $Δp$ is dominated by $ΔB$ and is primarily positive with a driving force towards the inner foot points of 1.7 kPa on average. The velocities predicted from $Δp$ reproduce the LOS velocities of 2-10 km s$^{-1}$ with a square-root dependence. We find that the IEF is driven along magnetic field lines connecting network elements with the outer penumbra by a gas pressure difference that results from a difference in field strength as predicted by the classical siphon flow scenario.
Submitted 5 March, 2022;
originally announced March 2022.
-
Velocities of an Erupting Filament
Authors:
Shuo Wang,
Jack M. Jenkins,
Karin Muglach,
Valentin Martinez Pillet,
Christian Beck,
David M. Long,
Debi Prasad Choudhary,
James McAteer
Abstract:
Solar filaments exist as stable structures for extended periods of time before many of them form the core of a CME. We examine the properties of an erupting filament on 2017 May 29--30 with high-resolution He I 10830 Å and Halpha spectra from the Dunn Solar Telescope, full-disk Dopplergrams of He I 10830 Å from the Chromospheric Telescope, and EUV and coronagraph data from SDO and STEREO. Pre-eruption line-of-sight velocities from an inversion of He I with the HAZEL code exhibit coherent patches of 5 Mm extent that indicate counter-streaming and/or buoyant behavior. During the eruption, individual, aligned threads appear in the He I velocity maps. The distribution of velocities evolves from Gaussian to strongly asymmetric. The maximal optical depth of He I 10830 Å decreased from tau = 1.75 to 0.25, the temperature increased by 13 kK, and the average speed and width of the filament increased from 0 to 25 km s$^{-1}$ and 10 to 20 Mm, respectively. All data sources agree that the filament rose with an exponential acceleration reaching 7.4 m s$^{-2}$ that increased to a final velocity of 430 km s$^{-1}$ at 22:24 UT; a CME was associated with this filament eruption. The properties during the eruption favor a kink/torus instability, which requires the existence of a flux rope. We conclude that full-disk chromospheric Dopplergrams can be used to trace the initial phase of on-disk filament eruptions in real-time, which might potentially be useful for modelling the source of any subsequent CMEs.
Submitted 15 November, 2021;
originally announced November 2021.
-
Can't Fool Me: Adversarially Robust Transformer for Video Understanding
Authors:
Divya Choudhary,
Palash Goyal,
Saurabh Sahu
Abstract:
Deep neural networks have been shown to perform poorly on adversarial examples. To address this, several techniques have been proposed to increase robustness of a model for image classification tasks. However, in video understanding tasks, developing adversarially robust models is still unexplored. In this paper, we aim to bridge this gap. We first show that simple extensions of image based adversarially robust models slightly improve the worst-case performance. Further, we propose a temporal attention regularization scheme in Transformer to improve the robustness of attention modules to adversarial examples. We illustrate using a large-scale video data set YouTube-8M that the final model (A-ART) achieves close to non-adversarial performance on its adversarial example set. We achieve 91% GAP on adversarial examples, whereas baseline Transformer and simple adversarial extensions achieve 72.9% and 82% respectively, showing significant improvement in robustness over the state-of-the-art.
Submitted 26 October, 2021;
originally announced October 2021.
-
Frequency-aware SGD for Efficient Embedding Learning with Provable Benefits
Authors:
Yan Li,
Dhruv Choudhary,
Xiaohan Wei,
Baichuan Yuan,
Bhargav Bhushanam,
Tuo Zhao,
Guanghui Lan
Abstract:
Embedding learning has found widespread applications in recommendation systems and natural language modeling, among other domains. To learn quality embeddings efficiently, adaptive learning rate algorithms have demonstrated superior empirical performance over SGD, largely accredited to their token-dependent learning rate. However, the underlying mechanism for the efficiency of token-dependent learning rate remains underexplored. We show that incorporating frequency information of tokens in the embedding learning problems leads to provably efficient algorithms, and demonstrate that common adaptive algorithms implicitly exploit the frequency information to a large extent. Specifically, we propose (Counter-based) Frequency-aware Stochastic Gradient Descent, which applies a frequency-dependent learning rate for each token, and exhibits provable speed-up compared to SGD when the token distribution is imbalanced. Empirically, we show the proposed algorithms are able to improve or match adaptive algorithms on benchmark recommendation tasks and a large-scale industrial recommendation system, closing the performance gap between SGD and adaptive algorithms. Our results are the first to show token-dependent learning rate provably improves convergence for non-convex embedding learning problems.
Submitted 23 November, 2021; v1 submitted 10 October, 2021;
originally announced October 2021.
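The counter-based idea in the abstract above (a per-token learning rate that depends on how often the token has been seen) can be illustrated with a minimal sketch. The function name `frequency_aware_step`, the `base_lr` parameter, and the inverse-square-root schedule are all assumptions made for illustration; the abstract does not state the paper's actual schedule.

```python
from collections import defaultdict


def frequency_aware_step(embeddings, counts, token, grad, base_lr=0.1):
    """Apply one SGD update to a single token's embedding.

    The step size shrinks as the token's occurrence counter grows, so
    rare tokens take larger steps than frequent ones.
    """
    counts[token] += 1
    # Hypothetical schedule (not taken from the paper): base_lr / sqrt(count).
    lr = base_lr / (counts[token] ** 0.5)
    embeddings[token] = [w - lr * g for w, g in zip(embeddings[token], grad)]
    return lr


# Usage: two updates to the same token; the second uses a smaller step.
embeddings = {"item_42": [1.0, 1.0]}
counts = defaultdict(int)
first_lr = frequency_aware_step(embeddings, counts, "item_42", [1.0, 0.0])
second_lr = frequency_aware_step(embeddings, counts, "item_42", [1.0, 0.0])
```

Keeping the counter alongside the embedding table means only an integer per token is stored, in contrast to adaptive methods that keep per-coordinate statistics.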
-
Heating of the solar chromosphere in a sunspot light bridge by electric currents
Authors:
Rohan E. Louis,
Avijeet Prasad,
Christian Beck,
Debi Prasad Choudhary,
Mehmet S. Yalim
Abstract:
Context: Resistive Ohmic dissipation has been suggested as a mechanism for heating the solar chromosphere, but few studies have established this association. Aim: We aim to determine how Ohmic dissipation by electric currents can heat the solar chromosphere. Methods: We combine high-resolution spectroscopic Ca II data from the Dunn Solar Telescope and vector magnetic field observations from the Helioseismic and Magnetic Imager (HMI) to investigate thermal enhancements in a sunspot light bridge. The photospheric magnetic field from HMI was extrapolated to the corona using a non-force-free field technique that provided the three-dimensional distribution of electric currents, while an inversion of the chromospheric Ca II line with a local thermodynamic equilibrium and a nonlocal thermodynamic equilibrium spectral archive delivered the temperature stratifications from the photosphere to the chromosphere. Results: We find that the light bridge is a site of strong electric currents, of about 0.3 A/m^2 at the bottom boundary, which extend to about 0.7 Mm while decreasing monotonically with height. These currents produce a chromospheric temperature excess of about 600-800 K relative to the umbra. Only the light bridge, where relatively weak and highly inclined magnetic fields emerge over a duration of 13 hr, shows a spatial coincidence of thermal enhancements and electric currents. The temperature enhancements and the Cowling heating are primarily confined to a height range of 0.4-0.7 Mm above the light bridge. The corresponding increase in internal energy of 200 J/m^3 can be supplied by the heating in about 10 min. Conclusions: Our results provide direct evidence for currents heating the lower solar chromosphere through Ohmic dissipation.
Submitted 26 July, 2021;
originally announced July 2021.
-
Low-Precision Hardware Architectures Meet Recommendation Model Inference at Scale
Authors:
Zhaoxia Deng,
Jongsoo Park,
Ping Tak Peter Tang,
Haixin Liu,
Jie Yang,
Hector Yuen,
Jianyu Huang,
Daya Khudia,
Xiaohan Wei,
Ellie Wen,
Dhruv Choudhary,
Raghuraman Krishnamoorthi,
Carole-Jean Wu,
Satish Nadathur,
Changkyu Kim,
Maxim Naumov,
Sam Naghshineh,
Mikhail Smelyanskiy
Abstract:
Tremendous success of machine learning (ML) and the unabated growth in ML model complexity motivated many ML-specific designs in both CPU and accelerator architectures to speed up the model inference. While these architectures are diverse, highly optimized low-precision arithmetic is a component shared by most. Impressive compute throughputs are indeed often exhibited by these architectures on benchmark ML models. Nevertheless, production models such as recommendation systems important to Facebook's personalization services are demanding and complex: These systems must serve billions of users per month responsively with low latency while maintaining high prediction accuracy, notwithstanding computations with many tens of billions of parameters per inference. Do these low-precision architectures work well with our production recommendation systems? They do. But not without significant effort. We share in this paper our search strategies to adapt reference recommendation models to low-precision hardware, our optimization of low-precision compute kernels, and the design and development of a tool chain so as to maintain our models' accuracy throughout their lifespan during which topic trends and users' interests inevitably evolve. Practicing these low-precision technologies helped us save datacenter capacities while deploying models with up to 5X complexity that would otherwise not be deployed on traditional general-purpose CPUs. We believe these lessons from the trenches promote better co-design between hardware architecture and software engineering and advance the state of the art of ML in industry.
Submitted 26 May, 2021;
originally announced May 2021.
-
Alternate Model Growth and Pruning for Efficient Training of Recommendation Systems
Authors:
Xiaocong Du,
Bhargav Bhushanam,
Jiecao Yu,
Dhruv Choudhary,
Tianxiang Gao,
Sherman Wong,
Louis Feng,
Jongsoo Park,
Yu Cao,
Arun Kejariwal
Abstract:
Deep learning recommendation systems at scale have provided remarkable gains through increasing model capacity (i.e. wider and deeper neural networks), but it comes at significant training cost and infrastructure cost. Model pruning is an effective technique to reduce computation overhead for deep neural networks by removing redundant parameters. However, modern recommendation systems are still thirsty for model capacity due to the demand for handling big data. Thus, pruning a recommendation model at scale results in a smaller model capacity and consequently lower accuracy. To reduce computation cost without sacrificing model capacity, we propose a dynamic training scheme, namely alternate model growth and pruning, to alternately construct and prune weights in the course of training. Our method leverages structured sparsification to reduce computational cost without hurting the model capacity at the end of offline training so that a full-size model is available in the recurring training stage to learn new data in real-time. To the best of our knowledge, this is the first work to provide in-depth experiments and discussion of applying structural dynamics to recommendation systems at scale to reduce training cost. The proposed method is validated with an open-source deep-learning recommendation model (DLRM) and state-of-the-art industrial-scale production models.
Submitted 3 May, 2021;
originally announced May 2021.
-
Neurological Status Classification Using Convolutional Neural Network
Authors:
Mehrad Jaloli,
Divya Choudhary,
Marzia Cescon
Abstract:
In this study we show that a Convolutional Neural Network (CNN) model is able to accurately discriminate between 4 different phases of neurological status in a non-Electroencephalogram (EEG) dataset recorded in an experiment in which subjects are exposed to physical, cognitive and emotional stress. We demonstrate that the proposed model is able to obtain 99.99% Area Under the Curve (AUC) of Receiver Operating Characteristic (ROC) and 99.82% classification accuracy on the test dataset. Furthermore, for comparison, we show that our models outperform traditional classification methods such as SVM and RF. Finally, we show the advantage of CNN models, in comparison to other methods, in robustness to noise, with 97.46% accuracy on a noisy dataset.
Submitted 1 April, 2021;
originally announced April 2021.
-
The formation of an atypical sunspot light bridge as a result of large-scale flux emergence
Authors:
Rohan E. Louis,
Christian Beck,
Debi P. Choudhary
Abstract:
We use a combination of full-disk data from the Solar Dynamics Observatory and high-resolution data from the Dunn Solar Telescope (DST) to study the formation, structure, and evolution of an atypical light bridge (LB) in a regular sunspot. The LB results from the emergence of magnetic flux with one footpoint rooted in a pore outside the parent sunspot that appears about 17 hrs before the LB. The pore has a polarity opposite to that of the sunspot and recedes away from it at a speed of about 0.4 km/s. This is accompanied by the development of an elongated magnetic channel in the outer penumbra which triggers the formation of the LB when it reaches the inner penumbral boundary. The LB is a nearly horizontal structure with a field strength of about 1.2 kG that exhibits long-lived photospheric blue-shifts of about 0.85 km/s along its entire length. The emergence of the LB leads to dynamic surges in the chromosphere and transition region about 13 min later. We derived the photospheric and chromospheric structure of the LB in the DST data from spectral line parameters and inversions of He I at 1083 nm, Si I at 1082.7 nm, Ca II IR at 854 nm and Halpha at 656 nm, and speckle-reconstructed imaging at 700 nm and 430 nm. The LB shows an elongated filamentary shape in the photosphere without lateral extrusions. The thermal inversion of Ca II IR reveals the LB to be about 600-800 K hotter than the umbra. Different sections of the LB are elevated to heights between 400 and 700 km. Our results indicate that the LB formation is part of a flux emergence event with the LB envelope reaching a height of about 29 Mm before dissolving after about 13 hr. We suggest that the existence of persistent, large-scale photospheric blue-shifts in LBs is the most likely criterion to distinguish between flux emergence events and overturning convection in field-free umbral intrusions.
Submitted 27 October, 2020;
originally announced October 2020.
-
Training Recommender Systems at Scale: Communication-Efficient Model and Data Parallelism
Authors:
Vipul Gupta,
Dhruv Choudhary,
Ping Tak Peter Tang,
Xiaohan Wei,
Xing Wang,
Yuzhen Huang,
Arun Kejariwal,
Kannan Ramchandran,
Michael W. Mahoney
Abstract:
In this paper, we consider hybrid parallelism -- a paradigm that employs both Data Parallelism (DP) and Model Parallelism (MP) -- to scale distributed training of large recommendation models. We propose a compression framework called Dynamic Communication Thresholding (DCT) for communication-efficient hybrid training. DCT filters the entities to be communicated across the network through a simple hard-thresholding function, allowing only the most relevant information to pass through. For communication-efficient DP, DCT compresses the parameter gradients sent to the parameter server during model synchronization. The threshold is updated only once every few thousand iterations to reduce the computational overhead of compression. For communication-efficient MP, DCT incorporates a novel technique to compress the activations and gradients sent across the network during the forward and backward propagation, respectively. This is done by identifying and updating only the most relevant neurons of the neural network for each training sample in the data. We evaluate DCT on publicly available natural language processing and recommender models and datasets, as well as recommendation systems used in production at Facebook. DCT reduces communication by at least $100\times$ and $20\times$ during DP and MP, respectively. The algorithm has been deployed in production, and it improves end-to-end training time for a state-of-the-art industrial recommender model by 37%, without any loss in performance.
Submitted 21 May, 2021; v1 submitted 17 October, 2020;
originally announced October 2020.
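The hard-thresholding filter described in the abstract above can be sketched in a few lines. This is an illustrative sketch only: the helper names `pick_threshold` and `hard_threshold`, the `keep_fraction` parameter, and the choice of the k-th largest magnitude as the threshold are assumptions, not details confirmed by the abstract.

```python
def pick_threshold(values, keep_fraction):
    """Return the magnitude of the k-th largest entry (assumed rule).

    In the scheme described in the abstract this would be recomputed
    only once every few thousand iterations, not at every step.
    """
    k = max(1, int(len(values) * keep_fraction))
    magnitudes = sorted((abs(v) for v in values), reverse=True)
    return magnitudes[k - 1]


def hard_threshold(values, tau):
    """Keep only entries with |value| >= tau, as sparse (index, value) pairs."""
    return [(i, v) for i, v in enumerate(values) if abs(v) >= tau]


# Usage: compute the threshold once, then reuse it to compress a gradient.
gradient = [0.05, -2.0, 0.3, 1.5, -0.01, 0.0, 0.2, -0.4, 0.1, 0.02]
tau = pick_threshold(gradient, keep_fraction=0.2)
compressed = hard_threshold(gradient, tau)
```

Reusing a stale threshold between refreshes replaces a sort at every iteration with a single comparison per entry, which is the overhead reduction the abstract alludes to.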
-
Adaptive Dense-to-Sparse Paradigm for Pruning Online Recommendation System with Non-Stationary Data
Authors:
Mao Ye,
Dhruv Choudhary,
Jiecao Yu,
Ellie Wen,
Zeliang Chen,
Jiyan Yang,
Jongsoo Park,
Qiang Liu,
Arun Kejariwal
Abstract:
Large scale deep learning provides a tremendous opportunity to improve the quality of content recommendation systems by employing both wider and deeper models, but this comes at great infrastructural cost and carbon footprint in modern data centers. Pruning is an effective technique that reduces both memory and compute demand for model inference. However, pruning for online recommendation systems is challenging due to the continuous data distribution shift (a.k.a. non-stationary data). Although incremental training on the full model is able to adapt to the non-stationary data, directly applying it on the pruned model leads to accuracy loss. This is because the sparsity pattern after pruning requires adjustment to learn new patterns. To the best of our knowledge, this is the first work to provide in-depth analysis and discussion of applying pruning to online recommendation systems with non-stationary data distribution. Overall, this work makes the following contributions: 1) We present an adaptive dense to sparse paradigm equipped with a novel pruning algorithm for pruning a large scale recommendation system with non-stationary data distribution; 2) We design the pruning algorithm to automatically learn the sparsity across layers to avoid repeating hand-tuning, which is critical for pruning the heterogeneous architectures of recommendation systems trained with non-stationary data.
Submitted 21 October, 2020; v1 submitted 16 October, 2020;
originally announced October 2020.
-
Center-to-Limb Variation of the Inverse Evershed Flow
Authors:
C. Beck,
D. P. Choudhary,
M. Ranganathan
Abstract:
We present the properties of the inverse Evershed flow (IEF) based on the center-to-limb variation of the plasma speed and loop geometry of chromospheric superpenumbral fibrils in eleven sunspots that were located at a wide range of heliocentric angles from 12 to 79 deg. The observations were acquired at the Dunn Solar Telescope in the spectral lines of Halpha at 656 nm, Ca II IR at 854 nm and He I at 1083 nm. All sunspots display opposite line-of-sight (LOS) velocities on the limb and center side with a distinct shock signature near the outer penumbral edge. We developed a simplified flexible sunspot model assuming axisymmetry and prescribing the radial flow speed profile at a known loop geometry to replicate the observed two-dimensional IEF patterns under different viewing angles. The simulated flow maps match the observations for chromospheric loops with 10-20 Mm length starting at 0.8-1.1 sunspot radii, an apex height of 2-3 Mm and a true constant flow speed of 2-9 km/s. We find on average a good agreement of the simulated velocities and the observations on elliptical annuli around the sunspot. Individual IEF channels show a significant range of variation in their properties and reach maximal LOS speeds of up to 12 km/s. Upwards or downwards directed flows do not show a change of sign in the LOS velocities for heliocentric angles above 30 deg. Our results are consistent with the IEF being caused by a siphon flow mechanism driving a flow at a constant sonic speed along elevated loops with a flattened top in the chromosphere.
Submitted 28 August, 2020;
originally announced August 2020.
-
Temporal Evolution of the Inverse Evershed Flow
Authors:
C. Beck,
D. P. Choudhary
Abstract:
The inverse Evershed flow (IEF) is an inflow of material into the penumbra of sunspots in the solar chromosphere that occurs along dark, elongated superpenumbral fibrils extending from about the outer edge of the moat cell to the sunspot. The IEF channels exhibit brightenings in the penumbra, where the supersonic IEF descends to the photosphere causing shock fronts with localized heating. We used a 1-hr time-series of spectroscopic observations of the chromospheric spectral lines of Ca II IR at 854 nm and H$α$ at 656 nm taken with IBIS at the DST to investigate the temporal evolution of IEF channels. Complementary information on the photospheric magnetic field was obtained from observations with FIRS at 1083 nm and HMI. We find that individual IEF channels are long-lived (10-60 min) and only show minor changes in position and flow speed during their lifetime. Initiation and termination of IEF channels takes several minutes. The IEF channels with line-of-sight velocities of about 10 km/s show no lasting impact from transient or oscillatory phenomena with maximal velocity amplitudes of only about 1 km/s that run along them. We could not detect any clear correlation of the location and evolution of IEF channels to local magnetic field properties in the photosphere in the penumbra or moving magnetic features in the sunspot moat. Our results support a picture of the IEF as a field-aligned siphon flow along arched loops. From our data we cannot determine if their evolution is controlled by events at the outer end in the moat or at the inner end in the penumbra.
Submitted 11 February, 2020;
originally announced February 2020.
-
Magnetic Structure of an Erupting Filament
Authors:
Shuo Wang,
Jack M. Jenkins,
Valentin Martinez Pillet,
Christian Beck,
David M. Long,
Debi Prasad Choudhary,
Karin Muglach,
James McAteer
Abstract:
The full 3-D vector magnetic field of a solar filament prior to eruption is presented. The filament was observed with the Facility Infrared Spectropolarimeter at the Dunn Solar Telescope in the chromospheric He I line at 10830 Å on May 29 and 30, 2017. We inverted the spectropolarimetric observations with the HAnle and ZEeman Light (HAZEL) code to obtain the chromospheric magnetic field. A bimodal distribution of field strength was found in or near the filament. The average field strength was 24 Gauss, but prior to the eruption we find the 90th percentile of field strength was 435 Gauss for the observations on May 29. The field inclination was about 67 degrees from the solar vertical. The field azimuth made an angle of about 47 to 65 degrees to the spine axis. The results suggest an inverse configuration indicative of a flux rope topology. He I intensity threads were found to be co-aligned with the magnetic field direction. The filament had a sinistral configuration as expected for the southern hemisphere. The filament was stable on May 29, 2017 and started to rise during two observations on May 30, before erupting and causing a minor coronal mass ejection. There was no obvious change of the magnetic topology during the eruption process. Such information on the magnetic topology of erupting filaments could improve the prediction of the geoeffectiveness of solar storms.
Submitted 7 February, 2020; v1 submitted 6 February, 2020;
originally announced February 2020.
-
2D non-LTE modelling of a filament observed in the H$α$ line with the DST/IBIS spectropolarimeter
Authors:
P. Schwartz,
S. Gunar,
J. M. Jenkins,
D. M. Long,
P. Heinzel,
D. P. Choudhary
Abstract:
We study a fragment of a large quiescent filament observed on May 29, 2017 by the Interferometric BIdimensional Spectropolarimeter (IBIS) mounted at the Dunn Solar Telescope. We focus on its quiescent stage prior to its eruption. We analyse the spectral observations obtained in the H$α$ line to derive the thermodynamic properties of the plasma of the observed fragment of the filament. We used a 2D filament model employing radiative transfer computations under conditions that depart from the local thermodynamic equilibrium. We employed a forward modelling technique in which we used the 2D model to produce synthetic H$α$ line profiles that we compared with the observations. We then found the set of model input parameters, which produces synthetic spectra with the best agreement with observations. Our analysis shows that one part of the observed fragment of the filament is cooler, denser, and more dynamic than its other part that is hotter, less dense, and more quiescent. The derived temperatures in the first part range from 6,000 K to 10,000 K and in the latter part from 11,000 K to 14,000 K. The gas pressure is 0.2-0.4 dyn/cm$^{2}$ in the first part and around 0.15 dyn/cm$^{2}$ in the latter part. The more dynamic nature of the first part is characterised by the line-of-sight velocities with absolute values of 6-7 km/s and microturbulent velocities of 8-9 km/s. On the other hand, the latter part exhibits line-of-sight velocities with absolute values of 0-2.5 km/s and microturbulent velocities of 4-6 km/s.
Submitted 8 October, 2019;
originally announced October 2019.
-
Magnetic Properties and Flow Angle of the Inverse Evershed Flow at Its Downflow Points
Authors:
C. Beck,
D. P. Choudhary
Abstract:
We determined the direction and strength of the photospheric and lower chromospheric magnetic field in the umbra and penumbra of a sunspot from inversions of spectropolarimetric observations of photospheric lines at 617 nm and 1565 nm, and the chromospheric Ca II IR line at 854 nm, respectively. We compare the magnetic field vector with the direction of 75 flow channels that harbor the chromospheric inverse Evershed effect (IEF) near their downflow points (DFPs) in the sunspot's penumbra. The azimuth and inclination of the IEF channels to the line of sight (LOS) were derived from spatial maps of the LOS velocity and line-core intensity of the Ca II IR line and a thermal inversion of the Ca II IR spectra to obtain temperature cubes. We find that the flow direction of the IEF near the DFPs is aligned with the photospheric magnetic field to within about $\pm$15 deg. The IEF flow fibrils make an angle of 30--90 deg to the local vertical with an average value of about 65 deg. The average field strength at the DFPs is about 1.3 kG. Our findings suggest that the IEF in the lower chromosphere is a field-aligned siphon flow, where the larger field strength at the inner footpoints together with the lower temperature in the penumbra causes the necessary gas pressure difference relative to the outer footpoints in the hotter quiet Sun with lower magnetic field strength. The IEF connects to magnetic field lines that are not horizontal like for the regular photospheric Evershed flow, but which continue upwards into the chromosphere indicating an "uncombed" penumbral structure.
Submitted 12 February, 2019;
originally announced February 2019.
-
Evolution of Photospheric Vector Magnetic Field Associated with Moving Flare Ribbons As Seen By GST
Authors:
Chang Liu,
Wenda Cao,
Jongchul Chae,
Kwangsu Ahn,
Debi Prasad Choudhary,
Jeongwoo Lee,
Rui Liu,
Na Deng,
Jiasheng Wang,
Haimin Wang
Abstract:
The photospheric response to solar flares, also known as coronal back reaction, is often observed as sudden flare-induced changes in vector magnetic field and sunspot motions. However, it remains obscure whether evolving flare ribbons, the flare signature closest to the photosphere, are accompanied by changes in vector magnetic field therein. Here we explore the relationship between the dynamics of flare ribbons in the chromosphere and variations of magnetic fields in the underlying photosphere, using high-resolution off-band H-alpha images and near-infrared vector magnetograms of the M6.5 flare on 2015 June 22 observed with the 1.6 m Goode Solar Telescope. We find that changes of photospheric fields occur at the arrival of the flare ribbon front, thus propagating analogously to flare ribbons. In general, the horizontal field increases and the field lines become more inclined to the surface. When ribbons sweep through regions that undergo a rotational motion, the fields transiently turn more vertical with decreased horizontal field and inclination angle, and then restore and/or become more horizontal than before the ribbon arrival. The ribbon propagation decelerates near the sunspot rotation center, where the vertical field becomes permanently enhanced. Similar magnetic field changes are discernible in magnetograms from the Helioseismic and Magnetic Imager (HMI), and an inward collapse of coronal magnetic fields is inferred from the time sequence of non-linear force-free field models extrapolated from HMI magnetograms. We conclude that photospheric fields respond nearly instantaneously to magnetic reconnection in the corona.
Submitted 27 October, 2018;
originally announced October 2018.
-
Super Penumbral Chromospheric Flare
Authors:
S. Liu,
H. Q. Zhang,
D. P. Choudhary,
A. K. Srivastava,
B. N. Dwivedi
Abstract:
We observed a C-class flare at the outer boundary of the super-penumbra of a sunspot. The flare was triggered by an emerging magnetic bipolar region that was obliquely oriented with respect to the super-penumbral fibrils. The flare started due to low-height magnetic reconnection of the emerging magnetic flux with the super-penumbral field, resulting in hot multi-temperature plasma flows in the inverse Evershed flow channel and its overlying atmosphere. The inverse Evershed flows in the chromosphere start from the super-penumbra, move towards the sunspot, and end at the outer boundary of the penumbra. The hot plasma flow towards the sunspot in the inverse Evershed channels shows about 10 km/s higher velocity in Hα wavelengths compared to the plasma emissions at various temperatures as seen in different AIA filters. Even though these velocities are about seven times higher than the typical inverse Evershed flow speeds, the flow is diminished at the outer boundary of the sunspot's penumbra. This suggests that the super-penumbral field lines that carry the inverse Evershed flows are discontinued at the boundary where the penumbral field lines dive into the Sun, and that these two sets of field lines are completely distinct. The discontinuity in the typical magnetic field and plasma properties where these two sets of field lines adjoin further leads to a discontinuity in the characteristic magnetoacoustic and Alfvén speeds, thereby stopping the plasma flows from continuing further. The multi-temperature plasma in the inverse Evershed channels exhibits possible longitudinal oscillations initially during the onset of the flare, and later flows towards the sunspot. In the multi-temperature view, the different layers above the flare region have a mixture of supersonic and subsonic flows.
Submitted 1 June, 2018;
originally announced June 2018.
-
Thermodynamic Properties of the Evershed Flow in the Lower Chromosphere
Authors:
Debi Prasad Choudhary,
Christian Beck
Abstract:
We used spectropolarimetric observations of a sunspot in active region NOAA 11809 in the Ca II line at 854.2 nm taken with the SpectroPolarimeter for Optical and Infrared Regions (SPINOR) at the Dunn Solar Telescope to infer thermodynamic parameters along one hundred super-penumbral fibrils that harbor the inverse Evershed flow. The fibrils were identified in line-of-sight (LOS) velocity and line-core intensity maps and were located in a segment of the sunspot that showed a regular penumbra in the photosphere. The chromospheric LOS velocity abruptly decreases from values of 3 to 15 km/s to zero at the inner footpoints of the fibrils that are located from the mid penumbra to about 1.4 spot radii. The spectra often show multiple components, i.e., one at the rest wavelength and one with a strong red shift, which indicates spatially or vertically unresolved structures. The line-core intensity always peaks slightly closer to the umbra than the LOS velocity. An inversion of the spectra with the CAlcium Inversion using a Spectral ARchive (CAISAR) code provided us with temperature stratifications that allowed us to trace individual fibrils through the atmosphere and to determine the angle of the flows relative to the surface without any additional assumptions on the flow topology such as radial symmetry. We find that the fibrils are not horizontal near the downflow points, but make an angle of 30 to 60 degrees to the local vertical. The temperature is enhanced by 200 K at log(tau) ~ -2 and up to 2000 K at log(tau) ~ -6 over that of the quiet Sun, whereas there is no signature in the low photosphere. Our results are consistent with a critical, i.e., sonic, or super-sonic siphon flow along super-penumbral flux tubes in which accelerating plasma abruptly attains sub-critical velocity through a standing shock in or near the penumbra as predicted by Montesinos & Thomas (1993).
Submitted 19 April, 2018;
originally announced April 2018.
-
Statistical Investigation of Supersonic Downflows in the Transition Region above Sunspots
Authors:
Tanmoy Samanta,
Hui Tian,
Debi Prasad Choudhary
Abstract:
Downflows at supersonic speeds have been observed in the transition region (TR) above sunspots for more than three decades. These downflows are often seen in different TR spectral lines above sunspots. We have performed a statistical investigation of these downflows using a large sample which was missing earlier. The Interface Region Imaging Spectrograph (IRIS) has provided a wealth of observational data of sunspots at high spatial and spectral resolution in the past few years. We have identified sixty datasets obtained with IRIS raster scans. Using an automated code, we identified the locations of strong downflows within these sunspots. We found that around eighty percent of our sample show supersonic downflows in the Si IV 1403 Å line. These downflows mostly appear in the penumbral regions, though some of them are found in the umbrae. We also found that almost half of these downflows show signatures in chromospheric lines. Furthermore, a detailed spectral analysis was performed by selecting a small spectral window containing the O IV 1400/1401 Å and Si IV 1403 Å lines. Six Gaussian functions were simultaneously fitted to these three spectral lines and their satellite lines associated with the supersonic downflows. We calculated the intensity, Doppler velocity and line width for these lines. Using the O IV 1400/1401 Å line ratio, we find that the downflow components are around one order of magnitude less dense than the regular components. Results from our statistical analysis suggest that these downflows may originate from the corona and that they are independent of the background TR plasma.
Submitted 13 April, 2018;
originally announced April 2018.
-
High-resolution Observations of Halpha Spectra with a Subtractive Double Pass
Authors:
C. Beck,
R. Rezaei,
D. Prasad Choudhary,
S. Gosain,
A. Tritschler,
R. E. Louis
Abstract:
High-resolution imaging spectroscopy in solar physics has relied on Fabry-Perot Interferometers (FPIs) in recent years. FPI systems, however, get technically challenging and expensive for telescopes larger than the 1-m class. A conventional slit spectrograph with a diffraction-limited performance over a large field of view (FOV) can be built at much lower cost and effort. It can be converted to an imaging spectro(polari)meter using the concept of a subtractive double pass (SDP). We demonstrate that an SDP system can reach a performance similar to that of FPI-based systems with a high spatial and moderate spectral resolution across a FOV of 100"x100" with a spectral coverage of 1 nm. We use Halpha spectra taken with an SDP system at the Dunn Solar Telescope and complementary full-disc data to infer the properties of small-scale superpenumbral filaments. We find that the majority of all filaments end in patches of opposite-polarity fields. The internal fine-structure in the line-core intensity of Halpha at spatial scales of about 0.5" exceeds that in other parameters such as the line width, indicating small-scale opacity effects in a larger-scale structure with common properties. We conclude that SDP systems are a valid alternative to FPI systems when high spatial resolution and a large FOV are required. They also can reach a cadence that is comparable to that of FPI systems, while providing a much larger spectral range.
Submitted 19 December, 2017;
originally announced December 2017.
-
On the Runtime-Efficacy Trade-off of Anomaly Detection Techniques for Real-Time Streaming Data
Authors:
Dhruv Choudhary,
Arun Kejariwal,
Francois Orsini
Abstract:
The ever-growing volume and velocity of data, coupled with the decreasing attention span of end users, underscore the critical need for real-time analytics. In this regard, anomaly detection plays a key role as an application as well as a means to verify data fidelity. Although the subject of anomaly detection has been researched for over 100 years in a multitude of disciplines such as, but not limited to, astronomy, statistics, manufacturing, econometrics, and marketing, most of the existing techniques cannot be used as is on real-time data streams. Further, the lack of characterization of performance -- both with respect to real-timeliness and accuracy -- on production data sets makes model selection very challenging. To this end, we present an in-depth analysis, geared towards real-time streaming data, of anomaly detection techniques. Given the requirements with respect to real-timeliness and accuracy, the analysis presented in this paper should serve as a guide for selection of the "best" anomaly detection technique. To the best of our knowledge, this is the first characterization of anomaly detection techniques proposed in a very diverse set of fields, using production data sets corresponding to a wide set of application domains.
Submitted 12 October, 2017;
originally announced October 2017.
-
Fast inversion of solar Ca II spectra
Authors:
C. Beck,
D. Prasad Choudhary,
R. Rezaei,
R. E. Louis
Abstract:
We present a fast (<< 1 s per profile) inversion code for solar Ca II lines. The code uses an archive of spectra that are synthesized prior to the inversion under the assumption of local thermodynamic equilibrium (LTE). We show that it can be successfully applied to spectrograph data or more sparsely sampled spectra from two-dimensional spectrometers. From a comparison to a non-LTE inversion of the same set of spectra, we derive a first-order non-LTE correction to the temperature stratifications derived in the LTE approach. The correction factor is close to unity up to log tau ~ -3 and increases to values of 2.5 and 4 at log tau = -6 in the quiet Sun and the umbra, respectively.
Submitted 5 November, 2014; v1 submitted 30 October, 2014;
originally announced October 2014.
-
A three-dimensional view of the thermal structure in a super-penumbral canopy
Authors:
C. Beck,
D. Prasad Choudhary,
R. Rezaei
Abstract:
We investigate the thermal topology in a super-penumbral canopy by determining the 3D thermal structure of an active region. We derive the temperature stratifications in the active region by an inversion of the Ca II IR line at 854.2 nm, assuming LTE. We trace the 3D topology of individual features located in the super-penumbral canopy, mainly radially oriented fibrils. We find that about half of the fibrils form short, arched, low-lying loops in the temperature cube. These closed loops connect from bright grains that are either in or close to the penumbra to the photosphere a few Mm away from the sunspot. They reach less than 1 Mm in height. The other half of the fibrils rise with distance from the sunspot until they leave the Ca II IR formation height. Many of the fibrils show a central dark core and two lateral brightenings as seen in line-core intensity images. The corresponding velocity image shows fibrils that are as wide as the fibrils seen in intensity without a lateral substructure. Additionally, we study one example of exceptional brightness in more detail. It belongs to a different class of structures without prominent mass flows and with a 3D topology formed by two parallel, closed loops connecting patches of opposite polarity. We present evidence that the inverse Evershed flow into the sunspot in the lower chromosphere is the consequence of siphon flows along short loops that connect photospheric foot points. The dark-cored structure of the chromospheric fibrils cannot have a convective origin because of their location above regular granulation in an optically thin atmosphere. The dark core most likely results from an opacity difference between the central axis and the lateral edges caused by the significant flow speed along the fibrils.
Submitted 6 May, 2014;
originally announced May 2014.
-
Multi-wavelength Diagnostics of the Precursor and Main phases of an M1.8 Flare on 2011 April 22
Authors:
A. K. Awasthi,
R. Jain,
P. D. Gadhiya,
M. J. Aschwanden,
W. Uddin,
A. K. Srivastava,
R. Chandra,
N. Gopalswamy,
N. Nitta,
S. Yashiro,
P. K. Manoharan,
D. P. Choudhary,
N. C. Joshi,
V. C. Dwivedi,
K. Mahalakshmi
Abstract:
We study the temporal, spatial and spectral evolution of the M1.8 flare, which occurred in NOAA AR 11195 (S17E31) on 22 April 2011, and explore the underlying physical processes during the precursors and their relation to the main phase. The study of the source morphology using the composite images in 131 Å wavelength observed by the SDO/AIA and 6-14 keV revealed a multiloop system that destabilized systematically during the precursor and main phases. In contrast, HXR emission (20-50 keV) was absent during the precursor phase, appearing only from the onset of the impulsive phase in the form of foot-points of the emitting loop(s). This study has also revealed the heated loop-top prior to the loop emission, although no accompanying foot-point sources were observed during the precursor phase. We estimate the flare plasma parameters, viz. T, EM, power-law index, and photon turn-over energy, by forward fitting RHESSI spectral observations. The energy released in the precursor phase was thermal and constituted ~1 per cent of the total energy released during the flare. A study of the morphological evolution of the filament in conjunction with synthesized T and EM maps reveals (a) partial filament eruption prior to the onset of the precursor emission, and (b) heated dense plasma over the polarity inversion line and in the vicinity of the slowly rising filament during the precursor phase. Based on the implications from multi-wavelength observations, we propose a scheme to unify the energy release during the precursor and main phase emissions, in which the precursor phase emission originated via a conduction front formed due to the partial filament eruption. Next, the heated leftover S-shaped filament underwent a slow rise and heating due to magnetic reconnection and finally erupted to produce emission during the impulsive and gradual phases.
Submitted 21 October, 2013;
originally announced October 2013.
-
He I D3 Observation of the 1984 May 22 M6.3 Solar Flare
Authors:
Chang Liu,
Yan Xu,
Na Deng,
Jeongwoo Lee,
Jifeng Zhang,
Debi Prasad Choudhary,
Haimin Wang
Abstract:
The He I D3 line has a unique response to the flare impact on the low solar atmosphere and can be a powerful diagnostic tool for energy transport processes. Using images obtained from the recently digitized films of Big Bear Solar Observatory, we report D3 observation of the M6.3 flare on 1984 May 22, which occurred in an active region with a circular magnetic polarity inversion line (PIL). The impulsive phase of the flare starts with a main elongated source that darkens in D3, inside of which bright emission kernels appear at the time of the initial small peak in hard X-rays (HXRs). These flare cores subsequently evolve into a sharp emission strand lying within the dark halo simultaneously with the main peak in HXRs, reversing the overall source contrast from -5% to 5%. The radiated energy in D3 during the main peak is estimated to be about 10^30 ergs, which is comparable to that carried by nonthermal electrons above 20 keV. Afterwards the flare proceeds along the circular PIL in the counterclockwise direction to form a dark circular ribbon in D3, which apparently mirrors the bright ribbons in Halpha and He I 10830 Å. All these ribbons last for over one hour in the late gradual phase. We suggest that the present event resembles the so-called black-light flare that is proposed based on continuum images, and that D3 darkening and brightening features herein may be due to, respectively, the thermal conduction heating and the direct precipitation of high-energy electrons.
Submitted 25 June, 2013;
originally announced June 2013.
-
A Multiwavelength Study of Eruptive Events on January 23, 2012 Associated with a Major Solar Energetic Particle Event
Authors:
N. C. Joshi,
W. Uddin,
A. K. Srivastava,
R. Chandra,
N. Gopalswamy,
P. K. Manoharan,
M. J. Aschwanden,
D. P. Choudhary,
R. Jain,
N. V. Nitta,
H. Xie,
S. Yashiro,
S. Akiyama,
P. Makela,
P. Kayshap,
A. K. Awasthi,
V. C. Dwivedi,
K. Mahalakshmi
Abstract:
We use multiwavelength data from space- and ground-based instruments to study the solar flares and coronal mass ejections (CMEs) on January 23, 2012 that were responsible for one of the largest solar energetic particle (SEP) events of solar cycle 24. The eruptions consisted of two fast CMEs (1400 km/s and 2000 km/s) and M-class flares that occurred in active region 11402 located at N28 W36. The two CMEs occurred in quick succession, so they interacted very close to the Sun. The second CME caught up with the first one at a distance of 11-12 Rsun. The CME interaction may be responsible for the elevated SEP flux and significant changes in the intensity profile of the SEP event. The compound CME resulted in a double-dip moderate geomagnetic storm (Dst = -73 nT). The two dips are due to the southward component of the interplanetary magnetic field in the shock sheath and the ICME intervals. One possible reason for the lack of a stronger geomagnetic storm may be that the ICME delivered a glancing blow to Earth.
Submitted 5 March, 2013;
originally announced March 2013.
-
Height of Shock Formation in the Solar Corona Inferred from Observations of Type II Radio Bursts and Coronal Mass Ejections
Authors:
N. Gopalswamy,
H. Xie,
P. Mäkelä,
S. Yashiro,
S. Akiyama,
W. Uddin,
A. K. Srivastava,
N. C. Joshi,
R. Chandra,
P. K. Manoharan,
K. Mahalakshmi,
V. C. Dwivedi,
R. Jain,
A. K. Awasthi,
N. V. Nitta,
M. J. Aschwanden,
D. P. Choudhary
Abstract:
Employing coronagraphic and EUV observations close to the solar surface made by the Solar Terrestrial Relations Observatory (STEREO) mission, we determined the heliocentric distance of coronal mass ejections (CMEs) at the starting time of associated metric type II bursts. We used the wave diameter and leading edge methods and measured the CME heights for a set of 32 metric type II bursts from solar cycle 24. We minimized the projection effects by making the measurements from a view that is roughly orthogonal to the direction of the ejection. We also chose image frames close to the onset times of the type II bursts, so no extrapolation was necessary. We found that the CMEs were located in the heliocentric distance range from 1.20 to 1.93 solar radii (Rs), with mean and median values of 1.43 and 1.38 Rs, respectively. We conclusively find that the shock formation can occur at heights substantially below 1.5 Rs. In a few cases, the CME height at type II onset was close to 2 Rs. In these cases, the starting frequency of the type II bursts was very low, in the range 25 to 40 MHz, which confirms that the shock can also form at larger heights. The starting frequencies of metric type II bursts have a weak correlation with the measured CME/shock heights and are consistent with the rapid decline of density with height in the inner corona.
Submitted 5 January, 2013;
originally announced January 2013.
-
Rapid Enhancement of Sheared Evershed Flow Along the Neutral Line Associated with an X6.5 Flare Observed by Hinode
Authors:
Na Deng,
Chang Liu,
Debi Prasad Choudhary,
Haimin Wang
Abstract:
We present G-band and Ca II H observations of NOAA AR 10930 obtained by Hinode/SOT on 2006 December 6 covering an X6.5 flare. The Local Correlation Tracking (LCT) technique was applied to the foreshortening-corrected G-band image series to acquire horizontal proper motions in this complex beta-gamma-delta active region. With continuous G-band data of high quality and high spatial and temporal resolution, we not only confirm the rapid decay of outer penumbrae and darkening of the central structure near the flaring neutral line, but also unambiguously detect for the first time the enhancement of the sheared Evershed flow (average horizontal flow speed increased from 330 ± 3.1 to 403 ± 4.6 m/s) along the neutral line right after the eruptive white-light flare. Post-flare Ca II H images indicate that the originally fanning out field lines at the two sides of the neutral line get connected. Since penumbral structure and Evershed flow are closely related to photospheric magnetic inclination or horizontal field strength, we interpret the rapid changes of sunspot structure and surface flow as the result of flare-induced magnetic restructuring down to the photosphere. The magnetic fields turn from fanning out to inward connection, causing the outer penumbrae to decay, while those near the flaring neutral line become more horizontal, leading to stronger Evershed flow there. The inferred enhancement of horizontal magnetic field near the neutral line is consistent with recent magnetic observations and theoretical predictions of flare-invoked photospheric magnetic field change.
Submitted 19 April, 2011;
originally announced April 2011.
-
What determines the penumbral size and Evershed flow speed?
Authors:
Na Deng,
Toshifumi Shimizu,
Debi Prasad Choudhary,
Haimin Wang
Abstract:
Using Hinode SP and G-band observations, we examined the relationship between magnetic field structure and penumbral size as well as Evershed flow speed. The latter two are positively correlated with magnetic inclination angle or horizontal field strength within 1.5 kilogauss, which is in agreement with recent magnetoconvective simulations of the Evershed effect. This work thus provides direct observational evidence supporting the magnetoconvective nature of penumbral structure and Evershed flow in the presence of a strong and inclined magnetic field.
Submitted 18 February, 2011; v1 submitted 15 February, 2011;
originally announced February 2011.
-
On the Doppler Shift and Asymmetry of Stokes Profiles of Photospheric FeI and Chromospheric MgI Lines
Authors:
Na Deng,
Debi Prasad Choudhary,
K. S. Balasubramaniam
Abstract:
We analyzed the full Stokes spectra using simultaneous measurements of the photospheric (FeI 630.15 and 630.25 nm) and chromospheric (MgI b2 517.27 nm) lines. The data were obtained with the HAO/NSO Advanced Stokes Polarimeter for a sunspot region near disc center, NOAA AR 9661. We compare the characteristics of Stokes profiles in terms of Doppler shifts and asymmetries among the three spectral lines, which helps us to better understand the chromospheric lines and the magnetic and flow fields in different magnetic regions. The main results are: (1) For the penumbral area observed by the photospheric FeI lines, Doppler velocities derived from Stokes I (Vi) are very close to those derived from linear polarization profiles (Vlp) but significantly different from those derived from Stokes V profiles (Vzc), which provides direct and strong evidence that the penumbral Evershed flows are magnetized and mainly carried by the horizontal magnetic component. (2) The rudimentary inverse Evershed effect observed by the MgI b2 line provides qualitative evidence that its formation height is around or just above the temperature minimum region. (3) Vzc and Vlp in penumbrae and Vzc in pores generally approach their Vi observed by the chromospheric MgI line, which is not the case for the photospheric FeI lines. (4) Outer penumbrae and pores show similar behavior of the Stokes V asymmetries that tend to change from positive values in the photosphere (FeI lines) to negative values in the low chromosphere (MgI line). (5) The Stokes V profiles in plage regions are highly asymmetric in the photosphere and more symmetric in the low chromosphere. (6) Strong red shifts and large asymmetries are found around the magnetic polarity inversion line within the common penumbra of the Delta spot. This study thus emphasizes the importance of spectro-polarimetry using chromospheric lines.
Submitted 17 June, 2010;
originally announced June 2010.
-
Stokes Profiles at the Narrow Magnetic Lanes of Sun Spots
Authors:
Gordon A. MacDonald,
Kemal A. Yassin,
Debi Prasad Choudhary
Abstract:
It has been previously observed that narrow lanes of transverse and longitudinal magnetic field with opposite polarity are the sites of large solar flares. We performed a comprehensive examination of the Stokes asymmetries of active region NOAA 10930. The active region was observed just before, during, and after an X-class flare, which occurred on December 13, 2006 from 02:20 to 06:18 UT. We observe a static fibril interacting with a rotating penumbra of opposite polarity in the hours prior to the flare. Above the fibril were several small sites of hot gas in the chromosphere. During and after the flare, the fibril and its corresponding flow and profiles were much less pronounced. We present a full analysis of the magnetic and plasma properties of this active region.
Submitted 20 April, 2010;
originally announced April 2010.
-
Properties of 16 Sunspots Observed with Hinode Solar Optical Telescope
Authors:
Debi Prasad Choudhary,
Gordon A. MacDonald,
Toshifumi
Abstract:
We studied 16 sunspots of different sizes and shapes using observations from the Hinode Solar Optical Telescope. The ratio of G-band to CaII H images reveals rich structures within both the umbra and penumbra of most spots. The most striking features are compact blobs at the footpoints of the penumbral fibrils on the umbral side, which show an asymmetry between the disk-center and limb sides. In this paper, we present the properties of these features using spectropolarimetry and images in the G-band, CaII, and blue filters. We discuss the results in the context of contemporary sunspot models.
Submitted 20 April, 2010;
originally announced April 2010.
-
Sunspot Bright Points
Authors:
Debi Prasad Choudhary,
Toshifumi Shimizu
Abstract:
We used flux-calibrated images from the Broad Band Filter Imager and Stokes Polarimeter data obtained with the Solar Optical Telescope onboard the Hinode spacecraft to study the properties of bright points in and around sunspots. Well-isolated bright points were selected and classified as umbral dots, peripheral umbral dots, penumbral grains, or G-band bright points depending on their location. Most of the bright points are smaller than about 150 km. The larger points are mostly associated with penumbral features. The bright points are not uniformly distributed over the umbra but are preferentially located around the penumbral boundary and in the fast-decaying parts of the umbra. The color temperatures of the bright points, derived using the continuum irradiance, are in the range of 4600 K to 6600 K, with the cooler ones located in the umbra. The temperature increases with distance from the center outward. The G-band, CN-band, and CaII H fluxes of the bright points increase continuously and nonlinearly as a function of their blue-band brightness, unlike their red and green counterparts. This is consistent with a model in which localized heating of the flux tube depletes the molecular concentration, resulting in reduced opacity that exposes deeper and hotter layers. The scatter in CaII H irradiance is higher than that in the G-band and CN-band irradiance. The light curves of the bright points show that the enhanced brightness at these locations lasts for about 15 to 60 minutes, with the least contrast for the points outside the spot. The umbral dots near the penumbral boundary are associated with elongated filamentary structures.
Submitted 14 January, 2010;
originally announced January 2010.
-
Successive Solar Flares and Coronal Mass Ejections on 2005 September 13 from NOAA AR 10808
Authors:
Chang Liu,
Jeongwoo Lee,
Marian Karlicky,
Debi Prasad Choudhary,
Na Deng,
Haimin Wang
Abstract:
We present a multiwavelength study of the 2005 September 13 eruption from NOAA 10808 that produced a total of four flares and two fast coronal mass ejections (CMEs) within 1.5 hours. Our primary attention is paid to the fact that these eruptions occurred in close succession, and that all of them were located along an S-shaped magnetic polarity inversion line (PIL) of the active region. In our analysis, (1) the disturbance created by the first flare propagated southward along the PIL to cause a major filament eruption that led to the first CME and the associated second flare underneath. (2) The first CME partially removed the overlying magnetic fields over the northern Delta spot to allow the third flare and the second CME. (3) The ribbon separation during the fourth flare would indicate reclosing of the overlying field lines opened by the second CME. It is thus concluded that this series of flares and CMEs is interrelated via magnetic reconnection between the expanding magnetic structure and the nearby magnetic fields. These results complement previous work on this event by suggesting a causal relationship among the successive eruptions.
Submitted 4 August, 2009;
originally announced August 2009.