-
Raw Text is All you Need: Knowledge-intensive Multi-turn Instruction Tuning for Large Language Model
Authors:
Xia Hou,
Qifeng Li,
Jian Yang,
Tongliang Li,
Linzheng Chai,
Xianjie Wu,
Hangyuan Ji,
Zhoujun Li,
Jixuan Nie,
Jingbo Dun,
Wenfeng Song
Abstract:
Instruction tuning as an effective technique aligns the outputs of large language models (LLMs) with human preference. But how to generate the seasonal multi-turn dialogues from raw documents for instruction tuning still requires further exploration. In this paper, we present a novel framework named R2S that leverages the CoD-Chain of Dialogue logic to guide large language models (LLMs) in generat…
▽ More
Instruction tuning as an effective technique aligns the outputs of large language models (LLMs) with human preference. But how to generate the seasonal multi-turn dialogues from raw documents for instruction tuning still requires further exploration. In this paper, we present a novel framework named R2S that leverages the CoD-Chain of Dialogue logic to guide large language models (LLMs) in generating knowledge-intensive multi-turn dialogues for instruction tuning. By integrating raw documents from both open-source datasets and domain-specific web-crawled documents into a benchmark K-BENCH, we cover diverse areas such as Wikipedia (English), Science (Chinese), and Artifacts (Chinese). Our approach first decides the logic flow of the current dialogue and then prompts LLMs to produce key phrases for sourcing relevant response content. This methodology enables the creation of the G I NSTRUCT instruction dataset, retaining raw document knowledge within dialoguestyle interactions. Utilizing this dataset, we fine-tune GLLM, a model designed to transform raw documents into structured multi-turn dialogues, thereby injecting comprehensive domain knowledge into the SFT model for enhanced instruction tuning. This work signifies a stride towards refining the adaptability and effectiveness of LLMs in processing and generating more accurate, contextually nuanced responses across various fields.
△ Less
Submitted 3 July, 2024;
originally announced July 2024.
-
The triggering process of an X-class solar flare on a small quadrupolar active region
Authors:
Qiao Song,
Jing-Song Wang,
Xiaoxin Zhang,
Hechao Chen,
Shuhong Yang,
Zhenyong Hou,
Yijun Hou,
Qian Ye,
Peng Zhang,
Xiuqing Hu,
Jinping Dun,
Weiguo Zong,
Xianyong Bai,
Bo Chen,
Lingping He,
Kefei Song
Abstract:
The occurrence of X-class solar flares and their potential impact on the space weather often receive great attention than other flares. But predicting when and where an X-class flare will occur is still a challenge. With the multi-wavelength observation from the Solar Dynamics Observatory and FengYun- 3E satellite, we investigate the triggering of a GOES X1.0 flare occurring in the NOAA active reg…
▽ More
The occurrence of X-class solar flares and their potential impact on the space weather often receive great attention than other flares. But predicting when and where an X-class flare will occur is still a challenge. With the multi-wavelength observation from the Solar Dynamics Observatory and FengYun- 3E satellite, we investigate the triggering of a GOES X1.0 flare occurring in the NOAA active region (AR) 12887. Our results show that this unique X-class flare is bred in a relatively small but complex quadrupolar AR. Before the X-class flare, two filaments (F1 and F2) exist below a null-point topology of the quadrupolar AR. Magnetic field extrapolation and observation reveal that F1 and F2 correspond to two magnetic flux ropes with the same chirality and their adjacent feet rooted at nonconjugated opposite polarities, respectively. Interestingly, these two polarities collide rapidly, accompanied by photospheric magnetic flux emergence, cancellation and shear motion in the AR center. Above this site, F1 and F2 subsequently intersect and merge to a longer filament (F3) via a tether-cutting-like reconnection process. As a result, the F3 rises and erupts, involving the large-scale arcades overlying filament and the quadrupolar magnetic field above the AR, and eventually leads to the eruption of the X-class flare with a quasi-X-shaped flare ribbon and a coronal mass ejection. It suggests that the rapid collision of nonconjugated opposite polarities provides a key condition for the triggering of this X-class flare, and also provides a featured case for flare trigger mechanism and space weather forecasting.
△ Less
Submitted 17 September, 2023;
originally announced September 2023.
-
A new post-hoc flat field measurement method for the Solar X-ray and Extreme Ultraviolet Imager onboard the Fengyun-3E satellite
Authors:
Qiao Song,
Xianyong Bai,
Bo Chen,
Xiuqing Hu,
Yajie Chen,
Zhenyong Hou,
Xiaofan Zhang,
Lingping He,
Kefei Song,
Peng Zhang,
Jing-Song Wang,
Xiaoxin Zhang,
Weiguo Zong,
Jinping Dun,
Hui Tian,
Yuanyong Deng
Abstract:
The extreme ultraviolet (EUV) observations are widely used in solar activity research and space weather forecasting since they can observe both the solar eruptions and the source regions of the solar wind. Flat field processing is indispensable to remove the instrumental non-uniformity of a solar EUV imager in producing high-quality scientific data from original observed data. Fengyun-3E (FY-3E) i…
▽ More
The extreme ultraviolet (EUV) observations are widely used in solar activity research and space weather forecasting since they can observe both the solar eruptions and the source regions of the solar wind. Flat field processing is indispensable to remove the instrumental non-uniformity of a solar EUV imager in producing high-quality scientific data from original observed data. Fengyun-3E (FY-3E) is a meteorological satellite operated in Sun-synchronous orbit, and the routine EUV imaging data from the Solar X-ray and Extreme Ultraviolet Imager (X-EUVI) onboard FY-3E has the characteristics of concentric rotation. Taking advantage of the concentric rotation, we propose a post-hoc flat field measurement method for its EUV 195 channel in this paper. This method removes small-scale and time-varying component of the coronal activities by taking the median value for each pixel along the time axis of a concentric rotation data cube, and then derives large-scale and invariable component of the quiet coronal radiation, and finally generates a flat field image. Analysis shows that our method is able to measure the instrumental spot-like non-uniformity possibly caused by contamination on the detector, which mostly disappears after the in-orbit self-cleaning process. It can also measure the quasi-periodic grid-like non-uniformity, possibly from the obscuration of the support mesh on the rear filter. After flat field correction, these instrumental non-uniformities from the original data are effectively removed. X-EUVI 195 data after dark and flat field corrections are consistent with the 193 channel data from SDO/AIA, verifying the suitability of the method. Our method is not only suitable for FY-3E/X-EUVI but also a candidate method for the flat field measurement of future solar EUV telescopes.
△ Less
Submitted 5 July, 2022;
originally announced July 2022.
-
Three-dimensional Propagation of the Global EUV Wave associated with a solar eruption on 2021 October 28
Authors:
Zhenyong Hou,
Hui Tian,
Jing-Song Wang,
Xiaoxin Zhang,
Qiao Song,
Ruisheng Zheng,
Hechao Chen,
Bo Chen,
Xianyong Bai,
Yajie Chen,
Lingping He,
Kefei Song,
Peng Zhang,
Xiuqing Hu,
Jinping Dun,
Weiguo Zong,
Yongliang Song,
Yu Xu,
Guangyu Tan
Abstract:
We present a case study for the global extreme ultraviolet (EUV) wave and its chromospheric counterpart `Moreton-Ramsey wave' associated with the second X-class flare in Solar Cycle 25 and a halo coronal mass ejection (CME). The EUV wave was observed in the H$α$ and EUV passbands with different characteristic temperatures. In the 171 Å and 193/195 Å images, the wave propagates circularly with an i…
▽ More
We present a case study for the global extreme ultraviolet (EUV) wave and its chromospheric counterpart `Moreton-Ramsey wave' associated with the second X-class flare in Solar Cycle 25 and a halo coronal mass ejection (CME). The EUV wave was observed in the H$α$ and EUV passbands with different characteristic temperatures. In the 171 Å and 193/195 Å images, the wave propagates circularly with an initial velocity of 600-720 km s$^{-1}$ and a deceleration of 110-320 m s$^{-2}$. The local coronal plasma is heated from log(T/K)=5.9 to log(T/K)=6.2 during the passage of the wavefront. The H$α$ and 304 Å images also reveal signatures of wave propagation with a velocity of 310-540 km s$^{-1}$. With multi-wavelength and dual-perspective observations, we found that the wavefront likely propagates forwardly inclined to the solar surface with a tilt angle of ~53.2$^{\circ}$. Our results suggest that this EUV wave is a fast-mode magnetohydrodynamic wave or shock driven by the expansion of the associated CME, whose wavefront is likely a dome-shaped structure that could impact the upper chromosphere, transition region and corona.
△ Less
Submitted 25 February, 2022;
originally announced February 2022.