-
Measurement and Modeling of Polarized Atmosphere at the South Pole with SPT-3G
Authors:
A. Coerver,
J. A. Zebrowski,
S. Takakura,
W. L. Holzapfel,
P. A. R. Ade,
A. J. Anderson,
Z. Ahmed,
B. Ansarinejad,
M. Archipley,
L. Balkenhol,
D. Barron,
K. Benabed,
A. N. Bender,
B. A. Benson,
F. Bianchini,
L. E. Bleem,
F. R. Bouchet,
L. Bryant,
E. Camphuis,
J. E. Carlstrom,
T. W. Cecil,
C. L. Chang,
P. Chaubal,
P. M. Chichura,
A. Chokshi
, et al. (80 additional authors not shown)
Abstract:
We present the detection and characterization of fluctuations in linearly polarized emission from the atmosphere above the South Pole. These measurements make use of Austral winter survey data from the SPT-3G receiver on the South Pole Telescope in three frequency bands centered at 95, 150, and 220 GHz. We use the cross-correlation between detectors to produce an unbiased estimate of the power in…
▽ More
We present the detection and characterization of fluctuations in linearly polarized emission from the atmosphere above the South Pole. These measurements make use of Austral winter survey data from the SPT-3G receiver on the South Pole Telescope in three frequency bands centered at 95, 150, and 220 GHz. We use the cross-correlation between detectors to produce an unbiased estimate of the power in Stokes I, Q, and U parameters on large angular scales. Our results are consistent with the polarized signal being produced by the combination of Rayleigh scattering of thermal radiation from the ground and thermal emission from a population of horizontally aligned ice crystals with an anisotropic distribution described by Kolmogorov turbulence. The signal is most significant at large angular scales, high observing frequency, and low elevation angle. Polarized atmospheric emission has the potential to significantly impact observations on the large angular scales being targeted by searches for inflationary B-mode CMB polarization. We present the distribution of measured angular power spectrum amplitudes in Stokes Q and I for 4 years of winter observations, which can be used to simulate the impact of atmospheric polarization and intensity fluctuations at the South Pole on a specified experiment and observation strategy. For the SPT-3G data, downweighting the small fraction of significantly contaminated observations is an effective mitigation strategy. In addition, we present a strategy for further improving sensitivity on large angular scales where maps made in the 220 GHz band are used to measure and subtract the polarized atmosphere signal from the 150 GHz band maps. In observations with the SPT-3G instrument at the South Pole, the polarized atmospheric signal is a well-understood and sub-dominant contribution to the measured noise after implementing the mitigation strategies described here.
△ Less
Submitted 30 July, 2024;
originally announced July 2024.
-
Word Embedding Dimension Reduction via Weakly-Supervised Feature Selection
Authors:
Jintang Xue,
Yun-Cheng Wang,
Chengwei Wei,
C. -C. Jay Kuo
Abstract:
As a fundamental task in natural language processing, word embedding converts each word into a representation in a vector space. A challenge with word embedding is that as the vocabulary grows, the vector space's dimension increases and it can lead to a vast model size. Storing and processing word vectors are resource-demanding, especially for mobile edge-devices applications. This paper explores…
▽ More
As a fundamental task in natural language processing, word embedding converts each word into a representation in a vector space. A challenge with word embedding is that as the vocabulary grows, the vector space's dimension increases and it can lead to a vast model size. Storing and processing word vectors are resource-demanding, especially for mobile edge-devices applications. This paper explores word embedding dimension reduction. To balance computational costs and performance, we propose an efficient and effective weakly-supervised feature selection method, named WordFS. It has two variants, each utilizing novel criteria for feature selection. Experiments conducted on various tasks (e.g., word and sentence similarity and binary and multi-class classification) indicate that the proposed WordFS model outperforms other dimension reduction methods at lower computational costs.
△ Less
Submitted 17 July, 2024;
originally announced July 2024.
-
GSBIQA: Green Saliency-guided Blind Image Quality Assessment Method
Authors:
Zhanxuan Mei,
Yun-Cheng Wang,
C. -C. Jay Kuo
Abstract:
Blind Image Quality Assessment (BIQA) is an essential task that estimates the perceptual quality of images without reference. While many BIQA methods employ deep neural networks (DNNs) and incorporate saliency detectors to enhance performance, their large model sizes limit deployment on resource-constrained devices. To address this challenge, we introduce a novel and non-deep-learning BIQA method…
▽ More
Blind Image Quality Assessment (BIQA) is an essential task that estimates the perceptual quality of images without reference. While many BIQA methods employ deep neural networks (DNNs) and incorporate saliency detectors to enhance performance, their large model sizes limit deployment on resource-constrained devices. To address this challenge, we introduce a novel and non-deep-learning BIQA method with a lightweight saliency detection module, called Green Saliency-guided Blind Image Quality Assessment (GSBIQA). It is characterized by its minimal model size, reduced computational demands, and robust performance. Experimental results show that the performance of GSBIQA is comparable with state-of-the-art DL-based methods with significantly lower resource requirements.
△ Less
Submitted 7 July, 2024;
originally announced July 2024.
-
GreenCOD: A Green Camouflaged Object Detection Method
Authors:
Hong-Shuo Chen,
Yao Zhu,
Suya You,
Azad M. Madni,
C. -C. Jay Kuo
Abstract:
We introduce GreenCOD, a green method for detecting camouflaged objects, distinct in its avoidance of backpropagation techniques. GreenCOD leverages gradient boosting and deep features extracted from pre-trained Deep Neural Networks (DNNs). Traditional camouflaged object detection (COD) approaches often rely on complex deep neural network architectures, seeking performance improvements through bac…
▽ More
We introduce GreenCOD, a green method for detecting camouflaged objects, distinct in its avoidance of backpropagation techniques. GreenCOD leverages gradient boosting and deep features extracted from pre-trained Deep Neural Networks (DNNs). Traditional camouflaged object detection (COD) approaches often rely on complex deep neural network architectures, seeking performance improvements through backpropagation-based fine-tuning. However, such methods are typically computationally demanding and exhibit only marginal performance variations across different models. This raises the question of whether effective training can be achieved without backpropagation. Addressing this, our work proposes a new paradigm that utilizes gradient boosting for COD. This approach significantly simplifies the model design, resulting in a system that requires fewer parameters and operations and maintains high performance compared to state-of-the-art deep learning models. Remarkably, our models are trained without backpropagation and achieve the best performance with fewer than 20G Multiply-Accumulate Operations (MACs). This new, more efficient paradigm opens avenues for further exploration in green, backpropagation-free model training.
△ Less
Submitted 25 May, 2024;
originally announced May 2024.
-
Physical properties and electronic structure of the two-gap superconductor V$_{2}$Ga$_{5}$
Authors:
P. -Y. Cheng,
Mohamed Oudah,
T. -L. Hung,
C. -E. Hsu,
C. -C. Chang,
J. -Y. Haung,
T. -C. Liu,
C. -M. Cheng,
M. -N. Ou,
W. -T. Chen,
L. Z. Deng,
C. -C. Lee,
Y. -Y. Chen,
C. -N. Kuo,
C. -S. Lue,
Janna Machts,
Kenji M. Kojima,
Alannah M. Hallas,
C. -L. Huang
Abstract:
We present a thorough investigation of the physical properties and superconductivity of the binary intermetallic V2Ga5. Electrical resistivity and specific heat measurements show that V2Ga5 enters its superconducting state below Tsc = 3.5 K, with a critical field of Hc2,perp c(Hc2,para c) = 6.5(4.1) kOe. With H perp c, the peak effect was observed in resistivity measurements, indicating the ultrah…
▽ More
We present a thorough investigation of the physical properties and superconductivity of the binary intermetallic V2Ga5. Electrical resistivity and specific heat measurements show that V2Ga5 enters its superconducting state below Tsc = 3.5 K, with a critical field of Hc2,perp c(Hc2,para c) = 6.5(4.1) kOe. With H perp c, the peak effect was observed in resistivity measurements, indicating the ultrahigh quality of the single crystal studied. The resistivity measurements under high pressure reveal that the Tsc is suppressed linearly with pressure and reaches absolute zero around 20 GPa. Specific heat and muon spin relaxation measurements both indicate that the two-gap s-wave model best describes the superconductivity of V2Ga5. The spectra obtained from angle-resolved photoemission spectroscopy measurements suggest that two superconducting gaps open at the Fermi surface around the Z and Γ points. These results are verified by first-principles band structure calculations. We therefore conclude that V2Ga5 is a phonon-mediated two-gap s-wave superconductor
△ Less
Submitted 6 May, 2024;
originally announced May 2024.
-
Mass calibration of DES Year-3 clusters via SPT-3G CMB cluster lensing
Authors:
B. Ansarinejad,
S. Raghunathan,
T. M. C. Abbott,
P. A. R. Ade,
M. Aguena,
O. Alves,
A. J. Anderson,
F. Andrade-Oliveira,
M. Archipley,
L. Balkenhol,
K. Benabed,
A. N. Bender,
B. A. Benson,
E. Bertin,
F. Bianchini,
L. E. Bleem,
S. Bocquet,
F. R. Bouchet,
D. Brooks,
L. Bryant,
D. L. Burke,
E. Camphuis,
J. E. Carlstrom,
A. Carnero Rosell,
J. Carretero
, et al. (120 additional authors not shown)
Abstract:
We measure the stacked lensing signal in the direction of galaxy clusters in the Dark Energy Survey Year 3 (DES Y3) redMaPPer sample, using cosmic microwave background (CMB) temperature data from SPT-3G, the third-generation CMB camera on the South Pole Telescope (SPT). We estimate the lensing signal using temperature maps constructed from the initial 2 years of data from the SPT-3G 'Main' survey,…
▽ More
We measure the stacked lensing signal in the direction of galaxy clusters in the Dark Energy Survey Year 3 (DES Y3) redMaPPer sample, using cosmic microwave background (CMB) temperature data from SPT-3G, the third-generation CMB camera on the South Pole Telescope (SPT). We estimate the lensing signal using temperature maps constructed from the initial 2 years of data from the SPT-3G 'Main' survey, covering 1500 deg$^2$ of the Southern sky. We then use this signal as a proxy for the mean cluster mass of the DES sample. In this work, we employ three versions of the redMaPPer catalogue: a Flux-Limited sample containing 8865 clusters, a Volume-Limited sample with 5391 clusters, and a Volume&Redshift-Limited sample with 4450 clusters. For the three samples, we find the mean cluster masses to be ${M}_{200{\rm{m}}}=1.66\pm0.13$ [stat.]$\pm0.03$ [sys.], $1.97\pm0.18$ [stat.]$\pm0.05$ [sys.], and $2.11\pm0.20$ [stat.]$\pm0.05$ [sys.]$\times{10}^{14}\ {\rm{M}}_{\odot }$, respectively. This is a factor of $\sim2$ improvement relative to the precision of measurements with previous generations of SPT surveys and the most constraining cluster mass measurements using CMB cluster lensing to date. Overall, we find no significant tensions between our results and masses given by redMaPPer mass-richness scaling relations of previous works, which were calibrated using CMB cluster lensing, optical weak lensing, and velocity dispersion measurements from various combinations of DES, SDSS and Planck data. We then divide our sample into 3 redshift and 3 richness bins, finding no significant tensions with optical weak-lensing calibrated masses in these bins. We forecast a $5.7\%$ constraint on the mean cluster mass of the DES Y3 sample with the complete SPT-3G surveys when using both temperature and polarization data and including an additional $\sim1400$ deg$^2$ of observations from the 'Extended' SPT-3G survey.
△ Less
Submitted 12 June, 2024; v1 submitted 2 April, 2024;
originally announced April 2024.
-
GreenSaliency: A Lightweight and Efficient Image Saliency Detection Method
Authors:
Zhanxuan Mei,
Yun-Cheng Wang,
C. -C. Jay Kuo
Abstract:
Image saliency detection is crucial in understanding human gaze patterns from visual stimuli. The escalating demand for research in image saliency detection is driven by the growing necessity to incorporate such techniques into various computer vision tasks and to understand human visual systems. Many existing image saliency detection methods rely on deep neural networks (DNNs) to achieve good per…
▽ More
Image saliency detection is crucial in understanding human gaze patterns from visual stimuli. The escalating demand for research in image saliency detection is driven by the growing necessity to incorporate such techniques into various computer vision tasks and to understand human visual systems. Many existing image saliency detection methods rely on deep neural networks (DNNs) to achieve good performance. However, the high computational complexity associated with these approaches impedes their integration with other modules or deployment on resource-constrained platforms, such as mobile devices. To address this need, we propose a novel image saliency detection method named GreenSaliency, which has a small model size, minimal carbon footprint, and low computational complexity. GreenSaliency can be a competitive alternative to the existing deep-learning-based (DL-based) image saliency detection methods with limited computation resources. GreenSaliency comprises two primary steps: 1) multi-layer hybrid feature extraction and 2) multi-path saliency prediction. Experimental results demonstrate that GreenSaliency achieves comparable performance to the state-of-the-art DL-based methods while possessing a considerably smaller model size and significantly reduced computational complexity.
△ Less
Submitted 30 March, 2024;
originally announced April 2024.
-
Testing the $\mathbfΛ$CDM Cosmological Model with Forthcoming Measurements of the Cosmic Microwave Background with SPT-3G
Authors:
K. Prabhu,
S. Raghunathan,
M. Millea,
G. Lynch,
P. A. R. Ade,
E. Anderes,
A. J. Anderson,
B. Ansarinejad,
M. Archipley,
L. Balkenhol,
K. Benabed,
A. N. Bender,
B. A. Benson,
F. Bianchini,
L. E. Bleem,
F. R. Bouchet,
L. Bryant,
E. Camphuis,
J. E. Carlstrom,
T. W. Cecil,
C. L. Chang,
P. Chaubal,
P. M. Chichura,
T. -L. Chou,
A. Coerver
, et al. (76 additional authors not shown)
Abstract:
We forecast constraints on cosmological parameters enabled by three surveys conducted with SPT-3G, the third-generation camera on the South Pole Telescope. The surveys cover separate regions of 1500, 2650, and 6000 ${\rm deg}^{2}$ to different depths, in total observing 25% of the sky. These regions will be measured to white noise levels of roughly 2.5, 9, and 12 $μ{\rm K-arcmin}$, respectively, i…
▽ More
We forecast constraints on cosmological parameters enabled by three surveys conducted with SPT-3G, the third-generation camera on the South Pole Telescope. The surveys cover separate regions of 1500, 2650, and 6000 ${\rm deg}^{2}$ to different depths, in total observing 25% of the sky. These regions will be measured to white noise levels of roughly 2.5, 9, and 12 $μ{\rm K-arcmin}$, respectively, in CMB temperature units at 150 GHz by the end of 2024. The survey also includes measurements at 95 and 220 GHz, which have noise levels a factor of ~1.2 and 3.5 times higher than 150 GHz, respectively, with each band having a polarization noise level ~$\sqrt{\text{2}}$ times higher than the temperature noise. We use a novel approach to obtain the covariance matrices for jointly and optimally estimated gravitational lensing potential bandpowers and unlensed CMB temperature and polarization bandpowers. We demonstrate the ability to test the $Λ{\rm CDM}$ model via the consistency of cosmological parameters constrained independently from SPT-3G and Planck data, and consider the improvement in constraints on $Λ{\rm CDM}$ extension parameters from a joint analysis of SPT-3G and Planck data. The $Λ{\rm CDM}$ cosmological parameters are typically constrained with uncertainties up to ~2 times smaller with SPT-3G data, compared to Planck, with the two data sets measuring significantly different angular scales and polarization levels, providing additional tests of the standard cosmological model.
△ Less
Submitted 5 July, 2024; v1 submitted 26 March, 2024;
originally announced March 2024.
-
PSHop: A Lightweight Feed-Forward Method for 3D Prostate Gland Segmentation
Authors:
Yijing Yang,
Vasileios Magoulianitis,
Jiaxin Yang,
Jintang Xue,
Masatomo Kaneko,
Giovanni Cacciamani,
Andre Abreu,
Vinay Duddalwar,
C. -C. Jay Kuo,
Inderbir S. Gill,
Chrysostomos Nikias
Abstract:
Automatic prostate segmentation is an important step in computer-aided diagnosis of prostate cancer and treatment planning. Existing methods of prostate segmentation are based on deep learning models which have a large size and lack of transparency which is essential for physicians. In this paper, a new data-driven 3D prostate segmentation method on MRI is proposed, named PSHop. Different from dee…
▽ More
Automatic prostate segmentation is an important step in computer-aided diagnosis of prostate cancer and treatment planning. Existing methods of prostate segmentation are based on deep learning models which have a large size and lack of transparency which is essential for physicians. In this paper, a new data-driven 3D prostate segmentation method on MRI is proposed, named PSHop. Different from deep learning based methods, the core methodology of PSHop is a feed-forward encoder-decoder system based on successive subspace learning (SSL). It consists of two modules: 1) encoder: fine to coarse unsupervised representation learning with cascaded VoxelHop units, 2) decoder: coarse to fine segmentation prediction with voxel-wise classification and local refinement. Experiments are conducted on the publicly available ISBI-2013 dataset, as well as on a larger private one. Experimental analysis shows that our proposed PSHop is effective, robust and lightweight in the tasks of prostate gland and zonal segmentation, achieving a Dice Similarity Coefficient (DSC) of 0.873 for the gland segmentation task. PSHop achieves a competitive performance comparatively to other deep learning methods, while keeping the model size and inference complexity an order of magnitude smaller.
△ Less
Submitted 23 March, 2024;
originally announced March 2024.
-
PCa-RadHop: A Transparent and Lightweight Feed-forward Method for Clinically Significant Prostate Cancer Segmentation
Authors:
Vasileios Magoulianitis,
Jiaxin Yang,
Yijing Yang,
Jintang Xue,
Masatomo Kaneko,
Giovanni Cacciamani,
Andre Abreu,
Vinay Duddalwar,
C. -C. Jay Kuo,
Inderbir S. Gill,
Chrysostomos Nikias
Abstract:
Prostate Cancer is one of the most frequently occurring cancers in men, with a low survival rate if not early diagnosed. PI-RADS reading has a high false positive rate, thus increasing the diagnostic incurred costs and patient discomfort. Deep learning (DL) models achieve a high segmentation performance, although require a large model size and complexity. Also, DL models lack of feature interpreta…
▽ More
Prostate Cancer is one of the most frequently occurring cancers in men, with a low survival rate if not early diagnosed. PI-RADS reading has a high false positive rate, thus increasing the diagnostic incurred costs and patient discomfort. Deep learning (DL) models achieve a high segmentation performance, although require a large model size and complexity. Also, DL models lack of feature interpretability and are perceived as ``black-boxes" in the medical field. PCa-RadHop pipeline is proposed in this work, aiming to provide a more transparent feature extraction process using a linear model. It adopts the recently introduced Green Learning (GL) paradigm, which offers a small model size and low complexity. PCa-RadHop consists of two stages: Stage-1 extracts data-driven radiomics features from the bi-parametric Magnetic Resonance Imaging (bp-MRI) input and predicts an initial heatmap. To reduce the false positive rate, a subsequent stage-2 is introduced to refine the predictions by including more contextual information and radiomics features from each already detected Region of Interest (ROI). Experiments on the largest publicly available dataset, PI-CAI, show a competitive performance standing of the proposed method among other deep DL models, achieving an area under the curve (AUC) of 0.807 among a cohort of 1,000 patients. Moreover, PCa-RadHop maintains orders of magnitude smaller model size and complexity.
△ Less
Submitted 23 March, 2024;
originally announced March 2024.
-
First Constraints on the Epoch of Reionization Using the non-Gaussianity of the Kinematic Sunyaev-Zel{'}dovich Effect from the South Pole Telescope and {\it Herschel}-SPIRE Observations
Authors:
S. Raghunathan,
P. A. R. Ade,
A. J. Anderson,
B. Ansarinejad,
M. Archipley,
J. E. Austermann,
L. Balkenhol,
J. A. Beall,
K. Benabed,
A. N. Bender,
B. A. Benson,
F. Bianchini,
L. E. Bleem,
J. Bock,
F. R. Bouchet,
L. Bryant,
E. Camphuis,
J. E. Carlstrom,
T. W. Cecil,
C. L. Chang,
P. Chaubal,
H. C. Chiang,
P. M. Chichura,
T. -L. Chou,
R. Citron
, et al. (97 additional authors not shown)
Abstract:
We report results from an analysis aimed at detecting the trispectrum of the kinematic Sunyaev-Zel{'}dovich (kSZ) effect by combining data from the South Pole Telescope (SPT) and {\it Herschel}-SPIRE experiments over a 100 ${\rm deg}^{2}$ field. The SPT observations combine data from the previous and current surveys, namely SPTpol and SPT-3G, to achieve depths of 4.5, 3, and 16 $μ{\rm K-arcmin}$ i…
▽ More
We report results from an analysis aimed at detecting the trispectrum of the kinematic Sunyaev-Zel{'}dovich (kSZ) effect by combining data from the South Pole Telescope (SPT) and {\it Herschel}-SPIRE experiments over a 100 ${\rm deg}^{2}$ field. The SPT observations combine data from the previous and current surveys, namely SPTpol and SPT-3G, to achieve depths of 4.5, 3, and 16 $μ{\rm K-arcmin}$ in bands centered at 95, 150, and 220 GHz. For SPIRE, we include data from the 600 and 857 GHz bands. We reconstruct the velocity-induced large-scale correlation of the small-scale kSZ signal with a quadratic estimator that uses two cosmic microwave background (CMB) temperature maps, constructed by optimally combining data from all the frequency bands. We reject the null hypothesis of a zero trispectrum at $10.3σ$ level. However, the measured trispectrum contains contributions from both the kSZ and other undesired components, such as CMB lensing and astrophysical foregrounds, with kSZ being sub-dominant. We use the \textsc{Agora} simulations to estimate the expected signal from CMB lensing and astrophysical foregrounds. After accounting for the contributions from CMB lensing and foreground signals, we do not detect an excess kSZ-only trispectrum and use this non-detection to set constraints on reionization. By applying a prior based on observations of the Gunn-Peterson trough, we obtain an upper limit on the duration of reionization of $Δz_{\rm re, 50} < 4.5$ (95\% C.L). We find these constraints are fairly robust to foregrounds assumptions. This trispectrum measurement is independent of, but consistent with, {\it Planck}'s optical depth measurement. This result is the first constraint on the epoch of reionization using the non-Gaussian nature of the kSZ signal.
△ Less
Submitted 4 March, 2024;
originally announced March 2024.
-
Treatment-wise Glioblastoma Survival Inference with Multi-parametric Preoperative MRI
Authors:
Xiaofeng Liu,
Nadya Shusharina,
Helen A Shih,
C. -C. Jay Kuo,
Georges El Fakhri,
Jonghye Woo
Abstract:
In this work, we aim to predict the survival time (ST) of glioblastoma (GBM) patients undergoing different treatments based on preoperative magnetic resonance (MR) scans. The personalized and precise treatment planning can be achieved by comparing the ST of different treatments. It is well established that both the current status of the patient (as represented by the MR scans) and the choice of tr…
▽ More
In this work, we aim to predict the survival time (ST) of glioblastoma (GBM) patients undergoing different treatments based on preoperative magnetic resonance (MR) scans. The personalized and precise treatment planning can be achieved by comparing the ST of different treatments. It is well established that both the current status of the patient (as represented by the MR scans) and the choice of treatment are the cause of ST. While previous related MR-based glioblastoma ST studies have focused only on the direct mapping of MR scans to ST, they have not included the underlying causal relationship between treatments and ST. To address this limitation, we propose a treatment-conditioned regression model for glioblastoma ST that incorporates treatment information in addition to MR scans. Our approach allows us to effectively utilize the data from all of the treatments in a unified manner, rather than having to train separate models for each of the treatments. Furthermore, treatment can be effectively injected into each convolutional layer through the adaptive instance normalization we employ. We evaluate our framework on the BraTS20 ST prediction task. Three treatment options are considered: Gross Total Resection (GTR), Subtotal Resection (STR), and no resection. The evaluation results demonstrate the effectiveness of injecting the treatment for estimating GBM survival.
△ Less
Submitted 10 February, 2024;
originally announced February 2024.
-
Ordered magnetic fields around the 3C 84 central black hole
Authors:
G. F. Paraschos,
J. -Y. Kim,
M. Wielgus,
J. Röder,
T. P. Krichbaum,
E. Ros,
I. Agudo,
I. Myserlis,
M. Moscibrodzka,
E. Traianou,
J. A. Zensus,
L. Blackburn,
C. -K. Chan,
S. Issaoun,
M. Janssen,
M. D. Johnson,
V. L. Fish,
K. Akiyama,
A. Alberdi,
W. Alef,
J. C. Algaba,
R. Anantua,
K. Asada,
R. Azulay,
U. Bach
, et al. (258 additional authors not shown)
Abstract:
3C84 is a nearby radio source with a complex total intensity structure, showing linear polarisation and spectral patterns. A detailed investigation of the central engine region necessitates the use of VLBI above the hitherto available maximum frequency of 86GHz. Using ultrahigh resolution VLBI observations at the highest available frequency of 228GHz, we aim to directly detect compact structures a…
▽ More
3C84 is a nearby radio source with a complex total intensity structure, showing linear polarisation and spectral patterns. A detailed investigation of the central engine region necessitates the use of VLBI above the hitherto available maximum frequency of 86GHz. Using ultrahigh resolution VLBI observations at the highest available frequency of 228GHz, we aim to directly detect compact structures and understand the physical conditions in the compact region of 3C84. We used EHT 228GHz observations and, given the limited (u,v)-coverage, applied geometric model fitting to the data. We also employed quasi-simultaneously observed, multi-frequency VLBI data for the source in order to carry out a comprehensive analysis of the core structure. We report the detection of a highly ordered, strong magnetic field around the central, SMBH of 3C84. The brightness temperature analysis suggests that the system is in equipartition. We determined a turnover frequency of $ν_m=(113\pm4)$GHz, a corresponding synchrotron self-absorbed magnetic field of $B_{SSA}=(2.9\pm1.6)$G, and an equipartition magnetic field of $B_{eq}=(5.2\pm0.6)$G. Three components are resolved with the highest fractional polarisation detected for this object ($m_\textrm{net}=(17.0\pm3.9)$%). The positions of the components are compatible with those seen in low-frequency VLBI observations since 2017-2018. We report a steeply negative slope of the spectrum at 228GHz. We used these findings to test models of jet formation, propagation, and Faraday rotation in 3C84. The findings of our investigation into different flow geometries and black hole spins support an advection-dominated accretion flow in a magnetically arrested state around a rapidly rotating supermassive black hole as a model of the jet-launching system in the core of 3C84. However, systematic uncertainties due to the limited (u,v)-coverage, however, cannot be ignored.
△ Less
Submitted 1 February, 2024;
originally announced February 2024.
-
Flaring Stars in a Non-targeted mm-wave Survey with SPT-3G
Authors:
C. Tandoi,
S. Guns,
A. Foster,
P. A. R. Ade,
A. J. Anderson,
B. Ansarinejad,
M. Archipley,
L. Balkenhol,
K. Benabed,
A. N. Bender,
B. A. Benson,
F. Bianchini,
L. E. Bleem,
F. R. Bouchet,
L. Bryant,
E. Camphuis,
J. E. Carlstrom,
T. W. Cecil,
C. L. Chang,
P. Chaubal,
P. M. Chichura,
T. -L. Chou,
A. Coerver,
T. M. Crawford,
A. Cukierman
, et al. (74 additional authors not shown)
Abstract:
We present a flare star catalog from four years of non-targeted millimeter-wave survey data from the South Pole Telescope (SPT). The data were taken with the SPT-3G camera and cover a 1500-square-degree region of the sky from $20^{h}40^{m}0^{s}$ to $3^{h}20^{m}0^{s}$ in right ascension and $-42^{\circ}$ to $-70^{\circ}$ in declination. This region was observed on a nearly daily cadence from 2019-2…
▽ More
We present a flare star catalog from four years of non-targeted millimeter-wave survey data from the South Pole Telescope (SPT). The data were taken with the SPT-3G camera and cover a 1500-square-degree region of the sky from $20^{h}40^{m}0^{s}$ to $3^{h}20^{m}0^{s}$ in right ascension and $-42^{\circ}$ to $-70^{\circ}$ in declination. This region was observed on a nearly daily cadence from 2019-2022 and chosen to avoid the plane of the galaxy. A short-duration transient search of this survey yields 111 flaring events from 66 stars, increasing the number of both flaring events and detected flare stars by an order of magnitude from the previous SPT-3G data release. We provide cross-matching to Gaia DR3, as well as matches to X-ray point sources found in the second ROSAT all-sky survey. We have detected flaring stars across the main sequence, from early-type A stars to M dwarfs, as well as a large population of evolved stars. These stars are mostly nearby, spanning 10 to 1000 parsecs in distance. Most of the flare spectral indices are constant or gently rising as a function of frequency at 95/150/220 GHz. The timescale of these events can range from minutes to hours, and the peak $νL_ν$ luminosities range from $10^{27}$ to $10^{31}$ erg s$^{-1}$ in the SPT-3G frequency bands.
△ Less
Submitted 24 January, 2024;
originally announced January 2024.
-
GWPT: A Green Word-Embedding-based POS Tagger
Authors:
Chengwei Wei,
Runqi Pang,
C. -C. Jay Kuo
Abstract:
As a fundamental tool for natural language processing (NLP), the part-of-speech (POS) tagger assigns the POS label to each word in a sentence. A novel lightweight POS tagger based on word embeddings is proposed and named GWPT (green word-embedding-based POS tagger) in this work. Following the green learning (GL) methodology, GWPT contains three modules in cascade: 1) representation learning, 2) fe…
▽ More
As a fundamental tool for natural language processing (NLP), the part-of-speech (POS) tagger assigns the POS label to each word in a sentence. A novel lightweight POS tagger based on word embeddings is proposed and named GWPT (green word-embedding-based POS tagger) in this work. Following the green learning (GL) methodology, GWPT contains three modules in cascade: 1) representation learning, 2) feature learning, and 3) decision learning modules. The main novelty of GWPT lies in representation learning. It uses non-contextual or contextual word embeddings, partitions embedding dimension indices into low-, medium-, and high-frequency sets, and represents them with different N-grams. It is shown by experimental results that GWPT offers state-of-the-art accuracies with fewer model parameters and significantly lower computational complexity in both training and inference as compared with deep-learning-based methods.
△ Less
Submitted 15 January, 2024;
originally announced January 2024.
-
Enhancing Edge Intelligence with Highly Discriminant LNT Features
Authors:
Xinyu Wang,
Vinod K. Mishra,
C. -C. Jay Kuo
Abstract:
AI algorithms at the edge demand smaller model sizes and lower computational complexity. To achieve these objectives, we adopt a green learning (GL) paradigm rather than the deep learning paradigm. GL has three modules: 1) unsupervised representation learning, 2) supervised feature learning, and 3) supervised decision learning. We focus on the second module in this work. In particular, we derive n…
▽ More
AI algorithms at the edge demand smaller model sizes and lower computational complexity. To achieve these objectives, we adopt a green learning (GL) paradigm rather than the deep learning paradigm. GL has three modules: 1) unsupervised representation learning, 2) supervised feature learning, and 3) supervised decision learning. We focus on the second module in this work. In particular, we derive new discriminant features from proper linear combinations of input features, denoted by x, obtained in the first module. They are called complementary and raw features, respectively. Along this line, we present a novel supervised learning method to generate highly discriminant complementary features based on the least-squares normal transform (LNT). LNT consists of two steps. First, we convert a C-class classification problem to a binary classification problem. The two classes are assigned with 0 and 1, respectively. Next, we formulate a least-squares regression problem from the N-dimensional (N-D) feature space to the 1-D output space, and solve the least-squares normal equation to obtain one N-D normal vector, denoted by a1. Since one normal vector is yielded by one binary split, we can obtain M normal vectors with M splits. Then, Ax is called an LNT of x, where transform matrix A in R^{M by N} by stacking aj^T, j=1, ..., M, and the LNT, Ax, can generate M new features. The newly generated complementary features are shown to be more discriminant than the raw features. Experiments show that the classification performance can be improved by these new features.
△ Less
Submitted 19 December, 2023;
originally announced December 2023.
-
Absolute Flux Density Calibration of the Greenland Telescope Data for Event Horizon Telescope Observations
Authors:
J. Y. Koay,
K. Asada,
S. Matsushita,
C. -Y. Kuo,
C. -W. L. Huang,
C. Romero-Cañizales,
S. Koyama,
J. Park,
W. -P. Lo,
G. Bower,
M. -T. Chen,
S. -H. Chang,
C. -C. Chen,
R. Chilson,
C. C. Han,
P. T. P. Ho,
Y. -D. Huang,
M. Inoue,
B. Jeter,
H. Jiang,
P. M. Koch,
D. Kubo,
C. -T. Li,
C. -T. Liu,
K. -Y. Liu
, et al. (13 additional authors not shown)
Abstract:
Starting from the observing campaign in April 2018, the Greenland Telescope (GLT) has been added as a new station of the Event Horizon Telescope (EHT) array. Visibilities on baselines to the GLT, particularly in the North-South direction, potentially provide valuable new constraints for the modeling and imaging of sources such as M87*. The GLT's location at high Northern latitudes adds unique chal…
▽ More
Starting from the observing campaign in April 2018, the Greenland Telescope (GLT) has been added as a new station of the Event Horizon Telescope (EHT) array. Visibilities on baselines to the GLT, particularly in the North-South direction, potentially provide valuable new constraints for the modeling and imaging of sources such as M87*. The GLT's location at high Northern latitudes adds unique challenges to its calibration strategies. Additionally, the performance of the GLT was not optimal during the 2018 observations due to it being only partially commissioned at the time. This document describes the steps taken to estimate the various parameters (and their uncertainties) required for the absolute flux calibration of the GLT data as part of the EHT. In particular, we consider the non-optimized status of the GLT in 2018, as well as its improved performance during the 2021 EHT campaign.
△ Less
Submitted 5 December, 2023;
originally announced December 2023.
-
Linear dichroic x-ray absorption response of Ti-Ti dimers along the $c$ axis in Ti$_2$O$_3$ upon Mg substitution
Authors:
M. Okawa,
D. Takegami,
D. S. Christovam,
M. Ferreira-Carvalho,
C. -Y. Kuo,
C. T. Chen,
T. Miyoshino,
K. Takasu,
T. Okuda,
C. F. Chang,
L. H. Tjeng,
T. Mizokawa
Abstract:
Corundum oxide Ti$_2$O$_3$ shows the metal-insulator transition around 400-600 K accompanying the nearest Ti$^{3+}$-Ti$^{3+}$ bond ($a_{1g}a_{1g}$ singlet state) formation along the $c$ axis. In order to clarify the hole-doping effect for the $a_{1g}a_{1g}$ singlet bond in Ti$_2$O$_3$, we investigated Ti $3d$ orbital anisotropy between corundum-type Ti$_2$O$_3$ and ilmenite-type MgTiO$_3$ using li…
▽ More
Corundum oxide Ti$_2$O$_3$ shows the metal-insulator transition around 400-600 K accompanying the nearest Ti$^{3+}$-Ti$^{3+}$ bond ($a_{1g}a_{1g}$ singlet state) formation along the $c$ axis. In order to clarify the hole-doping effect for the $a_{1g}a_{1g}$ singlet bond in Ti$_2$O$_3$, we investigated Ti $3d$ orbital anisotropy between corundum-type Ti$_2$O$_3$ and ilmenite-type MgTiO$_3$ using linear dichroism of soft x-ray absorption spectroscopy of the Ti $L_{2,3}$ edge. From the linear dichroic spectral weight in Mg$_y$Ti$_{2-y}$O$_3$, we confirmed that the $a_{1g}a_{1g}$ state is dominant not only in $y=0.01$ (almost Ti$_2$O$_3$), but also in $y = 0.29$, indicating that the Ti-Ti bond survives against a certain level of hole doping. In $y=0.63$ corresponding to 46% hole doping per Ti, the $3d$ orbital symmetry changes from $a_{1g}$ to $e_g^π$.
△ Less
Submitted 8 November, 2023;
originally announced November 2023.
-
Macroscopic approach to the radar echo scatter from high-energy particle cascades
Authors:
E. Huesca Santiago,
K. D. de Vries,
P. Allison,
J. Beatty,
D. Besson,
A. Connolly,
A. Cummings,
C. Deaconu,
S. De Kockere,
D. Frikken,
C. Hast,
C. -Y. Kuo,
A. Kyriacou,
U. A. Latif,
I. Loudon,
V. Lukic,
C. McLennan,
K. Mulrey,
J. Nam,
K. Nivedita,
A. Nozdrina,
E. Oberla,
S. Prohira,
J. P. Ralston,
M. F. H. Seikh
, et al. (6 additional authors not shown)
Abstract:
To probe the cosmic particle flux at the highest energies, large volumes of dense material like ice have to be monitored. This can be achieved by exploiting the radio signal. In this work, we provide a macroscopic model to predict the radar echo signatures found when a radio signal is reflected from a cosmic-ray or neutrino-induced particle cascade propagating in a dense medium like ice. Its macro…
▽ More
To probe the cosmic particle flux at the highest energies, large volumes of dense material like ice have to be monitored. This can be achieved by exploiting the radio signal. In this work, we provide a macroscopic model to predict the radar echo signatures found when a radio signal is reflected from a cosmic-ray or neutrino-induced particle cascade propagating in a dense medium like ice. Its macroscopic nature allows for an energy independent run-time, taking less than 10 s for simulating a single scatter event. As a first application, we discuss basic signal properties and simulate the expected signal for the T-576 beam-test experiment at the Stanford Linear Accelerator Center. We find good signal strength agreement with the only observed radar echo from a high-energy particle cascade to date.
△ Less
Submitted 11 June, 2024; v1 submitted 10 October, 2023;
originally announced October 2023.
-
SemST: Semantically Consistent Multi-Scale Image Translation via Structure-Texture Alignment
Authors:
Ganning Zhao,
Wenhui Cui,
Suya You,
C. -C. Jay Kuo
Abstract:
Unsupervised image-to-image (I2I) translation learns cross-domain image mapping that transfers input from the source domain to output in the target domain while preserving its semantics. One challenge is that different semantic statistics in source and target domains result in content discrepancy known as semantic distortion. To address this problem, a novel I2I method that maintains semantic cons…
▽ More
Unsupervised image-to-image (I2I) translation learns cross-domain image mapping that transfers input from the source domain to output in the target domain while preserving its semantics. One challenge is that different semantic statistics in source and target domains result in content discrepancy known as semantic distortion. To address this problem, a novel I2I method that maintains semantic consistency in translation is proposed and named SemST in this work. SemST reduces semantic distortion by employing contrastive learning and aligning the structural and textural properties of input and output by maximizing their mutual information. Furthermore, a multi-scale approach is introduced to enhance translation performance, thereby enabling the applicability of SemST to domain adaptation in high-resolution images. Experiments show that SemST effectively mitigates semantic distortion and achieves state-of-the-art performance. Also, the application of SemST to domain adaptation (DA) is explored. It is demonstrated by preliminary experiments that SemST can be utilized as a beneficial pre-training for the semantic segmentation task.
△ Less
Submitted 7 October, 2023;
originally announced October 2023.
-
Knowledge Graph Embedding: An Overview
Authors:
Xiou Ge,
Yun-Cheng Wang,
Bin Wang,
C. -C. Jay Kuo
Abstract:
Many mathematical models have been leveraged to design embeddings for representing Knowledge Graph (KG) entities and relations for link prediction and many downstream tasks. These mathematically-inspired models are not only highly scalable for inference in large KGs, but also have many explainable advantages in modeling different relation patterns that can be validated through both formal proofs a…
▽ More
Many mathematical models have been leveraged to design embeddings for representing Knowledge Graph (KG) entities and relations for link prediction and many downstream tasks. These mathematically-inspired models are not only highly scalable for inference in large KGs, but also have many explainable advantages in modeling different relation patterns that can be validated through both formal proofs and empirical results. In this paper, we make a comprehensive overview of the current state of research in KG completion. In particular, we focus on two main branches of KG embedding (KGE) design: 1) distance-based methods and 2) semantic matching-based methods. We discover the connections between recently proposed models and present an underlying trend that might help researchers invent novel and more effective models. Next, we delve into CompoundE and CompoundE3D, which draw inspiration from 2D and 3D affine operations, respectively. They encompass a broad spectrum of techniques including distance-based and semantic-based methods. We will also discuss an emerging approach for KG completion which leverages pre-trained language models (PLMs) and textual descriptions of entities and relations and offer insights into the integration of KGE embedding methods with PLMs for KG completion.
△ Less
Submitted 21 September, 2023;
originally announced September 2023.
-
Preparing pure $^{43}$Ca$^+$ samples in an ion trap with photoionization and parametric excitations
Authors:
C. -H. Kuo,
Y. -C. Hsiao,
C. -Y. Jhang,
Y. -D. Chen,
S. Tung
Abstract:
We present a practical scheme for the efficient preparation of laser-cooled $^{43}$Ca$^+$ ions in an ion trap. Our approach integrates two well-established methods: isotope-selective photoionization and isotope-specific parametric excitation. Drawing inspiration from the individual merits of each method, we have successfully integrated these techniques to prepare extended chains of $^{43}$Ca$^+$ i…
▽ More
We present a practical scheme for the efficient preparation of laser-cooled $^{43}$Ca$^+$ ions in an ion trap. Our approach integrates two well-established methods: isotope-selective photoionization and isotope-specific parametric excitation. Drawing inspiration from the individual merits of each method, we have successfully integrated these techniques to prepare extended chains of $^{43}$Ca$^+$ ions, overcoming the challenge posed by their low natural abundance of 0.135\% in a natural source. Furthermore, we explore the subtleties of our scheme, focusing on the influence of different factors on the purification process. Our investigation contributes to a broader understanding of the technique and highlights the adaptability of established methods in addressing specific isotopic challenges.
△ Less
Submitted 1 February, 2024; v1 submitted 21 September, 2023;
originally announced September 2023.
-
Unsupervised Green Object Tracker (GOT) without Offline Pre-training
Authors:
Zhiruo Zhou,
Suya You,
C. -C. Jay Kuo
Abstract:
Supervised trackers trained on labeled data dominate the single object tracking field for superior tracking accuracy. The labeling cost and the huge computational complexity hinder their applications on edge devices. Unsupervised learning methods have also been investigated to reduce the labeling cost but their complexity remains high. Aiming at lightweight high-performance tracking, feasibility w…
▽ More
Supervised trackers trained on labeled data dominate the single object tracking field for superior tracking accuracy. The labeling cost and the huge computational complexity hinder their applications on edge devices. Unsupervised learning methods have also been investigated to reduce the labeling cost but their complexity remains high. Aiming at lightweight high-performance tracking, feasibility without offline pre-training, and algorithmic transparency, we propose a new single object tracking method, called the green object tracker (GOT), in this work. GOT conducts an ensemble of three prediction branches for robust box tracking: 1) a global object-based correlator to predict the object location roughly, 2) a local patch-based correlator to build temporal correlations of small spatial units, and 3) a superpixel-based segmentator to exploit the spatial information of the target frame. GOT offers competitive tracking accuracy with state-of-the-art unsupervised trackers, which demand heavy offline pre-training, at a lower computation cost. GOT has a tiny model size (<3k parameters) and low inference complexity (around 58M FLOPs per frame). Since its inference complexity is between 0.1%-10% of DL trackers, it can be easily deployed on mobile and edge devices.
△ Less
Submitted 16 September, 2023;
originally announced September 2023.
-
Bias and Fairness in Chatbots: An Overview
Authors:
Jintang Xue,
Yun-Cheng Wang,
Chengwei Wei,
Xiaofeng Liu,
Jonghye Woo,
C. -C. Jay Kuo
Abstract:
Chatbots have been studied for more than half a century. With the rapid development of natural language processing (NLP) technologies in recent years, chatbots using large language models (LLMs) have received much attention nowadays. Compared with traditional ones, modern chatbots are more powerful and have been used in real-world applications. There are however, bias and fairness concerns in mode…
▽ More
Chatbots have been studied for more than half a century. With the rapid development of natural language processing (NLP) technologies in recent years, chatbots using large language models (LLMs) have received much attention nowadays. Compared with traditional ones, modern chatbots are more powerful and have been used in real-world applications. There are however, bias and fairness concerns in modern chatbot design. Due to the huge amounts of training data, extremely large model sizes, and lack of interpretability, bias mitigation and fairness preservation of modern chatbots are challenging. Thus, a comprehensive overview on bias and fairness in chatbot systems is given in this paper. The history of chatbots and their categories are first reviewed. Then, bias sources and potential harms in applications are analyzed. Considerations in designing fair and unbiased chatbot systems are examined. Finally, future research directions are discussed.
△ Less
Submitted 10 December, 2023; v1 submitted 15 September, 2023;
originally announced September 2023.
-
AsyncET: Asynchronous Learning for Knowledge Graph Entity Typing with Auxiliary Relations
Authors:
Yun-Cheng Wang,
Xiou Ge,
Bin Wang,
C. -C. Jay Kuo
Abstract:
Knowledge graph entity typing (KGET) is a task to predict the missing entity types in knowledge graphs (KG). Previously, KG embedding (KGE) methods tried to solve the KGET task by introducing an auxiliary relation, 'hasType', to model the relationship between entities and their types. However, a single auxiliary relation has limited expressiveness for diverse entity-type patterns. We improve the e…
▽ More
Knowledge graph entity typing (KGET) is a task to predict the missing entity types in knowledge graphs (KG). Previously, KG embedding (KGE) methods tried to solve the KGET task by introducing an auxiliary relation, 'hasType', to model the relationship between entities and their types. However, a single auxiliary relation has limited expressiveness for diverse entity-type patterns. We improve the expressiveness of KGE methods by introducing multiple auxiliary relations in this work. Similar entity types are grouped to reduce the number of auxiliary relations and improve their capability to model entity-type patterns with different granularities. With the presence of multiple auxiliary relations, we propose a method adopting an Asynchronous learning scheme for Entity Typing, named AsyncET, which updates the entity and type embeddings alternatively to keep the learned entity embedding up-to-date and informative for entity type prediction. Experiments are conducted on two commonly used KGET datasets to show that the performance of KGE methods on the KGET task can be substantially improved by the proposed multiple auxiliary relations and asynchronous embedding learning. Furthermore, our method has a significant advantage over state-of-the-art methods in model sizes and time complexity.
△ Less
Submitted 30 August, 2023;
originally announced August 2023.
-
A Measurement of Gravitational Lensing of the Cosmic Microwave Background Using SPT-3G 2018 Data
Authors:
Z. Pan,
F. Bianchini,
W. L. K. Wu,
P. A. R. Ade,
Z. Ahmed,
E. Anderes,
A. J. Anderson,
B. Ansarinejad,
M. Archipley,
K. Aylor,
L. Balkenhol,
P. S. Barry,
R. Basu Thakur,
K. Benabed,
A. N. Bender,
B. A. Benson,
L. E. Bleem,
F. R. Bouchet,
L. Bryant,
K. Byrum,
E. Camphuis,
J. E. Carlstrom,
F. W. Carter,
T. W. Cecil,
C. L. Chang
, et al. (111 additional authors not shown)
Abstract:
We present a measurement of gravitational lensing over 1500 deg$^2$ of the Southern sky using SPT-3G temperature data at 95 and 150 GHz taken in 2018. The lensing amplitude relative to a fiducial Planck 2018 $Λ$CDM cosmology is found to be $1.020\pm0.060$, excluding instrumental and astrophysical systematic uncertainties. We conduct extensive systematic and null tests to check the robustness of th…
▽ More
We present a measurement of gravitational lensing over 1500 deg$^2$ of the Southern sky using SPT-3G temperature data at 95 and 150 GHz taken in 2018. The lensing amplitude relative to a fiducial Planck 2018 $Λ$CDM cosmology is found to be $1.020\pm0.060$, excluding instrumental and astrophysical systematic uncertainties. We conduct extensive systematic and null tests to check the robustness of the lensing measurements, and report a minimum-variance combined lensing power spectrum over angular multipoles of $50<L<2000$, which we use to constrain cosmological models. When analyzed alone and jointly with primary cosmic microwave background (CMB) spectra within the $Λ$CDM model, our lensing amplitude measurements are consistent with measurements from SPT-SZ, SPTpol, ACT, and Planck. Incorporating loose priors on the baryon density and other parameters including uncertainties on a foreground bias template, we obtain a $1σ$ constraint on $σ_8 Ω_{\rm m}^{0.25}=0.595 \pm 0.026$ using the SPT-3G 2018 lensing data alone, where $σ_8$ is a common measure of the amplitude of structure today and $Ω_{\rm m}$ is the matter density parameter. Combining SPT-3G 2018 lensing measurements with baryon acoustic oscillation (BAO) data, we derive parameter constraints of $σ_8 = 0.810 \pm 0.033$, $S_8 \equiv σ_8(Ω_{\rm m}/0.3)^{0.5}= 0.836 \pm 0.039$, and Hubble constant $H_0 =68.8^{+1.3}_{-1.6}$ km s$^{-1}$ Mpc$^{-1}$. Using CMB anisotropy and lensing measurements from SPT-3G only, we provide independent constraints on the spatial curvature of $Ω_{K} = 0.014^{+0.023}_{-0.026}$ (95% C.L.) and the dark energy density of $Ω_Λ= 0.722^{+0.031}_{-0.026}$ (68% C.L.). When combining SPT-3G lensing data with SPT-3G CMB anisotropy and BAO data, we find an upper limit on the sum of the neutrino masses of $\sum m_ν< 0.30$ eV (95% C.L.).
△ Less
Submitted 29 January, 2024; v1 submitted 22 August, 2023;
originally announced August 2023.
-
A Comprehensive Overview of Computational Nuclei Segmentation Methods in Digital Pathology
Authors:
Vasileios Magoulianitis,
Catherine A. Alexander,
C. -C. Jay Kuo
Abstract:
In the cancer diagnosis pipeline, digital pathology plays an instrumental role in the identification, staging, and grading of malignant areas on biopsy tissue specimens. High resolution histology images are subject to high variance in appearance, sourcing either from the acquisition devices or the H\&E staining process. Nuclei segmentation is an important task, as it detects the nuclei cells over…
▽ More
In the cancer diagnosis pipeline, digital pathology plays an instrumental role in the identification, staging, and grading of malignant areas on biopsy tissue specimens. High resolution histology images are subject to high variance in appearance, sourcing either from the acquisition devices or the H\&E staining process. Nuclei segmentation is an important task, as it detects the nuclei cells over background tissue and gives rise to the topology, size, and count of nuclei which are determinant factors for cancer detection. Yet, it is a fairly time consuming task for pathologists, with reportedly high subjectivity. Computer Aided Diagnosis (CAD) tools empowered by modern Artificial Intelligence (AI) models enable the automation of nuclei segmentation. This can reduce the subjectivity in analysis and reading time. This paper provides an extensive review, beginning from earlier works use traditional image processing techniques and reaching up to modern approaches following the Deep Learning (DL) paradigm. Our review also focuses on the weak supervision aspect of the problem, motivated by the fact that annotated data is scarce. At the end, the advantages of different models and types of supervision are thoroughly discussed. Furthermore, we try to extrapolate and envision how future research lines will potentially be, so as to minimize the need for labeled data while maintaining high performance. Future methods should emphasize efficient and explainable models with a transparent underlying process so that physicians can trust their output.
△ Less
Submitted 15 August, 2023;
originally announced August 2023.
-
An Overview on Generative AI at Scale with Edge-Cloud Computing
Authors:
Yun-Cheng Wang,
Jintang Xue,
Chengwei Wei,
C. -C. Jay Kuo
Abstract:
As a specific category of artificial intelligence (AI), generative artificial intelligence (GenAI) generates new content that resembles what is created by humans. The rapid development of GenAI systems has created a huge amount of new data on the Internet, posing new challenges to current computing and communication frameworks. Currently, GenAI services rely on the traditional cloud computing fram…
▽ More
As a specific category of artificial intelligence (AI), generative artificial intelligence (GenAI) generates new content that resembles what is created by humans. The rapid development of GenAI systems has created a huge amount of new data on the Internet, posing new challenges to current computing and communication frameworks. Currently, GenAI services rely on the traditional cloud computing framework due to the need for large computation resources. However, such services will encounter high latency because of data transmission and a high volume of requests. On the other hand, edge-cloud computing can provide adequate computation power and low latency at the same time through the collaboration between edges and the cloud. Thus, it is attractive to build GenAI systems at scale by leveraging the edge-cloud computing paradigm. In this overview paper, we review recent developments in GenAI and edge-cloud computing, respectively. Then, we use two exemplary GenAI applications to discuss technical challenges in scaling up their solutions using edge-cloud collaborative systems. Finally, we list design considerations for training and deploying GenAI systems at scale and point out future research directions.
△ Less
Submitted 9 July, 2023; v1 submitted 2 June, 2023;
originally announced June 2023.
-
Blind Video Quality Assessment at the Edge
Authors:
Zhanxuan Mei,
Yun-Cheng Wang,
C. -C. Jay Kuo
Abstract:
Owing to the proliferation of user-generated videos on the Internet, blind video quality assessment (BVQA) at the edge attracts growing attention. The usage of deep-learning-based methods is restricted to be applied at the edge due to their large model sizes and high computational complexity. In light of this, a novel lightweight BVQA method called GreenBVQA is proposed in this work. GreenBVQA fea…
▽ More
Owing to the proliferation of user-generated videos on the Internet, blind video quality assessment (BVQA) at the edge attracts growing attention. The usage of deep-learning-based methods is restricted to be applied at the edge due to their large model sizes and high computational complexity. In light of this, a novel lightweight BVQA method called GreenBVQA is proposed in this work. GreenBVQA features a small model size, low computational complexity, and high performance. Its processing pipeline includes: video data cropping, unsupervised representation generation, supervised feature selection, and mean-opinion-score (MOS) regression and ensembles. We conduct experimental evaluations on three BVQA datasets and show that GreenBVQA can offer state-of-the-art performance in PLCC and SROCC metrics while demanding significantly smaller model sizes and lower computational complexity. Thus, GreenBVQA is well-suited for edge devices.
△ Less
Submitted 29 October, 2023; v1 submitted 17 June, 2023;
originally announced June 2023.
-
Green Steganalyzer: A Green Learning Approach to Image Steganalysis
Authors:
Yao Zhu,
Xinyu Wang,
Hong-Shuo Chen,
Ronald Salloum,
C. -C. Jay Kuo
Abstract:
A novel learning solution to image steganalysis based on the green learning paradigm, called Green Steganalyzer (GS), is proposed in this work. GS consists of three modules: 1) pixel-based anomaly prediction, 2) embedding location detection, and 3) decision fusion for image-level detection. In the first module, GS decomposes an image into patches, adopts Saab transforms for feature extraction, and…
▽ More
A novel learning solution to image steganalysis based on the green learning paradigm, called Green Steganalyzer (GS), is proposed in this work. GS consists of three modules: 1) pixel-based anomaly prediction, 2) embedding location detection, and 3) decision fusion for image-level detection. In the first module, GS decomposes an image into patches, adopts Saab transforms for feature extraction, and conducts self-supervised learning to predict an anomaly score of their center pixel. In the second module, GS analyzes the anomaly scores of a pixel and its neighborhood to find pixels of higher embedding probabilities. In the third module, GS focuses on pixels of higher embedding probabilities and fuses their anomaly scores to make final image-level classification. Compared with state-of-the-art deep-learning models, GS achieves comparable detection performance against S-UNIWARD, WOW and HILL steganography schemes with significantly lower computational complexity and a smaller model size, making it attractive for mobile/edge applications. Furthermore, GS is mathematically transparent because of its modular design.
△ Less
Submitted 6 June, 2023;
originally announced June 2023.
-
Unsupervised Synthetic Image Refinement via Contrastive Learning and Consistent Semantic-Structural Constraints
Authors:
Ganning Zhao,
Tingwei Shen,
Suya You,
C. -C. Jay Kuo
Abstract:
Ensuring the realism of computer-generated synthetic images is crucial to deep neural network (DNN) training. Due to different semantic distributions between synthetic and real-world captured datasets, there exists semantic mismatch between synthetic and refined images, which in turn results in the semantic distortion. Recently, contrastive learning (CL) has been successfully used to pull correlat…
▽ More
Ensuring the realism of computer-generated synthetic images is crucial to deep neural network (DNN) training. Due to different semantic distributions between synthetic and real-world captured datasets, there exists semantic mismatch between synthetic and refined images, which in turn results in the semantic distortion. Recently, contrastive learning (CL) has been successfully used to pull correlated patches together and push uncorrelated ones apart. In this work, we exploit semantic and structural consistency between synthetic and refined images and adopt CL to reduce the semantic distortion. Besides, we incorporate hard negative mining to improve the performance furthermore. We compare the performance of our method with several other benchmarking methods using qualitative and quantitative measures and show that our method offers the state-of-the-art performance.
△ Less
Submitted 26 April, 2023; v1 submitted 25 April, 2023;
originally announced April 2023.
-
Knowledge Graph Embedding with 3D Compound Geometric Transformations
Authors:
Xiou Ge,
Yun-Cheng Wang,
Bin Wang,
C. -C. Jay Kuo
Abstract:
The cascade of 2D geometric transformations were exploited to model relations between entities in a knowledge graph (KG), leading to an effective KG embedding (KGE) model, CompoundE. Furthermore, the rotation in the 3D space was proposed as a new KGE model, Rotate3D, by leveraging its non-commutative property. Inspired by CompoundE and Rotate3D, we leverage 3D compound geometric transformations, i…
▽ More
The cascade of 2D geometric transformations were exploited to model relations between entities in a knowledge graph (KG), leading to an effective KG embedding (KGE) model, CompoundE. Furthermore, the rotation in the 3D space was proposed as a new KGE model, Rotate3D, by leveraging its non-commutative property. Inspired by CompoundE and Rotate3D, we leverage 3D compound geometric transformations, including translation, rotation, scaling, reflection, and shear and propose a family of KGE models, named CompoundE3D, in this work. CompoundE3D allows multiple design variants to match rich underlying characteristics of a KG. Since each variant has its own advantages on a subset of relations, an ensemble of multiple variants can yield superior performance. The effectiveness and flexibility of CompoundE3D are experimentally verified on four popular link prediction datasets.
△ Less
Submitted 1 April, 2023;
originally announced April 2023.
-
Lightweight High-Performance Blind Image Quality Assessment
Authors:
Zhanxuan Mei,
Yun-Cheng Wang,
Xingze He,
Yong Yan,
C. -C. Jay Kuo
Abstract:
Blind image quality assessment (BIQA) is a task that predicts the perceptual quality of an image without its reference. Research on BIQA attracts growing attention due to the increasing amount of user-generated images and emerging mobile applications where reference images are unavailable. The problem is challenging due to the wide range of content and mixed distortion types. Many existing BIQA me…
▽ More
Blind image quality assessment (BIQA) is a task that predicts the perceptual quality of an image without its reference. Research on BIQA attracts growing attention due to the increasing amount of user-generated images and emerging mobile applications where reference images are unavailable. The problem is challenging due to the wide range of content and mixed distortion types. Many existing BIQA methods use deep neural networks (DNNs) to achieve high performance. However, their large model sizes hinder their applicability to edge or mobile devices. To meet the need, a novel BIQA method with a small model, low computational complexity, and high performance is proposed and named "GreenBIQA" in this work. GreenBIQA includes five steps: 1) image cropping, 2) unsupervised representation generation, 3) supervised feature selection, 4) distortion-specific prediction, and 5) regression and decision ensemble. Experimental results show that the performance of GreenBIQA is comparable with that of state-of-the-art deep-learning (DL) solutions while demanding a much smaller model size and significantly lower computational complexity.
△ Less
Submitted 23 March, 2023;
originally announced March 2023.
-
A Tiny Machine Learning Model for Point Cloud Object Classification
Authors:
Min Zhang,
Jintang Xue,
Pranav Kadam,
Hardik Prajapati,
Shan Liu,
C. -C. Jay Kuo
Abstract:
The design of a tiny machine learning model, which can be deployed in mobile and edge devices, for point cloud object classification is investigated in this work. To achieve this objective, we replace the multi-scale representation of a point cloud object with a single-scale representation for complexity reduction, and exploit rich 3D geometric information of a point cloud object for performance i…
▽ More
The design of a tiny machine learning model, which can be deployed in mobile and edge devices, for point cloud object classification is investigated in this work. To achieve this objective, we replace the multi-scale representation of a point cloud object with a single-scale representation for complexity reduction, and exploit rich 3D geometric information of a point cloud object for performance improvement. The proposed solution is named Green-PointHop due to its low computational complexity. We evaluate the performance of Green-PointHop on ModelNet40 and ScanObjectNN two datasets. Green-PointHop has a model size of 64K parameters. It demands 2.3M floating-point operations (FLOPs) to classify a ModelNet40 object of 1024 down-sampled points. Its classification performance gaps against the state-of-the-art DGCNN method are 3% and 7% for ModelNet40 and ScanObjectNN, respectively. On the other hand, the model size and inference complexity of DGCNN are 42X and 1203X of those of Green-PointHop, respectively.
△ Less
Submitted 20 March, 2023;
originally announced March 2023.
-
An Overview on Language Models: Recent Developments and Outlook
Authors:
Chengwei Wei,
Yun-Cheng Wang,
Bin Wang,
C. -C. Jay Kuo
Abstract:
Language modeling studies the probability distributions over strings of texts. It is one of the most fundamental tasks in natural language processing (NLP). It has been widely used in text generation, speech recognition, machine translation, etc. Conventional language models (CLMs) aim to predict the probability of linguistic sequences in a causal manner, while pre-trained language models (PLMs) c…
▽ More
Language modeling studies the probability distributions over strings of texts. It is one of the most fundamental tasks in natural language processing (NLP). It has been widely used in text generation, speech recognition, machine translation, etc. Conventional language models (CLMs) aim to predict the probability of linguistic sequences in a causal manner, while pre-trained language models (PLMs) cover broader concepts and can be used in both causal sequential modeling and fine-tuning for downstream applications. PLMs have their own training paradigms (usually self-supervised) and serve as foundation models in modern NLP systems. This overview paper provides an introduction to both CLMs and PLMs from five aspects, i.e., linguistic units, architectures, training methods, evaluation methods, and applications. Furthermore, we discuss the relationship between CLMs and PLMs and shed light on the future directions of language modeling in the pre-trained era.
△ Less
Submitted 3 July, 2023; v1 submitted 10 March, 2023;
originally announced March 2023.
-
PointFlowHop: Green and Interpretable Scene Flow Estimation from Consecutive Point Clouds
Authors:
Pranav Kadam,
Jiahao Gu,
Shan Liu,
C. -C. Jay Kuo
Abstract:
An efficient 3D scene flow estimation method called PointFlowHop is proposed in this work. PointFlowHop takes two consecutive point clouds and determines the 3D flow vectors for every point in the first point cloud. PointFlowHop decomposes the scene flow estimation task into a set of subtasks, including ego-motion compensation, object association and object-wise motion estimation. It follows the g…
▽ More
An efficient 3D scene flow estimation method called PointFlowHop is proposed in this work. PointFlowHop takes two consecutive point clouds and determines the 3D flow vectors for every point in the first point cloud. PointFlowHop decomposes the scene flow estimation task into a set of subtasks, including ego-motion compensation, object association and object-wise motion estimation. It follows the green learning (GL) pipeline and adopts the feedforward data processing path. As a result, its underlying mechanism is more transparent than deep-learning (DL) solutions based on end-to-end optimization of network parameters. We conduct experiments on the stereoKITTI and the Argoverse LiDAR point cloud datasets and demonstrate that PointFlowHop outperforms deep-learning methods with a small model size and less training time. Furthermore, we compare the Floating Point Operations (FLOPs) required by PointFlowHop and other learning-based methods in inference, and show its big savings in computational complexity.
△ Less
Submitted 27 February, 2023;
originally announced February 2023.
-
LSR: A Light-Weight Super-Resolution Method
Authors:
Wei Wang,
Xuejing Lei,
Yueru Chen,
Ming-Sui Lee,
C. -C. Jay Kuo
Abstract:
A light-weight super-resolution (LSR) method from a single image targeting mobile applications is proposed in this work. LSR predicts the residual image between the interpolated low-resolution (ILR) and high-resolution (HR) images using a self-supervised framework. To lower the computational complexity, LSR does not adopt the end-to-end optimization deep networks. It consists of three modules: 1)…
▽ More
A light-weight super-resolution (LSR) method from a single image targeting mobile applications is proposed in this work. LSR predicts the residual image between the interpolated low-resolution (ILR) and high-resolution (HR) images using a self-supervised framework. To lower the computational complexity, LSR does not adopt the end-to-end optimization deep networks. It consists of three modules: 1) generation of a pool of rich and diversified representations in the neighborhood of a target pixel via unsupervised learning, 2) selecting a subset from the representation pool that is most relevant to the underlying super-resolution task automatically via supervised learning, 3) predicting the residual of the target pixel via regression. LSR has low computational complexity and reasonable model size so that it can be implemented on mobile/edge platforms conveniently. Besides, it offers better visual quality than classical exemplar-based methods in terms of PSNR/SSIM measures.
△ Less
Submitted 27 February, 2023;
originally announced February 2023.
-
S3I-PointHop: SO(3)-Invariant PointHop for 3D Point Cloud Classification
Authors:
Pranav Kadam,
Hardik Prajapati,
Min Zhang,
Jintang Xue,
Shan Liu,
C. -C. Jay Kuo
Abstract:
Many point cloud classification methods are developed under the assumption that all point clouds in the dataset are well aligned with the canonical axes so that the 3D Cartesian point coordinates can be employed to learn features. When input point clouds are not aligned, the classification performance drops significantly. In this work, we focus on a mathematically transparent point cloud classific…
▽ More
Many point cloud classification methods are developed under the assumption that all point clouds in the dataset are well aligned with the canonical axes so that the 3D Cartesian point coordinates can be employed to learn features. When input point clouds are not aligned, the classification performance drops significantly. In this work, we focus on a mathematically transparent point cloud classification method called PointHop, analyze its reason for failure due to pose variations, and solve the problem by replacing its pose dependent modules with rotation invariant counterparts. The proposed method is named SO(3)-Invariant PointHop (or S3I-PointHop in short). We also significantly simplify the PointHop pipeline using only one single hop along with multiple spatial aggregation techniques. The idea of exploiting more spatial information is novel. Experiments on the ModelNet40 dataset demonstrate the superiority of S3I-PointHop over traditional PointHop-like methods.
△ Less
Submitted 22 February, 2023;
originally announced February 2023.
-
gpcgc: a green point cloud geometry coding method
Authors:
Qingyang Zhou,
Shan Liu,
C. -C. Jay Kuo
Abstract:
A low-complexity point cloud compression method called the Green Point Cloud Geometry Codec (GPCGC), is proposed to encode the 3D spatial coordinates of static point clouds efficiently. GPCGC consists of two modules. In the first module, point coordinates of input point clouds are hierarchically organized into an octree structure. Points at each leaf node are projected along one of three axes to y…
▽ More
A low-complexity point cloud compression method called the Green Point Cloud Geometry Codec (GPCGC), is proposed to encode the 3D spatial coordinates of static point clouds efficiently. GPCGC consists of two modules. In the first module, point coordinates of input point clouds are hierarchically organized into an octree structure. Points at each leaf node are projected along one of three axes to yield image maps. In the second module, the occupancy map is clustered into 9 modes while the depth map is coded by a low-complexity high-efficiency image codec, called the green image codec (GIC). GIC is a multi-resolution codec based on vector quantization (VQ). Its complexity is significantly lower than HEVC-Intra. Furthermore, the rate-distortion optimization (RDO) technique is used to select the optimal coding parameters. GPCGC is a progressive codec, and it offers a coding performance competitive with MPEG's V-PCC and G-PCC standards at significantly lower complexity.
△ Less
Submitted 12 February, 2023;
originally announced February 2023.
-
Successive Subspace Learning for Cardiac Disease Classification with Two-phase Deformation Fields from Cine MRI
Authors:
Xiaofeng Liu,
Fangxu Xing,
Hanna K. Gaggin,
C. -C. Jay Kuo,
Georges El Fakhri,
Jonghye Woo
Abstract:
Cardiac cine magnetic resonance imaging (MRI) has been used to characterize cardiovascular diseases (CVD), often providing a noninvasive phenotyping tool.~While recently flourished deep learning based approaches using cine MRI yield accurate characterization results, the performance is often degraded by small training samples. In addition, many deep learning models are deemed a ``black box," for w…
▽ More
Cardiac cine magnetic resonance imaging (MRI) has been used to characterize cardiovascular diseases (CVD), often providing a noninvasive phenotyping tool.~While recently flourished deep learning based approaches using cine MRI yield accurate characterization results, the performance is often degraded by small training samples. In addition, many deep learning models are deemed a ``black box," for which models remain largely elusive in how models yield a prediction and how reliable they are. To alleviate this, this work proposes a lightweight successive subspace learning (SSL) framework for CVD classification, based on an interpretable feedforward design, in conjunction with a cardiac atlas. Specifically, our hierarchical SSL model is based on (i) neighborhood voxel expansion, (ii) unsupervised subspace approximation, (iii) supervised regression, and (iv) multi-level feature integration. In addition, using two-phase 3D deformation fields, including end-diastolic and end-systolic phases, derived between the atlas and individual subjects as input offers objective means of assessing CVD, even with small training samples. We evaluate our framework on the ACDC2017 database, comprising one healthy group and four disease groups. Compared with 3D CNN-based approaches, our framework achieves superior classification performance with 140$\times$ fewer parameters, which supports its potential value in clinical use.
△ Less
Submitted 21 January, 2023;
originally announced January 2023.
-
SALVE: Self-supervised Adaptive Low-light Video Enhancement
Authors:
Zohreh Azizi,
C. -C. Jay Kuo
Abstract:
A self-supervised adaptive low-light video enhancement method, called SALVE, is proposed in this work. SALVE first enhances a few key frames of an input low-light video using a retinex-based low-light image enhancement technique. For each keyframe, it learns a mapping from low-light image patches to enhanced ones via ridge regression. These mappings are then used to enhance the remaining frames in…
▽ More
A self-supervised adaptive low-light video enhancement method, called SALVE, is proposed in this work. SALVE first enhances a few key frames of an input low-light video using a retinex-based low-light image enhancement technique. For each keyframe, it learns a mapping from low-light image patches to enhanced ones via ridge regression. These mappings are then used to enhance the remaining frames in the low-light video. The combination of traditional retinex-based image enhancement and learning-based ridge regression leads to a robust, adaptive and computationally inexpensive solution to enhance low-light videos. Our extensive experiments along with a user study show that 87% of participants prefer SALVE over prior work.
△ Less
Submitted 21 February, 2023; v1 submitted 22 December, 2022;
originally announced December 2022.
-
A Measurement of the CMB Temperature Power Spectrum and Constraints on Cosmology from the SPT-3G 2018 TT/TE/EE Data Set
Authors:
L. Balkenhol,
D. Dutcher,
A. Spurio Mancini,
A. Doussot,
K. Benabed,
S. Galli,
P. A. R. Ade,
A. J. Anderson,
B. Ansarinejad,
M. Archipley,
A. N. Bender,
B. A. Benson,
F. Bianchini,
L. E. Bleem,
F. R. Bouchet,
L. Bryant,
E. Camphuis,
J. E. Carlstrom,
T. W. Cecil,
C. L. Chang,
P. Chaubal,
P. M. Chichura,
T. -L. Chou,
A. Coerver,
T. M. Crawford
, et al. (62 additional authors not shown)
Abstract:
We present a sample-variance-limited measurement of the temperature power spectrum ($TT$) of the cosmic microwave background (CMB) using observations of a $\sim\! 1500 \,\mathrm{deg}^2$ field made by SPT-3G in 2018. We report multifrequency power spectrum measurements at 95, 150, and 220GHz covering the angular multipole range $750 \leq \ell < 3000$. We combine this $TT$ measurement with the publi…
▽ More
We present a sample-variance-limited measurement of the temperature power spectrum ($TT$) of the cosmic microwave background (CMB) using observations of a $\sim\! 1500 \,\mathrm{deg}^2$ field made by SPT-3G in 2018. We report multifrequency power spectrum measurements at 95, 150, and 220GHz covering the angular multipole range $750 \leq \ell < 3000$. We combine this $TT$ measurement with the published polarization power spectrum measurements from the 2018 observing season and update their associated covariance matrix to complete the SPT-3G 2018 $TT/TE/EE$ data set. This is the first analysis to present cosmological constraints from SPT $TT$, $TE$, and $EE$ power spectrum measurements jointly. We blind the cosmological results and subject the data set to a series of consistency tests at the power spectrum and parameter level. We find excellent agreement between frequencies and spectrum types and our results are robust to the modeling of astrophysical foregrounds. We report results for $Λ$CDM and a series of extensions, drawing on the following parameters: the amplitude of the gravitational lensing effect on primary power spectra $A_\mathrm{L}$, the effective number of neutrino species $N_{\mathrm{eff}}$, the primordial helium abundance $Y_{\mathrm{P}}$, and the baryon clumping factor due to primordial magnetic fields $b$. We find that the SPT-3G 2018 $T/TE/EE$ data are well fit by $Λ$CDM with a probability-to-exceed of $15\%$. For $Λ$CDM, we constrain the expansion rate today to $H_0 = 68.3 \pm 1.5\,\mathrm{km\,s^{-1}\,Mpc^{-1}}$ and the combined structure growth parameter to $S_8 = 0.797 \pm 0.042$. The SPT-based results are effectively independent of Planck, and the cosmological parameter constraints from either data set are within $<1\,σ$ of each other. (abridged)
△ Less
Submitted 27 July, 2023; v1 submitted 11 December, 2022;
originally announced December 2022.
-
LGSQE: Lightweight Generated Sample Quality Evaluatoin
Authors:
Ganning Zhao,
Vasileios Magoulianitis,
Suya You,
C. -C. Jay Kuo
Abstract:
Despite prolific work on evaluating generative models, little research has been done on the quality evaluation of an individual generated sample. To address this problem, a lightweight generated sample quality evaluation (LGSQE) method is proposed in this work. In the training stage of LGSQE, a binary classifier is trained on real and synthetic samples, where real and synthetic data are labeled by…
▽ More
Despite prolific work on evaluating generative models, little research has been done on the quality evaluation of an individual generated sample. To address this problem, a lightweight generated sample quality evaluation (LGSQE) method is proposed in this work. In the training stage of LGSQE, a binary classifier is trained on real and synthetic samples, where real and synthetic data are labeled by 0 and 1, respectively. In the inference stage, the classifier assigns soft labels (ranging from 0 to 1) to each generated sample. The value of soft label indicates the quality level; namely, the quality is better if its soft label is closer to 0. LGSQE can serve as a post-processing module for quality control. Furthermore, LGSQE can be used to evaluate the performance of generative models, such as accuracy, AUC, precision and recall, by aggregating sample-level quality. Experiments are conducted on CIFAR-10 and MNIST to demonstrate that LGSQE can preserve the same performance rank order as that predicted by the Frechet Inception Distance (FID) but with significantly lower complexity.
△ Less
Submitted 8 November, 2022;
originally announced November 2022.
-
Recovering Sign Bits of DCT Coefficients in Digital Images as an Optimization Problem
Authors:
Ruiyuan Lin,
Sheng Liu,
Jun Jiang,
Shujun Li,
Chengqing Li,
C. -C. Jay Kuo
Abstract:
Recovering unknown, missing, damaged, distorted, or lost information in DCT coefficients is a common task in multiple applications of digital image processing, including image compression, selective image encryption, and image communication. This paper investigates the recovery of sign bits in DCT coefficients of digital images, by proposing two different approximation methods to solve a mixed int…
▽ More
Recovering unknown, missing, damaged, distorted, or lost information in DCT coefficients is a common task in multiple applications of digital image processing, including image compression, selective image encryption, and image communication. This paper investigates the recovery of sign bits in DCT coefficients of digital images, by proposing two different approximation methods to solve a mixed integer linear programming (MILP) problem, which is NP-hard in general. One method is a relaxation of the MILP problem to a linear programming (LP) problem, and the other splits the original MILP problem into some smaller MILP problems and an LP problem. We considered how the proposed methods can be applied to JPEG-encoded images and conducted extensive experiments to validate their performances. The experimental results showed that the proposed methods outperformed other existing methods by a substantial margin, both according to objective quality metrics and our subjective evaluation.
△ Less
Submitted 8 January, 2024; v1 submitted 2 November, 2022;
originally announced November 2022.
-
GENHOP: An Image Generation Method Based on Successive Subspace Learning
Authors:
Xuejing Lei,
Wei Wang,
C. -C. Jay Kuo
Abstract:
Being different from deep-learning-based (DL-based) image generation methods, a new image generative model built upon successive subspace learning principle is proposed and named GenHop (an acronym of Generative PixelHop) in this work. GenHop consists of three modules: 1) high-to-low dimension reduction, 2) seed image generation, and 3) low-to-high dimension expansion. In the first module, it buil…
▽ More
Being different from deep-learning-based (DL-based) image generation methods, a new image generative model built upon successive subspace learning principle is proposed and named GenHop (an acronym of Generative PixelHop) in this work. GenHop consists of three modules: 1) high-to-low dimension reduction, 2) seed image generation, and 3) low-to-high dimension expansion. In the first module, it builds a sequence of high-to-low dimensional subspaces through a sequence of whitening processes, each of which contains samples of joint-spatial-spectral representation. In the second module, it generates samples in the lowest dimensional subspace. In the third module, it finds a proper high-dimensional sample for a seed image by adding details back via locally linear embedding (LLE) and a sequence of coloring processes. Experiments show that GenHop can generate visually pleasant images whose FID scores are comparable or even better than those of DL-based generative models for MNIST, Fashion-MNIST and CelebA datasets.
△ Less
Submitted 7 October, 2022;
originally announced October 2022.
-
Green Learning: Introduction, Examples and Outlook
Authors:
C. -C. Jay Kuo,
Azad M. Madni
Abstract:
Rapid advances in artificial intelligence (AI) in the last decade have largely been built upon the wide applications of deep learning (DL). However, the high carbon footprint yielded by larger and larger DL networks becomes a concern for sustainability. Furthermore, DL decision mechanism is somewhat obsecure and can only be verified by test data. Green learning (GL) has been proposed as an alterna…
▽ More
Rapid advances in artificial intelligence (AI) in the last decade have largely been built upon the wide applications of deep learning (DL). However, the high carbon footprint yielded by larger and larger DL networks becomes a concern for sustainability. Furthermore, DL decision mechanism is somewhat obsecure and can only be verified by test data. Green learning (GL) has been proposed as an alternative paradigm to address these concerns. GL is characterized by low carbon footprints, small model sizes, low computational complexity, and logical transparency. It offers energy-effective solutions in cloud centers as well as mobile/edge devices. GL also provides a clear and logical decision-making process to gain people's trust. Several statistical tools have been developed to achieve this goal in recent years. They include subspace approximation, unsupervised and supervised representation learning, supervised discriminant feature selection, and feature space partitioning. We have seen a few successful GL examples with performance comparable with state-of-the-art DL solutions. This paper offers an introduction to GL, its demonstrated applications, and future outlook.
△ Less
Submitted 3 October, 2022;
originally announced October 2022.
-
Lightweight Image Codec via Multi-Grid Multi-Block-Size Vector Quantization (MGBVQ)
Authors:
Yifan Wang,
Zhanxuan Mei,
Ioannis Katsavounidis,
C. -C. Jay Kuo
Abstract:
A multi-grid multi-block-size vector quantization (MGBVQ) method is proposed for image coding in this work. The fundamental idea of image coding is to remove correlations among pixels before quantization and entropy coding, e.g., the discrete cosine transform (DCT) and intra predictions, adopted by modern image coding standards. We present a new method to remove pixel correlations. First, by decom…
▽ More
A multi-grid multi-block-size vector quantization (MGBVQ) method is proposed for image coding in this work. The fundamental idea of image coding is to remove correlations among pixels before quantization and entropy coding, e.g., the discrete cosine transform (DCT) and intra predictions, adopted by modern image coding standards. We present a new method to remove pixel correlations. First, by decomposing correlations into long- and short-range correlations, we represent long-range correlations in coarser grids due to their smoothness, thus leading to a multi-grid (MG) coding architecture. Second, we show that short-range correlations can be effectively coded by a suite of vector quantizers (VQs). Along this line, we argue the effectiveness of VQs of very large block sizes and present a convenient way to implement them. It is shown by experimental results that MGBVQ offers excellent rate-distortion (RD) performance, which is comparable with existing image coders, at much lower complexity. Besides, it provides a progressive coded bitstream.
△ Less
Submitted 25 September, 2022;
originally announced September 2022.
-
MAGIC: Mask-Guided Image Synthesis by Inverting a Quasi-Robust Classifier
Authors:
Mozhdeh Rouhsedaghat,
Masoud Monajatipoor,
C. -C. Jay Kuo,
Iacopo Masi
Abstract:
We offer a method for one-shot mask-guided image synthesis that allows controlling manipulations of a single image by inverting a quasi-robust classifier equipped with strong regularizers. Our proposed method, entitled MAGIC, leverages structured gradients from a pre-trained quasi-robust classifier to better preserve the input semantics while preserving its classification accuracy, thereby guarant…
▽ More
We offer a method for one-shot mask-guided image synthesis that allows controlling manipulations of a single image by inverting a quasi-robust classifier equipped with strong regularizers. Our proposed method, entitled MAGIC, leverages structured gradients from a pre-trained quasi-robust classifier to better preserve the input semantics while preserving its classification accuracy, thereby guaranteeing credibility in the synthesis. Unlike current methods that use complex primitives to supervise the process or use attention maps as a weak supervisory signal, MAGIC aggregates gradients over the input, driven by a guide binary mask that enforces a strong, spatial prior. MAGIC implements a series of manipulations with a single framework achieving shape and location control, intense non-rigid shape deformations, and copy/move operations in the presence of repeating objects and gives users firm control over the synthesis by requiring to simply specify binary guide masks. Our study and findings are supported by various qualitative comparisons with the state-of-the-art on the same images sampled from ImageNet and quantitative analysis using machine perception along with a user survey of 100+ participants that endorse our synthesis quality. Project page at https://mozhdehrouhsedaghat.github.io/magic.html. Code is available at https://github.com/mozhdehrouhsedaghat/magic
△ Less
Submitted 30 June, 2023; v1 submitted 23 September, 2022;
originally announced September 2022.
-
Design of the ECCE Detector for the Electron Ion Collider
Authors:
J. K. Adkins,
Y. Akiba,
A. Albataineh,
M. Amaryan,
I. C. Arsene,
C. Ayerbe Gayoso,
J. Bae,
X. Bai,
M. D. Baker,
M. Bashkanov,
R. Bellwied,
F. Benmokhtar,
V. Berdnikov,
J. C. Bernauer,
F. Bock,
W. Boeglin,
M. Borysova,
E. Brash,
P. Brindza,
W. J. Briscoe,
M. Brooks,
S. Bueltmann,
M. H. S. Bukhari,
A. Bylinkin,
R. Capobianco
, et al. (259 additional authors not shown)
Abstract:
The EIC Comprehensive Chromodynamics Experiment (ECCE) detector has been designed to address the full scope of the proposed Electron Ion Collider (EIC) physics program as presented by the National Academy of Science and provide a deeper understanding of the quark-gluon structure of matter. To accomplish this, the ECCE detector offers nearly acceptance and energy coverage along with excellent track…
▽ More
The EIC Comprehensive Chromodynamics Experiment (ECCE) detector has been designed to address the full scope of the proposed Electron Ion Collider (EIC) physics program as presented by the National Academy of Science and provide a deeper understanding of the quark-gluon structure of matter. To accomplish this, the ECCE detector offers nearly acceptance and energy coverage along with excellent tracking and particle identification. The ECCE detector was designed to be built within the budget envelope set out by the EIC project while simultaneously managing cost and schedule risks. This detector concept has been selected to be the basis for the EIC project detector.
△ Less
Submitted 20 July, 2024; v1 submitted 6 September, 2022;
originally announced September 2022.
-
Detector Requirements and Simulation Results for the EIC Exclusive, Diffractive and Tagging Physics Program using the ECCE Detector Concept
Authors:
A. Bylinkin,
C. T. Dean,
S. Fegan,
D. Gangadharan,
K. Gates,
S. J. D. Kay,
I. Korover,
W. B. Li,
X. Li,
R. Montgomery,
D. Nguyen,
G. Penman,
J. R. Pybus,
N. Santiesteban,
R. Trotta,
A. Usman,
M. D. Baker,
J. Frantz,
D. I. Glazier,
D. W. Higinbotham,
T. Horn,
J. Huang,
G. Huber,
R. Reed,
J. Roche
, et al. (258 additional authors not shown)
Abstract:
This article presents a collection of simulation studies using the ECCE detector concept in the context of the EIC's exclusive, diffractive, and tagging physics program, which aims to further explore the rich quark-gluon structure of nucleons and nuclei. To successfully execute the program, ECCE proposed to utilize the detecter system close to the beamline to ensure exclusivity and tag ion beam/fr…
▽ More
This article presents a collection of simulation studies using the ECCE detector concept in the context of the EIC's exclusive, diffractive, and tagging physics program, which aims to further explore the rich quark-gluon structure of nucleons and nuclei. To successfully execute the program, ECCE proposed to utilize the detecter system close to the beamline to ensure exclusivity and tag ion beam/fragments for a particular reaction of interest. Preliminary studies confirmed the proposed technology and design satisfy the requirements. The projected physics impact results are based on the projected detector performance from the simulation at 10 or 100 fb^-1 of integrated luminosity. Additionally, a few insights on the potential 2nd Interaction Region can (IR) were also documented which could serve as a guidepost for the future development of a second EIC detector.
△ Less
Submitted 6 March, 2023; v1 submitted 30 August, 2022;
originally announced August 2022.