Full-Length Transcriptome Profile of Apis cerana Revealed by Nanopore Sequencing
Abstract
:1. Introduction
2. Results
2.1. Transcriptome Sequencing Using Oxford Nanopore Technology
2.2. Structure Analysis and Functional Annotation of Novel Transcripts
2.3. Unique Transcripts from Queen, Worker and Drone Data
2.4. AS Events
2.5. APA Sites
2.6. SSRs
2.7. LncRNAs
2.8. Differentially Expressed Transcripts (DETs) in the Reproductive Glands of A. cerana
3. Discussion
4. Materials and Methods
4.1. Sample Source
4.2. Library Construction and Sequencing
4.3. Analysis of the Raw Data
4.4. Analysis of Alternative Splicing
4.5. Identification and Analysis of Novel Transcripts
4.6. Identification of APA Sites and SSRs
4.7. Prediction of LncRNAs
4.8. Identification of Differentially Expressed Transcripts
4.9. RT-PCR
5. Conclusions
Supplementary Materials
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Cheng, S.L. Honey bee germplasm resources. In The Apicultural Science in China, 1st ed.; Liu, B.H., Ed.; Chinese Agricultural Press: Beijing, China, 2001; pp. 16–24. [Google Scholar]
- Park, D.; Jung, J.W.; Choi, B.S.; Jayakodi, M.; Lee, J.; Lim, J.; Yu, Y.; Choi, Y.S.; Lee, M.L.; Park, Y.; et al. Uncovering the novel characteristics of Asian honey bee, Apis cerana, by whole genome sequencing. BMC Genom. 2015, 16, 1. [Google Scholar] [CrossRef] [PubMed]
- Diao, Q.; Sun, L.; Zheng, H.; Zeng, Z.; Wang, S.; Xu, S.; Zheng, H.; Chen, Y.; Shi, Y.; Wang, Y.; et al. Genomic and transcriptomic analysis of the Asian honeybee Apis cerana provides novel insights into honeybee biology. Sci. Rep. 2018, 8, 822. [Google Scholar] [CrossRef] [PubMed]
- Wang, Z.L.; Zhu, Y.Q.; Yan, Q.; Yan, W.Y.; Zheng, H.J.; Zeng, Z.J. A Chromosome-Scale Assembly of the Asian Honeybee Apis cerana Genome. Front. Genet. 2020, 11, 279. [Google Scholar] [CrossRef]
- Wang, Z.L.; Liu, T.T.; Huang, Z.Y.; Wu, X.B.; Yan, W.Y.; Zeng, Z.J. Transcriptome analysis of the Asian honey bee Apis cerana cerana. PLoS ONE 2012, 7, e47954. [Google Scholar] [CrossRef]
- Kang, I.; Kim, W.; Lim, J.Y.; Lee, Y.; Shin, C. Organ-specific transcriptome analysis reveals differential gene expression in different castes under natural conditions in Apis cerana. Sci. Rep. 2021, 11, 11267. [Google Scholar] [CrossRef]
- Dong, Z.; Chen, Y. Transcriptomics: Advances and approaches. Sci. China Life Sci. 2013, 56, 960–967. [Google Scholar] [CrossRef] [PubMed]
- Keren, H.; Lev-Maor, G.; Ast, G. Alternative splicing and evolution: Diversification, exon definition and function. Nat. Rev. Genet. 2010, 11, 345–355. [Google Scholar] [CrossRef] [PubMed]
- Nilsen, T.W.; Graveley, B.R. Expansion of the eukaryotic proteome by alternative splicing. Nature 2010, 463, 457–463. [Google Scholar] [CrossRef]
- Pan, Q.; Shai, O.; Lee, L.J.; Frey, B.J.; Blencowe, B.J. Deep surveying of alternative splicing complexity in the human transcriptome by high-throughput sequencing. Nat. Genet. 2008, 40, 1413–1415. [Google Scholar] [CrossRef]
- Marquez, Y.; Brown, J.W.S.; Simpson, C.; Barta, A.; Kalyna, M. Transcriptome survey reveals increased complexity of the alternative splicing landscape in Arabidopsis. Genome Res. 2012, 22, 1184–1195. [Google Scholar] [CrossRef]
- Nygard, A.B.; Cirera, S.; Gilchrist, M.J.; Gorodkin, J.; Jørgensen, C.B.; Fredholm, M. A study of alternative splicing in the pig. BMC Res. Notes 2010, 3, 123. [Google Scholar] [CrossRef] [PubMed]
- Staiger, D.; Brown, J.W. Alternative splicing at the intersection of biological timing, development, and stress responses. Plant Cell. 2013, 25, 3640–3656. [Google Scholar] [CrossRef] [PubMed]
- Gamazon, E.R.; Stranger, B.E. Genomics of alternative splicing: Evolution, development and pathophysiology. Hum. Genet. 2014, 133, 679–687. [Google Scholar] [CrossRef]
- McCombie, W.R.; McPherson, J.D.; Mardis, E.R. Next-Generation Sequencing Technologies. Cold Spring Harb. Perspect. Med. 2019, 9, a036798. [Google Scholar] [CrossRef]
- Hu, T.; Chitnis, N.; Monos, D.; Dinh, A. Next-generation sequencing technologies: An overview. Hum. Immunol. 2021, 82, 801–811. [Google Scholar] [CrossRef]
- Rhoads, A.; Au, K.F. PacBio Sequencing and Its Applications. Genom. Proteom. Bioinf. 2015, 13, 278–289. [Google Scholar] [CrossRef]
- MacKenzie, M.; Argyropoulos, C. An Introduction to Nanopore Sequencing: Past, Present, and Future Considerations. Micromachines 2023, 14, 459. [Google Scholar] [CrossRef]
- Gibilisco, L.; Zhou, Q.; Mahajan, S.; Bachtrog, D. Alternative Splicing within and between Drosophila Species, Sexes, Tissues, and Developmental Stages. PLoS Genet. 2016, 12, e1006464. [Google Scholar] [CrossRef]
- Shao, W.; Zhao, Q.Y.; Wang, X.Y.; Xu, X.Y.; Tang, Q.; Li, M.; Li, X.; Xu, Y.Z. Alternative splicing and trans-splicing events revealed by analysis of the Bombyx mori transcriptome. RNA 2012, 18, 1395–1407. [Google Scholar] [CrossRef]
- Liu, K.; Su, Q.; Kang, K.; Chen, M.; Wang, W.X.; Zhang, W.Q.; Pang, R. Genome-wide Analysis of Alternative Gene Splicing Associated with Virulence in the Brown Planthopper Nilaparvata lugens (Hemiptera: Delphacidae). J. Econ. Entomol. 2021, 114, 2512–2523. [Google Scholar] [CrossRef]
- Zhao, Q.; Zhong, W.; He, W.; Li, Y.; Li, Y.; Li, T.; Vasseur, L.; You, M. Genome-wide profiling of the alternative splicing provides insights into development in Plutella xylostella. BMC Genom. 2019, 20, 463. [Google Scholar] [CrossRef] [PubMed]
- Zheng, S.Y.; Pan, L.X.; Cheng, F.P.; Jin, M.J.; Wang, Z.L. A Global Survey of the Full-Length Transcriptome of Apis mellifera by Single-Molecule Long-Read Sequencing. Int. J. Mol. Sci. 2023, 24, 5827. [Google Scholar] [CrossRef]
- Tian, B.; Manley, J.L. Alternative polyadenylation of mRNA precursors. Nat. Rev. Mol. Cell Biol. 2017, 18, 18–30. [Google Scholar] [CrossRef]
- Smibert, P.; Miura, P.; Westholm, J.O.; Shenker, S.; May, G.; Duff, M.O.; Zhang, D.; Eads, B.D.; Carlson, J.; Brown, J.B.; et al. Global patterns of tissue-specific alternative polyadenylation in Drosophila. Cell Rep. 2012, 1, 277–289. [Google Scholar] [CrossRef]
- Jan, C.H.; Friedman, R.C.; Ruby, J.G.; Bartel, D.P. Formation, regulation and evolution of Caenorhabditis elegans 3’UTRs. Nature 2011, 469, 97–101. [Google Scholar] [CrossRef] [PubMed]
- Ulitsky, I.; Shkumatava, A.; Jan, C.H.; Subtelny, A.O.; Koppstein, D.; Bell, G.W.; Sive, H.; Bartel, D.P. Extensive alternative polyadenylation during zebrafish development. Genome Res. 2012, 22, 2054–2066. [Google Scholar] [CrossRef]
- Quinn, J.J.; Chang, H.Y. Unique features of long non-coding RNA biogenesis and function. Nat. Rev. Genet. 2016, 17, 47–62. [Google Scholar] [CrossRef] [PubMed]
- Wu, Y.; Cheng, T.; Liu, C.; Liu, D.; Zhang, Q.; Long, R.; Zhao, P.; Xia, Q. Systematic Identification and Characterization of Long Non-Coding RNAs in the Silkworm, Bombyx mori. PLoS ONE 2016, 11, e0147147. [Google Scholar] [CrossRef]
- Li, W.J.; Song, Y.J.; Han, H.L.; Xu, H.Q.; Wei, D.; Smagghe, G.; Wang, J.J. Genome-wide analysis of long non-coding RNAs in adult tissues of the melon fly, Zeugodacus cucurbitae (Coquillett). BMC Genom. 2020, 21, 600. [Google Scholar] [CrossRef]
- Azlan, A.; Obeidat, S.M.; Das, K.T.; Yunus, M.A.; Azzam, G. Genome-wide identification of Aedes albopictus long noncoding RNAs and their association with dengue and Zika virus infection. PLoS Negl. Trop. Dis. 2021, 15, e0008351. [Google Scholar] [CrossRef]
- Meng, L.W.; Yuan, G.R.; Chen, M.L.; Dou, W.; Jing, T.X.; Zheng, L.S.; Peng, M.L.; Bai, W.J.; Wang, J.J. Genome-wide identification of long non-coding RNAs (lncRNAs) associated with malathion resistance in Bactrocera dorsalis. Pest Manag. Sci. 2021, 77, 2292–2301. [Google Scholar] [CrossRef] [PubMed]
- Oxford Nanopore Technologies Guppy v4.0.14. 2022. Available online: https://github.com/nanoporetech/pyguppyclient (accessed on 8 June 2022).
- Li, H. Minimap2: Pairwise alignment for nucleotide sequences. Bioinformatics 2018, 34, 3094–3100. [Google Scholar] [CrossRef] [PubMed]
- Foissac, S.; Sammeth, M. ASTALAVISTA: Dynamic and flexible analysis of alternative splicing events in custom gene datasets. Nucleic Acids Res. 2007, 35, W297–W299. [Google Scholar] [CrossRef]
- Pertea, G.; Pertea, M. GFF Utilities: GffRead and GffCompare. F1000Research 2020, 9, ISCB Comm J-304. [Google Scholar] [CrossRef]
- Haas, B.J.; Papanicolaou, A.; Yassour, M.; Grabherr, M.; Blood, P.D.; Bowden, J.; Couger, M.B.; Eccles, D.; Li, B.; Lieber, M.; et al. De novo transcript sequence reconstruction from RNA-seq using the Trinity platform for reference generation and analysis. Nat. Protoc. 2013, 8, 1494–1512. [Google Scholar] [CrossRef] [PubMed]
- Abdel-Ghany, S.E.; Hamilton, M.; Jacobi, J.L.; Ngam, P.; Devitt, N.; Schilkey, F.; Ben-Hur, A.; Reddy, A.S. A survey of the sorghum transcriptome using single-molecule long reads. Nat. Commun. 2016, 7, 11706. [Google Scholar] [CrossRef]
- Bailey, T.L. DREME: Motif discovery in transcription factor ChIP-seq data. Bioinformatics 2011, 27, 1653–1659. [Google Scholar] [CrossRef]
- Thiel, T.; Michalek, W.; Varshney, R.K.; Graner, A. Exploiting EST databases for the development and characterization of gene-derived SSR-markers in barley (Hordeum vulgare L.). Theor. Appl. Genet. 2003, 106, 411–422. [Google Scholar] [CrossRef]
- Kong, L.; Zhang, Y.; Ye, Z.Q.; Liu, X.Q.; Zhao, S.Q.; Wei, L.; Gao, G. CPC: Assess the protein-coding potential of transcripts using sequence features and support vector machine. Nucleic Acids Res. 2007, 35, W345–W349. [Google Scholar] [CrossRef]
- Sun, L.; Luo, H.; Bu, D.; Zhao, G.; Yu, K.; Zhang, C.; Liu, Y.; Chen, R.; Zhao, Y. Utilizing sequence intrinsic composition to classify protein-coding and long non-coding transcripts. Nucleic Acids Res. 2013, 41, e166. [Google Scholar] [CrossRef]
- Wang, L.; Park, H.J.; Dasari, S.; Wang, S.; Kocher, J.P.; Li, W. CPAT: Coding-Potential Assessment Tool using an alignment-free logistic regression model. Nucleic Acids Res. 2013, 41, e74. [Google Scholar] [CrossRef] [PubMed]
- Finn, R.D.; Bateman, A.; Clements, J.; Coggill, P.; Eberhardt, R.Y.; Eddy, S.R.; Heger, A.; Hetherington, K.; Holm, L.; Mistry, J.; et al. Pfam: The protein families database. Nucleic Acids Res. 2014, 42, D222–D230. [Google Scholar] [CrossRef] [PubMed]
Sample | Clean Reads | Base | Mean Length | Clean Reads (Except rRNA) | Flnc Reads | Flnc Ratio | Non-Redundant Flnc Reads |
---|---|---|---|---|---|---|---|
Queen | 48,440,954 | 59,474,666,129 | 1227 | 46,283,004 | 38,813,295 | 83.86% | 100,927 |
Worker | 29,142,322 | 38,028,781,394 | 1304 | 27,547,203 | 22,254,923 | 80.79% | 71,554 |
Drone | 47,319,616 | 56,083,323,832 | 1185 | 45,012,392 | 37,986,725 | 84.39% | 105,584 |
Database | Transcript |
---|---|
Total | 160,811 (100.00%) |
NR | 130,198 (80.96%) |
COG | 43,349 (26.96%) |
KOG | 92,473 (57.50%) |
Pfam | 4526 (2.81%) |
Swiss-Prot | 84,235 (52.38%) |
eggNOG | 120,122 (74.70%) |
GO | 90,872 (56.51%) |
KEGG | 82,740 (51.45%) |
Unannotated | 30,444 (18.93%) |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Hu, X.-F.; Jin, M.-J.; Gong, Z.-X.; Lin, Z.-L.; Zhang, L.-Z.; Zeng, Z.-J.; Wang, Z.-L. Full-Length Transcriptome Profile of Apis cerana Revealed by Nanopore Sequencing. Int. J. Mol. Sci. 2024, 25, 10833. https://doi.org/10.3390/ijms251910833
Hu X-F, Jin M-J, Gong Z-X, Lin Z-L, Zhang L-Z, Zeng Z-J, Wang Z-L. Full-Length Transcriptome Profile of Apis cerana Revealed by Nanopore Sequencing. International Journal of Molecular Sciences. 2024; 25(19):10833. https://doi.org/10.3390/ijms251910833
Chicago/Turabian StyleHu, Xiao-Fen, Meng-Jie Jin, Zhi-Xian Gong, Zong-Liang Lin, Li-Zhen Zhang, Zhi-Jiang Zeng, and Zi-Long Wang. 2024. "Full-Length Transcriptome Profile of Apis cerana Revealed by Nanopore Sequencing" International Journal of Molecular Sciences 25, no. 19: 10833. https://doi.org/10.3390/ijms251910833