To investigate the interaction characteristics between the first intron and other introns and interpret the important information of the interaction contained in the circular matching feature between introns, the mitochondrial ribosomal protein gene sequences of five species, including Homo sapiens, Mus musculus, Fugu rubripes, Drosophila melanogaster and Caenorhabditis elegans, were used as samples to study the circular matching features between the first intron and other introns. Firstly, the Smith-Waterman algorithm was used to obtain the optimal matched segments of the reverse complementary sequences between the first intron and other introns of each gene through local alignment, and then these fragments were classified into different types based on the 8-mer CG motif. After standardization of sequence length of all first intron sequences in the genome, the relative position distribution of the optimal matched segments and CG pairs in different types of fragments on the first intron were calculated, and their species differences were analyzed. The results showed that the relative position distribution of the optimal matched segments did not present species differences, while the relative position distribution of CG pairs in different types of fragments presented species differences. Moreover, the most probable positions of CG pairs in the 1CG type fragments exhibited a directed evolutionary trend with species evolution. The result suggested that the optimal matched segments might be some kinds of organized and functional elements, and the CG motif reflected important characteristics of these functional elements.
PALMITERR D, SANDGRENE P, AVARBOCKM R, et al. Heterologous introns can enhance expression of transgenes in mice[J]. Proceedings of the National Academy of Sciences of the United States of America, 1991, 88(2): 478-482.
[3]
LIN Q, YANGJ, CUIL, et al. Expression of intronic miRNAs and their host gene Igf2 in a murine unilateral ureteral obstruction model[J]. Brazilian Journal of Medical and Biological Research, 2015, 48(6): 486-492.
[4]
ABOUA M, CELLIL, BELOTTIG, et al. GC-AG Introns features in long non-coding and protein-coding genes suggest their role in gene expression regulation[J]. Frontiers in Genetics, 2020 (11): 488.
[5]
BOS L, LIH, ZHANGQ, et al. Potential relations between post-spliced introns and mature mRNAs in the caenorhabditis elegans genome[J]. Journal of Theoretical Biology, 2019,467: 7-14.
[6]
MATTICKJ S, GAGENM J. The evolution of controlled multitasked gene networks: The role of introns and other noncoding RNAs in the development of complex organisms[J]. Molecular Biology and Evolution, 2001, 18(9): 1611-1630.
[7]
HANZ P, CHENH F, GUOZ H, et al. Circular RNAs and their role in exosomes[J]. Frontiers in Oncology, 2022 (12): 848341.
[8]
HOUS Q, LIG, XUB B, et al. Trans-splicing facilitated by RNA pairing greatly expands sDscam isoform diversity but not homophilic binding specificity[J]. Science Advances, 2022, 8(27): eabn9458.
[9]
ZHANGY, ZHANGX O, CHENT, et al. Circular intronic long noncoding RNAs[J]. Molecular Cell, 2013, 51(6): 792-806.
JIAOS H, WUS, HUANGS, et al. Advances in the identification of circular RNAs and research into circRNAs in human diseases[J]. Frontiers in Genetics, 2021 (12): 665233.
[12]
FUL Z, CRAWFORDL, TONGA, et al. Sperm associated antigen 7 is activated by T3 during xenopus tropicalis metamorphosis via a thyroid hormone response element within the first intron[J]. Development Growth & Differentiation, 2022, 64(1): 48-58.
[13]
SINGHO P, MISHRAS, SHARMAG, et al. Evaluation of intron-1 of odorant-binding protein-1 of anopheles stephensi as a marker for the identification of biological forms or putative sibling specie[J]. PLoS One, 2022, 17(7): e0270760.
[14]
SPIJKERH M V, STACKPOLEE E, ALMEIDAS, et al. Ribosome profiling reveals novel regulation of C9ORF72 GGGGCC repeat-containing RNA translation[J]. RNA, 2022, 28(2): 123-138.
[15]
VOSSEBERGJ, SCHINKELM, GREMMENS, et al. The spread of the first introns in proto-eukaryotic paralogs[J]. Communication Biology, 2022, 5(1): 476.
[16]
CSUROSM, NOE L, KUCHEROVG. Reconsidering the significance of genomic word frequencies[J]. Trends in Genetics, 2007, 23(11): 543-546.
[17]
TULLERT, CHORB, NELSONN. Forbidden penta-peptides[J]. Protein Science, 2007, 16(10): 2251-2259.
[18]
SUBIRANAJ A, MESSEGUERX. The most frequent short sequences in non-coding DNA[J]. Nucleic Acids Research, 2010, 38(4): 1172-1181.
[19]
HARIHARANR, SIMONR, PILLAIM R, et al. Comparative analysis of DNA word abundances in four yeast genomes using a novel statistical background model[J]. PLoS One, 2013, 8(3): e58038.
[20]
YUH J. Segmented K-mer and its application on similarity analysis of mitochondrial genome sequences[J]. Gene, 2013, 518(2): 419-424.
[21]
BINAM, WYSSP, LAZARUSS A, et al. Discovering sequences with potential regulatory characteristics[J]. Genomics, 2009, 93(4): 314-322.
[22]
BINAM, WYSSP, RENW H, et al. Exploring the characteristics of sequence elements in proximal promoters of human genes[J]. Genomics, 2004, 84(6): 929-940.
YANGZ H, LIH, JIAY, et al. Intrinsic laws of k-mer spectra of genome sequences and evolution mechanism of genomes[J]. BMC Evolutionary Biology, 2020, 20(1): 157.
[25]
JIAY, LIH, WANGJ F, et al. Spectrum structures and biological functions of 8-mers in the human genome[J]. Genomics, 2019, 111(3): 483-491.
[26]
ZHANGQ, LIH, ZHAOX Q, et al. The evolution mechanism of intron length[J]. Genomics, 2016, 108(2): 47-55.
[27]
CHORB, HORND, GOLDMANN, et al. Genomic DNA k-mer spectra: Models and modalities [J]. Genome Biology, 2009, 10(10): R108.
[28]
HEX M, TILLOD, VIERSTRAJ, et al. Methylated cytosines mutate to transcription factor binding sites that drive tetrapod evolution[J]. Genome Biology and Evolution, 2015, 7(11): 3155-3169.