introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
62 rows where transcript_id = 720738
This data as json, CSV (advanced)
Suggested facets: length, phase
id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
3821374 | GT-AG | 0 | 1.000000099473604e-05 | 56 | rna-gnl|I4U23|002721-T1 720738 | 1 | 8591783 | 8591838 | Adineta vaga 104782 | AAA|GTAATATCAT...CTATTTTCAACA/TCTATTTTCAAC...TTTAG|TTT | 0 | 1 | 2.501 |
3821375 | GT-AG | 0 | 0.0003406420474217 | 48 | rna-gnl|I4U23|002721-T1 720738 | 2 | 8591676 | 8591723 | Adineta vaga 104782 | AAG|GTAAATTTTG...GATCTTTTGATT/TTTGATTTGATT...TCAAG|ATC | 2 | 1 | 3.079 |
3821376 | GT-AG | 0 | 6.729636779129117e-05 | 61 | rna-gnl|I4U23|002721-T1 720738 | 3 | 8591534 | 8591594 | Adineta vaga 104782 | TAT|GTAATTAAAA...TTCTTTTTATCA/CTTTTTATCATT...TTAAG|ATT | 2 | 1 | 3.874 |
3821377 | GT-AG | 0 | 1.000000099473604e-05 | 56 | rna-gnl|I4U23|002721-T1 720738 | 4 | 8591310 | 8591365 | Adineta vaga 104782 | TCG|GTTGTTCATG...TTGTCATTACCG/ATGATTGTCATT...TCAAG|AAT | 2 | 1 | 5.521 |
3821378 | GT-AG | 0 | 0.0007233039002282 | 60 | rna-gnl|I4U23|002721-T1 720738 | 5 | 8591216 | 8591275 | Adineta vaga 104782 | GTC|GTAAATCTTT...CTTTTTTTATCG/TCTTTTTTTATC...TTTAG|GAT | 0 | 1 | 5.855 |
3821379 | GT-AG | 0 | 1.000000099473604e-05 | 56 | rna-gnl|I4U23|002721-T1 720738 | 6 | 8590986 | 8591041 | Adineta vaga 104782 | CAA|GTAAATAAAA...ACGTTTCTAATG/ACGTTTCTAATG...TTCAG|CAA | 0 | 1 | 7.561 |
3821380 | GT-AG | 0 | 1.8986885838132737e-05 | 55 | rna-gnl|I4U23|002721-T1 720738 | 7 | 8590854 | 8590908 | Adineta vaga 104782 | TCG|GTAAATATTC...ACTGTGTTATTT/AACTGTGTTATT...TTCAG|CGC | 2 | 1 | 8.316 |
3821381 | GT-AG | 0 | 1.000000099473604e-05 | 54 | rna-gnl|I4U23|002721-T1 720738 | 8 | 8590388 | 8590441 | Adineta vaga 104782 | GCT|GTAAGAAAGA...ATTTTCAAAACA/TCGATTTTCAAA...TTTAG|CTT | 0 | 1 | 12.357 |
3821382 | GT-AG | 0 | 6.453436110164381e-05 | 51 | rna-gnl|I4U23|002721-T1 720738 | 9 | 8590076 | 8590126 | Adineta vaga 104782 | GGA|GTAAGATTTT...TTTGTCATGATT/TTTGTCATGATT...TATAG|ATG | 0 | 1 | 14.916 |
3821383 | GT-AG | 0 | 1.000000099473604e-05 | 54 | rna-gnl|I4U23|002721-T1 720738 | 10 | 8589891 | 8589944 | Adineta vaga 104782 | AGC|GTGAGTAATA...TATTCATTGATT/TTGTTATTCATT...TTTAG|ACA | 2 | 1 | 16.201 |
3821384 | GT-AG | 0 | 1.000000099473604e-05 | 62 | rna-gnl|I4U23|002721-T1 720738 | 11 | 8589673 | 8589734 | Adineta vaga 104782 | GAA|GTAAATGAAT...TTTTTCATATTT/TTTTTTTTCATA...TTTAG|GAA | 2 | 1 | 17.731 |
3821385 | GT-AG | 0 | 0.0001281474859372 | 80 | rna-gnl|I4U23|002721-T1 720738 | 12 | 8589480 | 8589559 | Adineta vaga 104782 | TAG|GTATAAAGTT...ATCTCTTTATTG/TATCTCTTTATT...TTTAG|CCT | 1 | 1 | 18.839 |
3821386 | GT-AG | 0 | 1.000000099473604e-05 | 42 | rna-gnl|I4U23|002721-T1 720738 | 13 | 8589341 | 8589382 | Adineta vaga 104782 | TCG|GTAAGATGAA...AAAATTTTACTC/ATTTTACTCATC...TTCAG|TCT | 2 | 1 | 19.79 |
3821387 | GT-AG | 0 | 1.000000099473604e-05 | 49 | rna-gnl|I4U23|002721-T1 720738 | 14 | 8589190 | 8589238 | Adineta vaga 104782 | AAG|GTTAGATCAA...GCACTATTAAAT/ATATCAATGATT...TTTAG|TAA | 2 | 1 | 20.79 |
3821388 | GT-AG | 0 | 0.001395161181359 | 56 | rna-gnl|I4U23|002721-T1 720738 | 15 | 8588972 | 8589027 | Adineta vaga 104782 | GCC|GTAAGTTTAT...AATACTTTACCT/CAATACTTTACC...TTTAG|ACG | 2 | 1 | 22.379 |
3821389 | GT-AG | 0 | 0.0011964442227719 | 61 | rna-gnl|I4U23|002721-T1 720738 | 16 | 8588760 | 8588820 | Adineta vaga 104782 | TTG|GTTTGTTTGA...TCTGTTTTATTT/TTCTGTTTTATT...TATAG|ATT | 0 | 1 | 23.86 |
3821390 | GT-AG | 0 | 7.055972107008594e-05 | 54 | rna-gnl|I4U23|002721-T1 720738 | 17 | 8588606 | 8588659 | Adineta vaga 104782 | AAG|GTAAATTTTA...TTTCTATTGATC/TTTCTATTGATC...TGTAG|AAA | 1 | 1 | 24.841 |
3821391 | GT-AG | 0 | 0.0003701312631867 | 45 | rna-gnl|I4U23|002721-T1 720738 | 18 | 8588283 | 8588327 | Adineta vaga 104782 | AAA|GTAAGTTTCT...TTTGTTTTATTT/TATTTTCTCAAA...TTTAG|ACG | 0 | 1 | 27.567 |
3821392 | GT-AG | 0 | 0.0061299988954157 | 50 | rna-gnl|I4U23|002721-T1 720738 | 19 | 8588068 | 8588117 | Adineta vaga 104782 | GAA|GTAAATTTCT...TTGTTTTTAATT/TTGTTTTTAATT...CTTAG|ACA | 0 | 1 | 29.185 |
3821393 | GT-AG | 0 | 2.5751919555913373e-05 | 59 | rna-gnl|I4U23|002721-T1 720738 | 20 | 8587730 | 8587788 | Adineta vaga 104782 | AAA|GTAAATGTTT...CTTTTCTTTGTA/TTTCTTTGTATT...TTAAG|GTT | 0 | 1 | 31.921 |
3821394 | GT-AG | 0 | 1.000000099473604e-05 | 50 | rna-gnl|I4U23|002721-T1 720738 | 21 | 8587531 | 8587580 | Adineta vaga 104782 | GCG|GTAAGAATAA...TGTTTTTTATCT/TTGTTTTTTATC...CAAAG|AGC | 2 | 1 | 33.382 |
3821395 | GT-AG | 0 | 1.000000099473604e-05 | 65 | rna-gnl|I4U23|002721-T1 720738 | 22 | 8587289 | 8587353 | Adineta vaga 104782 | AAG|GTAAGAATGA...CAATGCTTAATT/CTTAATTTCATG...TATAG|ATC | 2 | 1 | 35.118 |
3821396 | GT-AG | 0 | 1.000000099473604e-05 | 61 | rna-gnl|I4U23|002721-T1 720738 | 23 | 8587092 | 8587152 | Adineta vaga 104782 | GAA|GTAAGGATAA...TGTTCGTTGAAT/TGTTCGTTGAAT...TTCAG|ATT | 0 | 1 | 36.452 |
3821397 | GT-AG | 0 | 1.4631001141890616e-05 | 162 | rna-gnl|I4U23|002721-T1 720738 | 24 | 8586821 | 8586982 | Adineta vaga 104782 | AAA|GTAAATATAC...TTTGATTTAATA/ATATATTTGATT...TATAG|ATT | 1 | 1 | 37.521 |
3821398 | GT-AG | 0 | 9.217016969520328e-05 | 51 | rna-gnl|I4U23|002721-T1 720738 | 25 | 8586155 | 8586205 | Adineta vaga 104782 | GAG|GTTTGTTTTG...AAACATTTATCA/AAAACATTTATC...TTTAG|AAT | 1 | 1 | 43.552 |
3821399 | GT-AG | 0 | 0.0487455787150767 | 58 | rna-gnl|I4U23|002721-T1 720738 | 26 | 8585940 | 8585997 | Adineta vaga 104782 | ACC|GTATATATTT...TAATATTTATTC/TATTTATTCATT...CTTAG|ATT | 2 | 1 | 45.092 |
3821400 | GT-AG | 0 | 0.0012529560099649 | 77 | rna-gnl|I4U23|002721-T1 720738 | 27 | 8585814 | 8585890 | Adineta vaga 104782 | CCG|GTATGTTAAT...TTTACATTGATT/TTTACATTGATT...TTTAG|GAA | 0 | 1 | 45.572 |
3821401 | GT-AG | 0 | 0.0584049458639151 | 65 | rna-gnl|I4U23|002721-T1 720738 | 28 | 8585668 | 8585732 | Adineta vaga 104782 | CTG|GTATATTTTA...TTCTTCTTCTCA/CTTCTTCTCAAA...CTTAG|ACT | 0 | 1 | 46.367 |
3821402 | GT-AG | 0 | 1.000000099473604e-05 | 67 | rna-gnl|I4U23|002721-T1 720738 | 29 | 8584741 | 8584807 | Adineta vaga 104782 | TCG|GTAAGTTCGA...TAAATTTCAATT/ATAAATTTCAAT...TTTAG|TCG | 2 | 1 | 54.8 |
3821403 | GT-AG | 0 | 0.532321843341135 | 59 | rna-gnl|I4U23|002721-T1 720738 | 30 | 8584570 | 8584628 | Adineta vaga 104782 | TCT|GTATGTTTCA...TTGTTCTTTCTA/AAAAATTCCATT...TTCAG|TTT | 0 | 1 | 55.899 |
3821404 | GT-AG | 0 | 3.857673427827557e-05 | 60 | rna-gnl|I4U23|002721-T1 720738 | 31 | 8584339 | 8584398 | Adineta vaga 104782 | CAA|GTAAAATTTG...AATCTCTTTTCT/GTCTTTCTAAAG...ATTAG|CTT | 0 | 1 | 57.576 |
3821405 | GT-AG | 0 | 1.000000099473604e-05 | 52 | rna-gnl|I4U23|002721-T1 720738 | 32 | 8584164 | 8584215 | Adineta vaga 104782 | CAG|GTTCGATAGT...TGTACCTCAATA/ATGTACCTCAAT...TTTAG|TTT | 0 | 1 | 58.782 |
3821406 | GT-AG | 0 | 0.00107014421894 | 53 | rna-gnl|I4U23|002721-T1 720738 | 33 | 8583760 | 8583812 | Adineta vaga 104782 | CAA|GTATGTATAG...AAAATCTAAATA/AATAGATTCAAC...TTCAG|AAT | 0 | 1 | 62.224 |
3821407 | GT-AG | 0 | 1.000000099473604e-05 | 56 | rna-gnl|I4U23|002721-T1 720738 | 34 | 8583617 | 8583672 | Adineta vaga 104782 | AAA|GTAATCGAAT...AAAGTTTTATTA/TTATTATTGATA...TATAG|GGT | 0 | 1 | 63.077 |
3821408 | GT-AG | 0 | 1.000000099473604e-05 | 53 | rna-gnl|I4U23|002721-T1 720738 | 35 | 8583463 | 8583515 | Adineta vaga 104782 | ACG|GTTAGTGTTT...GGATTTTTGTTT/GAAATATTCATG...TTTAG|AGA | 2 | 1 | 64.068 |
3821409 | GT-AG | 0 | 0.0002149022968845 | 57 | rna-gnl|I4U23|002721-T1 720738 | 36 | 8583346 | 8583402 | Adineta vaga 104782 | TAG|GTATGTATAC...ATAGATTTAATA/ATAGATTTAATA...TTTAG|TCG | 2 | 1 | 64.656 |
3821410 | GT-AG | 0 | 1.000000099473604e-05 | 54 | rna-gnl|I4U23|002721-T1 720738 | 37 | 8583198 | 8583251 | Adineta vaga 104782 | CAA|GTAAAAACAA...TGATATTTAATG/TTGATATTTAAT...TCTAG|ATT | 0 | 1 | 65.578 |
3821411 | GT-AG | 0 | 1.000000099473604e-05 | 55 | rna-gnl|I4U23|002721-T1 720738 | 38 | 8582757 | 8582811 | Adineta vaga 104782 | TGG|GTAAAAACGA...ATCTTCTTTGTA/TATGAATTAATT...TTTAG|TGA | 2 | 1 | 69.364 |
3821412 | GT-AG | 0 | 1.000000099473604e-05 | 55 | rna-gnl|I4U23|002721-T1 720738 | 39 | 8582522 | 8582576 | Adineta vaga 104782 | TAG|GTTCGAATTC...TTTCTTCTAATG/TTTCTTCTAATG...TTTAG|ACG | 2 | 1 | 71.129 |
3821413 | GT-AG | 0 | 1.000000099473604e-05 | 50 | rna-gnl|I4U23|002721-T1 720738 | 40 | 8582372 | 8582421 | Adineta vaga 104782 | GAA|GTAAGTGAAT...GTTCGATTATCG/TGTTCGATTATC...TGTAG|GAA | 0 | 1 | 72.109 |
3821414 | GT-AG | 0 | 1.888501275621712e-05 | 46 | rna-gnl|I4U23|002721-T1 720738 | 41 | 8582179 | 8582224 | Adineta vaga 104782 | CAA|GTATGAAAAT...AATTGTTTAATG/AATTGTTTAATG...TTTAG|AGC | 0 | 1 | 73.551 |
3821415 | GT-AG | 0 | 0.0232058047955902 | 59 | rna-gnl|I4U23|002721-T1 720738 | 42 | 8581926 | 8581984 | Adineta vaga 104782 | AAC|GTAACGTTAC...CATATCTTGATT/CATATCTTGATT...TGTAG|ATG | 2 | 1 | 75.454 |
3821416 | GT-AG | 0 | 0.0009793919893707 | 53 | rna-gnl|I4U23|002721-T1 720738 | 43 | 8581728 | 8581780 | Adineta vaga 104782 | GAA|GTAGATTAGA...GTTTTTTTGAAA/GCATTTCTCATA...TCTAG|ATA | 0 | 1 | 76.876 |
3821417 | GT-AG | 0 | 1.000000099473604e-05 | 46 | rna-gnl|I4U23|002721-T1 720738 | 44 | 8581600 | 8581645 | Adineta vaga 104782 | AAA|GTAAAGATTA...AATTCAATATCA/ATTCAATTCAAT...TTTAG|ATT | 1 | 1 | 77.68 |
3821418 | GT-AG | 0 | 1.000000099473604e-05 | 62 | rna-gnl|I4U23|002721-T1 720738 | 45 | 8581323 | 8581384 | Adineta vaga 104782 | ATA|GTAAGTAAAA...TCTTCTTTGTTT/TTTAAAATGATA...TTCAG|TTG | 0 | 1 | 79.788 |
3821419 | GT-AG | 0 | 1.000000099473604e-05 | 112 | rna-gnl|I4U23|002721-T1 720738 | 46 | 8581136 | 8581247 | Adineta vaga 104782 | AAG|GTTAGTACTC...AGAATCTTATTA/AAGAATCTTATT...AATAG|GAT | 0 | 1 | 80.524 |
3821420 | GT-AG | 0 | 0.0001771768833178 | 52 | rna-gnl|I4U23|002721-T1 720738 | 47 | 8580946 | 8580997 | Adineta vaga 104782 | AAC|GTAGGTTAAT...TTTTTCTCAACT/ATTTTTCTCAAC...AATAG|CTA | 0 | 1 | 81.877 |
3821421 | GT-AG | 0 | 6.267200718187251e-05 | 53 | rna-gnl|I4U23|002721-T1 720738 | 48 | 8580826 | 8580878 | Adineta vaga 104782 | TAT|GTATGACAAA...CATCCGTTAATT/TTAATTCTCATT...TTTAG|CAT | 1 | 1 | 82.534 |
3821422 | GT-AG | 0 | 2.3682466278888677e-05 | 51 | rna-gnl|I4U23|002721-T1 720738 | 49 | 8580629 | 8580679 | Adineta vaga 104782 | CAG|GTCTTGTTAG...ATATCATTGATC/ATATCATTGATC...TGTAG|ACA | 0 | 1 | 83.966 |
3821423 | GT-AG | 0 | 1.000000099473604e-05 | 57 | rna-gnl|I4U23|002721-T1 720738 | 50 | 8580454 | 8580510 | Adineta vaga 104782 | ATA|GTAAAAGATA...TTTTTCTTCATT/TTTTTCTTCATT...GATAG|ATT | 1 | 1 | 85.123 |
3821424 | GT-AG | 0 | 1.000000099473604e-05 | 56 | rna-gnl|I4U23|002721-T1 720738 | 51 | 8580318 | 8580373 | Adineta vaga 104782 | TAT|GTAAGTGAAT...TCTTTCTTATTA/TTCTTTCTTATT...CATAG|CTT | 0 | 1 | 85.908 |
3821425 | GT-AG | 0 | 1.000000099473604e-05 | 56 | rna-gnl|I4U23|002721-T1 720738 | 52 | 8580047 | 8580102 | Adineta vaga 104782 | TGA|GTAAGAACAA...TTTTCTCTAATA/TTTTCTCTAATA...CTTAG|ACA | 2 | 1 | 88.016 |
3821426 | GT-AG | 0 | 1.000000099473604e-05 | 56 | rna-gnl|I4U23|002721-T1 720738 | 53 | 8579843 | 8579898 | Adineta vaga 104782 | GCG|GTAAGATATA...CTATTTTTATTC/GCTATTTTTATT...AACAG|GAT | 0 | 1 | 89.467 |
3821427 | GT-AG | 0 | 0.0009223372312396 | 52 | rna-gnl|I4U23|002721-T1 720738 | 54 | 8579614 | 8579665 | Adineta vaga 104782 | GAT|GTATGTAAAA...TTCTCATTATTT/TTTCTTCTCATT...TTTAG|AAT | 0 | 1 | 91.203 |
3821428 | GT-AG | 0 | 0.0001069051590363 | 78 | rna-gnl|I4U23|002721-T1 720738 | 55 | 8579411 | 8579488 | Adineta vaga 104782 | AAA|GTAAACAATA...ATATTGTTAAAT/ATATTGTTAAAT...TACAG|AGG | 2 | 1 | 92.429 |
3821429 | GT-AG | 0 | 0.0109436847701824 | 64 | rna-gnl|I4U23|002721-T1 720738 | 56 | 8579268 | 8579331 | Adineta vaga 104782 | GCA|GTTTGTTTTG...ATCTTTTTGTTT/CAATAATTGAGA...TTTAG|CCA | 0 | 1 | 93.204 |
3821430 | GT-AG | 0 | 0.0013406367675324 | 103 | rna-gnl|I4U23|002721-T1 720738 | 57 | 8579039 | 8579141 | Adineta vaga 104782 | CGC|GTACGTATCT...AAATTTTTACTT/TAAATTTTTACT...CTTAG|GAC | 0 | 1 | 94.44 |
3821431 | GT-AG | 0 | 0.053734033815569 | 53 | rna-gnl|I4U23|002721-T1 720738 | 58 | 8578887 | 8578939 | Adineta vaga 104782 | AAA|GTATGTTTCA...TTTTCTTCAATA/TATTTATTTATT...GTTAG|ACC | 0 | 1 | 95.41 |
3821432 | GT-AG | 0 | 1.000000099473604e-05 | 59 | rna-gnl|I4U23|002721-T1 720738 | 59 | 8578664 | 8578722 | Adineta vaga 104782 | TCA|GTAATAAGGA...TTTTTCTTCTTT/TAGTATTTAAAT...TCTAG|AGT | 2 | 1 | 97.019 |
3821433 | GT-AG | 0 | 1.000000099473604e-05 | 52 | rna-gnl|I4U23|002721-T1 720738 | 60 | 8578568 | 8578619 | Adineta vaga 104782 | ATG|GTAAGTATAT...TCGAGTTTGATC/TCGAGTTTGATC...AATAG|ACA | 1 | 1 | 97.45 |
3821434 | GT-AG | 0 | 0.0003144562993439 | 56 | rna-gnl|I4U23|002721-T1 720738 | 61 | 8578437 | 8578492 | Adineta vaga 104782 | AAG|GTTTGTTTCG...TATTCTTCAATT/ATATTCTTCAAT...TTTAG|TTA | 1 | 1 | 98.186 |
3821435 | GT-AG | 0 | 0.0001714680864414 | 52 | rna-gnl|I4U23|002721-T1 720738 | 62 | 8578278 | 8578329 | Adineta vaga 104782 | CAA|GTAAATTTCG...AAATTCTCAATC/GAAATTCTCAAT...TCTAG|GTT | 0 | 1 | 99.235 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" ( "id" INTEGER, "dinucleotide_pair" TEXT, "is_minor" INTEGER, "score" REAL, "length" INTEGER, "transcript_id" INTEGER, "ordinal_index" INTEGER, "start" INTEGER, "end" INTEGER, "taxonomy_id" INTEGER, "scored_motifs" TEXT, "phase" INTEGER, "in_cds" INTEGER, "relative_position" REAL ,PRIMARY KEY ([id]), FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]), FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id]) ); CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]); CREATE INDEX [idx_introns_taxonomy_id] ON [introns] ([taxonomy_id]); CREATE INDEX [idx_introns_phase] ON [introns] ([phase]); CREATE INDEX [idx_introns_is_minor] ON [introns] ([is_minor]); CREATE INDEX [idx_introns_dinucleotide_pair] ON [introns] ([dinucleotide_pair]); CREATE INDEX [idx_introns_score] ON [introns] ([score]); CREATE INDEX [idx_introns_in_cds] ON [introns] ([in_cds]);