introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
62 rows where transcript_id = 14424018
This data as json, CSV (advanced)
Suggested facets: phase
| id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 77101226 | GT-AG | 0 | 1.000000099473604e-05 | 557 | rna-XM_024148542.1 14424018 | 1 | 10215919 | 10216475 | Eutrema salsugineum 72664 | AAG|GTCAGATGGT...TTGCCTTTAGTT/TAGTTTTTTATT...CGCAG|GAG | 1 | 1 | 0.769 |
| 77101227 | GT-AG | 0 | 4.21277466194467e-05 | 285 | rna-XM_024148542.1 14424018 | 2 | 10216580 | 10216864 | Eutrema salsugineum 72664 | AAG|GTTTGTTGTG...TGTTTCATAATC/ATATGTTTCATA...TTTAG|GTA | 0 | 1 | 1.594 |
| 77101228 | GT-AG | 0 | 1.000000099473604e-05 | 82 | rna-XM_024148542.1 14424018 | 3 | 10217000 | 10217081 | Eutrema salsugineum 72664 | GTG|GTGAGATAGG...TTTTCCATGATC/CATGATCTGATT...TACAG|GAA | 0 | 1 | 2.664 |
| 77101229 | GT-AG | 0 | 1.000000099473604e-05 | 88 | rna-XM_024148542.1 14424018 | 4 | 10217133 | 10217220 | Eutrema salsugineum 72664 | ATG|GTGTGGCTAG...ATTTTCTTATTC/TATTTTCTTATT...CAAAG|AAC | 0 | 1 | 3.069 |
| 77101230 | GT-AG | 0 | 4.662113275140457e-05 | 84 | rna-XM_024148542.1 14424018 | 5 | 10217316 | 10217399 | Eutrema salsugineum 72664 | GAG|GTAAATTAAT...TTGTTTTTGATT/TTGTTTTTGATT...TGCAG|CAA | 2 | 1 | 3.822 |
| 77101231 | GT-AG | 0 | 1.3076732119630316e-05 | 400 | rna-XM_024148542.1 14424018 | 6 | 10217518 | 10217917 | Eutrema salsugineum 72664 | AAG|GTACAGATTT...TTATCTTTGTTT/GCTATAATTATC...TGCAG|TCT | 0 | 1 | 4.757 |
| 77101232 | GT-AG | 0 | 2.4645371322533247e-05 | 98 | rna-XM_024148542.1 14424018 | 7 | 10218020 | 10218117 | Eutrema salsugineum 72664 | CAG|GTATTGGCAA...TGTCTCTTACCC/TTGTCTCTTACC...TCTAG|ATT | 0 | 1 | 5.566 |
| 77101233 | GT-AG | 0 | 1.000000099473604e-05 | 86 | rna-XM_024148542.1 14424018 | 8 | 10218307 | 10218392 | Eutrema salsugineum 72664 | AAG|GTTAGCAAAA...GTGGTCTTGATT/GTGGTCTTGATT...CTCAG|GGA | 0 | 1 | 7.065 |
| 77101234 | GT-AG | 0 | 0.018944015581515 | 146 | rna-XM_024148542.1 14424018 | 9 | 10218554 | 10218699 | Eutrema salsugineum 72664 | AAG|GTATCTAAAT...AGATCGTTATTA/GTTTTGTTTACA...GACAG|CGG | 2 | 1 | 8.341 |
| 77101235 | GT-AG | 0 | 1.000000099473604e-05 | 264 | rna-XM_024148542.1 14424018 | 10 | 10218871 | 10219134 | Eutrema salsugineum 72664 | GAG|GTAAGATACT...ACATGTTTAATT/ACATGTTTAATT...AATAG|GAT | 2 | 1 | 9.697 |
| 77101236 | GT-AG | 0 | 1.000000099473604e-05 | 82 | rna-XM_024148542.1 14424018 | 11 | 10219228 | 10219309 | Eutrema salsugineum 72664 | AGG|GTGAATGCTC...AACTCCTTTTCT/CCTTTTCTCAAA...TTCAG|GAA | 2 | 1 | 10.435 |
| 77101237 | GT-AG | 0 | 2.652932378144564e-05 | 86 | rna-XM_024148542.1 14424018 | 12 | 10219632 | 10219717 | Eutrema salsugineum 72664 | GAG|GTTTGTTACG...ACTCTCATAATA/TAAACTCTCATA...TGCAG|AGT | 0 | 1 | 12.988 |
| 77101238 | GT-AG | 0 | 1.3804995009993414e-05 | 80 | rna-XM_024148542.1 14424018 | 13 | 10219823 | 10219902 | Eutrema salsugineum 72664 | ACT|GTGCGTGATC...TCTTTCTTAAAT/ATGTGACTAATT...TGCAG|TAT | 0 | 1 | 13.82 |
| 77101239 | GT-AG | 0 | 2.561251304236241e-05 | 403 | rna-XM_024148542.1 14424018 | 14 | 10219999 | 10220401 | Eutrema salsugineum 72664 | CAG|GTTTGTCTAT...GAACCTTTATTT/TTATTTCTGAAT...TTCAG|TCA | 0 | 1 | 14.581 |
| 77101240 | GT-AG | 0 | 2.0290326639122556e-05 | 78 | rna-XM_024148542.1 14424018 | 15 | 10220470 | 10220547 | Eutrema salsugineum 72664 | AAG|GTCTGTTCGT...GACATTTTACAA/CTGTTTGTTATA...ACCAG|GTT | 2 | 1 | 15.121 |
| 77101241 | GT-AG | 0 | 1.000000099473604e-05 | 88 | rna-XM_024148542.1 14424018 | 16 | 10220669 | 10220756 | Eutrema salsugineum 72664 | CAG|GTAAATAGAA...CATTCTCTAATT/CATTCTCTAATT...TGCAG|GAC | 0 | 1 | 16.08 |
| 77101242 | GT-AG | 0 | 1.0813713639278073e-05 | 97 | rna-XM_024148542.1 14424018 | 17 | 10220964 | 10221060 | Eutrema salsugineum 72664 | CAG|GTAAAATTCT...TAGTTTTTAGCA/GTAGTTTTTAGC...TGTAG|ATT | 0 | 1 | 17.721 |
| 77101243 | GT-AG | 0 | 1.000000099473604e-05 | 102 | rna-XM_024148542.1 14424018 | 18 | 10221265 | 10221366 | Eutrema salsugineum 72664 | AAG|GTTAGTGCCT...TTGTCCTTTCCA/ATTTATTTCATT...AATAG|GGT | 0 | 1 | 19.339 |
| 77101244 | GT-AG | 0 | 0.0007788832366451 | 406 | rna-XM_024148542.1 14424018 | 19 | 10221468 | 10221873 | Eutrema salsugineum 72664 | CAG|GTATTATTCC...TATTTCTTTGCG/GTATTTGTCATA...TGTAG|CTT | 2 | 1 | 20.14 |
| 77101245 | GT-AG | 0 | 0.0017121446882004 | 166 | rna-XM_024148542.1 14424018 | 20 | 10221971 | 10222136 | Eutrema salsugineum 72664 | AAG|GTTTACATCT...CATCTTTTAATT/CATCTTTTAATT...TCTAG|GTT | 0 | 1 | 20.909 |
| 77101246 | GT-AG | 0 | 0.0001576737594222 | 120 | rna-XM_024148542.1 14424018 | 21 | 10222236 | 10222355 | Eutrema salsugineum 72664 | TCG|GTAATTTTAA...ATTTACTTAAAT/TTTAAATTTACT...TGCAG|GGT | 0 | 1 | 21.694 |
| 77101247 | GT-AG | 0 | 1.7993766228472823e-05 | 84 | rna-XM_024148542.1 14424018 | 22 | 10222515 | 10222598 | Eutrema salsugineum 72664 | CAG|GTAAACATAA...ATGCTTTTACTG/CTTTTACTGAGT...TGTAG|CAT | 0 | 1 | 22.954 |
| 77101248 | GT-AG | 0 | 1.000000099473604e-05 | 81 | rna-XM_024148542.1 14424018 | 23 | 10222674 | 10222754 | Eutrema salsugineum 72664 | AAG|GTAAGATGGT...CTTTTCTTGAAT/CTTTTCTTGAAT...ACCAG|GTT | 0 | 1 | 23.549 |
| 77101249 | GT-AG | 0 | 1.000000099473604e-05 | 106 | rna-XM_024148542.1 14424018 | 24 | 10223181 | 10223286 | Eutrema salsugineum 72664 | CAG|GTTGATAGAT...GAAGCCTTACCT/TAATCACTCATC...TGTAG|GTT | 0 | 1 | 26.927 |
| 77101250 | GT-AG | 0 | 1.000000099473604e-05 | 119 | rna-XM_024148542.1 14424018 | 25 | 10223617 | 10223735 | Eutrema salsugineum 72664 | AAG|GTGTGATTAT...ATTTCCATGAGG/TCTGTGCTGATT...TGAAG|GTC | 0 | 1 | 29.543 |
| 77101251 | GT-AG | 0 | 16.015089003477424 | 114 | rna-XM_024148542.1 14424018 | 26 | 10223868 | 10223981 | Eutrema salsugineum 72664 | GAG|GTATCCGTTG...TTTTTTTTATTT/CTTTTTTTTATT...GCTAG|TTG | 0 | 1 | 30.59 |
| 77101252 | GT-AG | 0 | 1.179948998958694e-05 | 72 | rna-XM_024148542.1 14424018 | 27 | 10224096 | 10224167 | Eutrema salsugineum 72664 | GAG|GTAGTTCACT...TTCTCCTTACAA/TTTCTCCTTACA...TGTAG|GTT | 0 | 1 | 31.494 |
| 77101253 | GT-AG | 0 | 4.6219650597513206e-05 | 135 | rna-XM_024148542.1 14424018 | 28 | 10224353 | 10224487 | Eutrema salsugineum 72664 | TGA|GTAAGTTCAT...ATTTTTTTCTCT/TCTCCATTTATT...TGCAG|TTA | 2 | 1 | 32.961 |
| 77101254 | GT-AG | 0 | 1.000000099473604e-05 | 148 | rna-XM_024148542.1 14424018 | 29 | 10224597 | 10224744 | Eutrema salsugineum 72664 | ATG|GTGAGAACTC...ATTCCCCTGACC/TTTGTGTTTATT...CATAG|GTA | 0 | 1 | 33.825 |
| 77101255 | GT-AG | 0 | 4.141573011170967e-05 | 78 | rna-XM_024148542.1 14424018 | 30 | 10224889 | 10224966 | Eutrema salsugineum 72664 | GAG|GTAAACGGTA...ACTTTCTTACTC/TACTTTCTTACT...ATCAG|ATA | 0 | 1 | 34.967 |
| 77101256 | GT-AG | 0 | 5.517392815877604e-05 | 516 | rna-XM_024148542.1 14424018 | 31 | 10225252 | 10225767 | Eutrema salsugineum 72664 | CAG|GTCCGCTTCC...TAATTCTTTTTC/TCATGGCTAATT...TACAG|ATC | 0 | 1 | 37.226 |
| 77101257 | GT-AG | 0 | 1.396165609318926e-05 | 99 | rna-XM_024148542.1 14424018 | 32 | 10226413 | 10226511 | Eutrema salsugineum 72664 | AAG|GTATAGAAGT...TTGTCCTTTTTG/ACTTTTTTCATT...TCCAG|AAT | 0 | 1 | 42.341 |
| 77101258 | GT-AG | 0 | 1.000000099473604e-05 | 93 | rna-XM_024148542.1 14424018 | 33 | 10226704 | 10226796 | Eutrema salsugineum 72664 | CAG|GTTCTTATAG...TCCATTTTAATG/TCCATTTTAATG...TGCAG|GCT | 0 | 1 | 43.863 |
| 77101259 | GT-AG | 0 | 1.000000099473604e-05 | 130 | rna-XM_024148542.1 14424018 | 34 | 10226901 | 10227030 | Eutrema salsugineum 72664 | CAG|GTTAATGGCT...ATAATTTTATCG/CTGTTGCTCATA...TATAG|GGT | 2 | 1 | 44.688 |
| 77101260 | GT-AG | 0 | 0.0021269538794049 | 78 | rna-XM_024148542.1 14424018 | 35 | 10227321 | 10227398 | Eutrema salsugineum 72664 | GAA|GTATGTCCAA...CTGGTTTTACCT/CTGGTACTTATC...TGCAG|ATC | 1 | 1 | 46.987 |
| 77101261 | GT-AG | 0 | 1.000000099473604e-05 | 83 | rna-XM_024148542.1 14424018 | 36 | 10227484 | 10227566 | Eutrema salsugineum 72664 | ATC|GTTAGTAAAC...AGAGCCTTTGCA/ATTGGTCTAATA...GGCAG|GGA | 2 | 1 | 47.661 |
| 77101262 | GT-AG | 0 | 0.4127170239191221 | 463 | rna-XM_024148542.1 14424018 | 37 | 10227882 | 10228344 | Eutrema salsugineum 72664 | TAT|GTATGCTGCT...CTTCCATTGATG/ACAATTTTCATG...CGCAG|TTC | 2 | 1 | 50.159 |
| 77101263 | GT-AG | 0 | 1.000000099473604e-05 | 89 | rna-XM_024148542.1 14424018 | 38 | 10228699 | 10228787 | Eutrema salsugineum 72664 | TGG|GTAAGTCATA...GGATGCTTATTC/TGGATGCTTATT...TGTAG|GTA | 2 | 1 | 52.965 |
| 77101264 | GT-AG | 0 | 1.000000099473604e-05 | 74 | rna-XM_024148542.1 14424018 | 39 | 10229278 | 10229351 | Eutrema salsugineum 72664 | CTG|GTAAGAAAAC...AATGGCTTATCT/TAATGGCTTATC...TGCAG|ATG | 0 | 1 | 56.851 |
| 77101265 | GT-AG | 0 | 1.1185545619682844e-05 | 183 | rna-XM_024148542.1 14424018 | 40 | 10229516 | 10229698 | Eutrema salsugineum 72664 | AAG|GTTCTTTTTA...TATATGTTATAT/TTATATATGACC...TGCAG|GTA | 2 | 1 | 58.151 |
| 77101266 | GT-AG | 0 | 1.000000099473604e-05 | 127 | rna-XM_024148542.1 14424018 | 41 | 10229862 | 10229988 | Eutrema salsugineum 72664 | AGA|GTGAGAACTT...TCTTTCTTGACT/TCTTTCTTGACT...TATAG|GGA | 0 | 1 | 59.443 |
| 77101267 | GT-AG | 0 | 0.3264398920037386 | 304 | rna-XM_024148542.1 14424018 | 42 | 10230259 | 10230562 | Eutrema salsugineum 72664 | CAG|GTACCTTTGT...GTTTTCTTACAT/TGTTTTCTTACA...TATAG|CTT | 0 | 1 | 61.584 |
| 77101268 | GT-AG | 0 | 0.1841273108010271 | 382 | rna-XM_024148542.1 14424018 | 43 | 10230758 | 10231139 | Eutrema salsugineum 72664 | CTG|GTATTCTCGC...CTTCTTTTAATT/CTTCTTTTAATT...TCCAG|GAT | 0 | 1 | 63.13 |
| 77101269 | GT-AG | 0 | 1.000000099473604e-05 | 108 | rna-XM_024148542.1 14424018 | 44 | 10231183 | 10231290 | Eutrema salsugineum 72664 | AAG|GTTCGTTTCC...TTTGTTTTCACG/TTTGTTTTCACG...TGCAG|GTG | 1 | 1 | 63.471 |
| 77101270 | GT-AG | 0 | 1.000000099473604e-05 | 181 | rna-XM_024148542.1 14424018 | 45 | 10231416 | 10231596 | Eutrema salsugineum 72664 | ATG|GTAAAATTCC...ATTGCCTTTTCT/CGATTGCTTACA...TGCAG|AGC | 0 | 1 | 64.462 |
| 77101271 | GT-AG | 0 | 1.000000099473604e-05 | 106 | rna-XM_024148542.1 14424018 | 46 | 10232488 | 10232593 | Eutrema salsugineum 72664 | CAT|GTAAGTTAGG...TCTCCCTTGTAT/ATTTATTTCATC...GGCAG|GAA | 0 | 1 | 71.527 |
| 77101272 | GT-AG | 0 | 8.200828034026429e-05 | 83 | rna-XM_024148542.1 14424018 | 47 | 10232668 | 10232750 | Eutrema salsugineum 72664 | AAG|GTCTTGTTCT...TTGGTCTTATAC/TTTGGTCTTATA...AGCAG|GAT | 2 | 1 | 72.114 |
| 77101273 | GT-AG | 0 | 3.5757756263448766e-05 | 100 | rna-XM_024148542.1 14424018 | 48 | 10233100 | 10233199 | Eutrema salsugineum 72664 | ATG|GTAAGTTCTA...TATTTTTTGATG/TATTTTTTGATG...AATAG|GAT | 0 | 1 | 74.881 |
| 77101274 | GT-AG | 0 | 1.000000099473604e-05 | 66 | rna-XM_024148542.1 14424018 | 49 | 10233299 | 10233364 | Eutrema salsugineum 72664 | AAG|GTAAAAGCAG...TCCTCCTTCTTT/TTATTACTTACT...ATCAG|ATC | 0 | 1 | 75.666 |
| 77101275 | GT-AG | 0 | 1.000000099473604e-05 | 324 | rna-XM_024148542.1 14424018 | 50 | 10233518 | 10233841 | Eutrema salsugineum 72664 | CAG|GTGAGCTGTC...ATCTTCTTATAA/TATGTTTTCATT...TTTAG|GTC | 0 | 1 | 76.879 |
| 77101276 | GT-AG | 0 | 25.49773735221876 | 92 | rna-XM_024148542.1 14424018 | 51 | 10234024 | 10234115 | Eutrema salsugineum 72664 | GAG|GTATCTTCTC...ATATCCTTATCT/TGTTTGTTAACT...ACCAG|GGT | 2 | 1 | 78.322 |
| 77101277 | GT-AG | 0 | 1.000000099473604e-05 | 368 | rna-XM_024148542.1 14424018 | 52 | 10234635 | 10235002 | Eutrema salsugineum 72664 | CAG|GTAAAATCAA...TTTTTCTTATGG/TTTTTTCTTATG...TCCAG|GTT | 2 | 1 | 82.437 |
| 77101278 | GT-AG | 0 | 0.0002375696179701 | 93 | rna-XM_024148542.1 14424018 | 53 | 10235181 | 10235273 | Eutrema salsugineum 72664 | CGA|GTACGTATTC...TGCAATTTGATT/TGCAATTTGATT...GACAG|GTG | 0 | 1 | 83.849 |
| 77101279 | GT-AG | 0 | 0.0058279376496982 | 125 | rna-XM_024148542.1 14424018 | 54 | 10235411 | 10235535 | Eutrema salsugineum 72664 | AAA|GTACTTTTCT...TTGTTTGTGATT/TTGTTTGTGATT...TTCAG|CCT | 2 | 1 | 84.935 |
| 77101280 | GT-AG | 0 | 1.000000099473604e-05 | 429 | rna-XM_024148542.1 14424018 | 55 | 10235663 | 10236091 | Eutrema salsugineum 72664 | CAG|GTGATTTGAA...TGTTTCAAAATT/ATGTTGCTAATG...TTTAG|GTT | 0 | 1 | 85.942 |
| 77101281 | GT-AG | 0 | 0.0003896873170562 | 72 | rna-XM_024148542.1 14424018 | 56 | 10236314 | 10236385 | Eutrema salsugineum 72664 | CAG|GTACATCTTT...GTTTACTTAAAA/ATTGAGTTGATC...ATCAG|GTT | 0 | 1 | 87.702 |
| 77101282 | GT-AG | 0 | 0.0097488884021885 | 289 | rna-XM_024148542.1 14424018 | 57 | 10236746 | 10237034 | Eutrema salsugineum 72664 | CAG|GTACCTCCTC...TGTTTGTTATTT/CTGGTTCTGATT...TGCAG|ATG | 0 | 1 | 90.557 |
| 77101283 | GT-AG | 0 | 1.000000099473604e-05 | 90 | rna-XM_024148542.1 14424018 | 58 | 10237194 | 10237283 | Eutrema salsugineum 72664 | CAG|GTTAAACCAA...ATTATTTTATTT/AATTATTTTATT...TGTAG|TGC | 0 | 1 | 91.817 |
| 77101284 | GT-AG | 0 | 1.000000099473604e-05 | 275 | rna-XM_024148542.1 14424018 | 59 | 10237569 | 10237843 | Eutrema salsugineum 72664 | AAG|GTTATTGCTA...TCCATCTTCTCG/CCACACCTAATG...TGCAG|GTT | 0 | 1 | 94.077 |
| 77101285 | GT-AG | 0 | 1.000000099473604e-05 | 492 | rna-XM_024148542.1 14424018 | 60 | 10238069 | 10238560 | Eutrema salsugineum 72664 | CAG|GTAATTCCTT...GTTTTCTTACAG/CGTTTTCTTACA...TTCAG|GTT | 0 | 1 | 95.861 |
| 77101286 | GT-AG | 0 | 1.000000099473604e-05 | 145 | rna-XM_024148542.1 14424018 | 61 | 10238614 | 10238758 | Eutrema salsugineum 72664 | GGG|GTACGTAAAG...TGTTTTTTCTTT/GTTTTCGTCATT...GCCAG|ATT | 2 | 1 | 96.281 |
| 77101287 | GT-AG | 0 | 3.601677259122398e-05 | 172 | rna-XM_024148542.1 14424018 | 62 | 10238868 | 10239039 | Eutrema salsugineum 72664 | CTC|GTAAGTATCT...TCGCCATTGATA/ATGTTAATCACT...CTCAG|GTG | 0 | 1 | 97.146 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL
,PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);