introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, probability (0-100%) that the intron is minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
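In the rows listed on this page, `length` equals `end - start + 1`, which suggests that `start` and `end` are inclusive coordinates. The following is a minimal sketch of that check using Python's built-in `sqlite3` module. It assumes an in-memory stand-in table holding only a subset of the columns, filled with three rows copied from this page; the location of the real database file is not given here.

```python
import sqlite3

# In-memory stand-in for the real database (assumption: subset of columns only).
conn = sqlite3.connect(":memory:")
conn.execute(
    'CREATE TABLE introns ("id" INTEGER PRIMARY KEY, "length" INTEGER, '
    '"transcript_id" INTEGER, "ordinal_index" INTEGER, '
    '"start" INTEGER, "end" INTEGER)'
)

# Three rows copied from the table on this page.
rows = [
    (115633262, 18420, 21436520, 1, 21094123, 21112542),
    (115633263, 494, 21436520, 2, 21112588, 21113081),
    (115633264, 182, 21436520, 3, 21113164, 21113345),
]
conn.executemany("INSERT INTO introns VALUES (?, ?, ?, ?, ?, ?)", rows)

# Count rows whose stored length disagrees with the inclusive span.
# "end" is an SQL keyword, so the column names are quoted.
mismatches = conn.execute(
    'SELECT COUNT(*) FROM introns WHERE "length" != "end" - "start" + 1'
).fetchone()[0]
print(mismatches)
```

A count of zero means every inserted row satisfies the relationship; the same query could be pointed at the full table to look for exceptions.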
58 rows where transcript_id = 21436520
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 115633262 | GT-AG | 0 | 1.000000099473604e-05 | 18420 | rna-XM_034070145.1 21436520 | 1 | 21094123 | 21112542 | Melopsittacus undulatus 13146 | CCG\|GTGAGGGGCA...GTCTTCTAAACA/TGTCTTCTAAAC...TTCAG\|ATT | 2 | 1 | 0.738 |
| 115633263 | GT-AG | 0 | 1.000000099473604e-05 | 494 | rna-XM_034070145.1 21436520 | 2 | 21112588 | 21113081 | Melopsittacus undulatus 13146 | CAG\|GTAAGACATA...ACATTCTCACTT/TACATTCTCACT...TTCAG\|CCA | 2 | 1 | 1.274 |
| 115633264 | GT-AG | 0 | 1.000000099473604e-05 | 182 | rna-XM_034070145.1 21436520 | 3 | 21113164 | 21113345 | Melopsittacus undulatus 13146 | GAG\|GTAAATCACT...TTTTTTTTGTTT/ACTTGTTTTACA...TGCAG\|GAT | 0 | 1 | 2.251 |
| 115633265 | GT-AG | 0 | 1.000000099473604e-05 | 779 | rna-XM_034070145.1 21436520 | 4 | 21113423 | 21114201 | Melopsittacus undulatus 13146 | TGG\|GTAAGAAACC...CCATCTTTTACT/TTGTGACTGATT...TTCAG\|TTC | 2 | 1 | 3.168 |
| 115633266 | GT-AG | 0 | 6.071537721878002e-05 | 2783 | rna-XM_034070145.1 21436520 | 5 | 21114322 | 21117104 | Melopsittacus undulatus 13146 | TGG\|GTAAGTTTTG...GTCACCTAATCT/TGTCACCTAATC...TCTAG\|AAG | 2 | 1 | 4.597 |
| 115633267 | GT-AG | 0 | 1.000000099473604e-05 | 2137 | rna-XM_034070145.1 21436520 | 6 | 21117295 | 21119431 | Melopsittacus undulatus 13146 | CAG\|GTATGTGAAC...TGACCAGTAACC/GGTGCGCTGACA...TACAG\|GCA | 0 | 1 | 6.86 |
| 115633268 | GT-AG | 0 | 1.282758415278108e-05 | 7595 | rna-XM_034070145.1 21436520 | 7 | 21119592 | 21127186 | Melopsittacus undulatus 13146 | GAG\|GTTTGTATCC...AGACCCTTGTCT/TGATTATTCACT...TATAG\|AGG | 1 | 1 | 8.765 |
| 115633269 | GT-AG | 0 | 0.0002112113426801 | 1947 | rna-XM_034070145.1 21436520 | 8 | 21127337 | 21129283 | Melopsittacus undulatus 13146 | GAG\|GTAATTTTGC...TTACTTTTATCC/CTTACTTTTATC...ACCAG\|TTC | 1 | 1 | 10.551 |
| 115633270 | GT-AG | 0 | 1.000000099473604e-05 | 3965 | rna-XM_034070145.1 21436520 | 9 | 21129496 | 21133460 | Melopsittacus undulatus 13146 | AAG\|GTACTGAATT...GTTATCTTACAC/TGTTATCTTACA...TGTAG\|GAT | 0 | 1 | 13.076 |
| 115633271 | GT-AG | 0 | 1.000000099473604e-05 | 78 | rna-XM_034070145.1 21436520 | 10 | 21133578 | 21133655 | Melopsittacus undulatus 13146 | CAG\|GTACTGTGTA...AATATCTAAAAT/GAATATCTAAAA...TAAAG\|AAT | 0 | 1 | 14.469 |
| 115633272 | GT-AG | 0 | 1.000000099473604e-05 | 826 | rna-XM_034070145.1 21436520 | 11 | 21133767 | 21134592 | Melopsittacus undulatus 13146 | AAG\|GTAGGAGATG...TTTTTCTTCCCT/ATTACATTAAAT...CTTAG\|GTT | 0 | 1 | 15.791 |
| 115633273 | GT-AG | 0 | 1.000000099473604e-05 | 1980 | rna-XM_034070145.1 21436520 | 12 | 21134748 | 21136727 | Melopsittacus undulatus 13146 | GTG\|GTAAGTAAAT...CCTTTCTAAAAT/TGAGCATTTATT...CACAG\|GGG | 2 | 1 | 17.637 |
| 115633274 | GC-AG | 0 | 1.000000099473604e-05 | 817 | rna-XM_034070145.1 21436520 | 13 | 21136846 | 21137662 | Melopsittacus undulatus 13146 | CAG\|GCAAGTGTAA...TTTCCCTCAGTA/GTTGTTCTCATA...TGTAG\|GTT | 0 | 1 | 19.043 |
| 115633275 | GT-AG | 0 | 1.5747807328963173e-05 | 760 | rna-XM_034070145.1 21436520 | 14 | 21137848 | 21138607 | Melopsittacus undulatus 13146 | AAA\|GTAAGTATTG...TTCACCTTTTTC/AAAAAATTCACC...TCCAG\|ACC | 2 | 1 | 21.246 |
| 115633276 | GT-AG | 0 | 1.000000099473604e-05 | 1524 | rna-XM_034070145.1 21436520 | 15 | 21138718 | 21140241 | Melopsittacus undulatus 13146 | ACA\|GTAAGTAATA...TCCATTTTGGTA/CTTTAATTAAAT...CTCAG\|AGC | 1 | 1 | 22.556 |
| 115633277 | GT-AG | 0 | 1.000000099473604e-05 | 3967 | rna-XM_034070145.1 21436520 | 16 | 21140355 | 21144321 | Melopsittacus undulatus 13146 | AAG\|GTTTGTCTCA...AAAACTGTAATG/TTTCTGTTCAAA...TGTAG\|GTG | 0 | 1 | 23.901 |
| 115633278 | GT-AG | 0 | 0.0015896523363119 | 98 | rna-XM_034070145.1 21436520 | 17 | 21144466 | 21144563 | Melopsittacus undulatus 13146 | CAG\|GTATGTCTTT...TACATCTTAAAA/GTGAGTTTAATT...TTCAG\|GTT | 0 | 1 | 25.616 |
| 115633279 | GT-AG | 0 | 1.000000099473604e-05 | 204 | rna-XM_034070145.1 21436520 | 18 | 21144676 | 21144879 | Melopsittacus undulatus 13146 | AAG\|GTAAGTAGGA...AAGACATTAACT/AAGACATTAACT...TGAAG\|GAG | 1 | 1 | 26.95 |
| 115633280 | GT-AG | 0 | 1.000000099473604e-05 | 928 | rna-XM_034070145.1 21436520 | 19 | 21145035 | 21145962 | Melopsittacus undulatus 13146 | CAG\|GTTTGTGGTT...AGAGACTTAATG/TAGAGACTTAAT...TGTAG\|GAA | 0 | 1 | 28.796 |
| 115633281 | GT-AG | 0 | 1.000000099473604e-05 | 1098 | rna-XM_034070145.1 21436520 | 20 | 21146161 | 21147258 | Melopsittacus undulatus 13146 | GAG\|GTAAAGTATT...AATACTTTATTC/TGTATTTTAAAT...TGCAG\|AAA | 0 | 1 | 31.154 |
| 115633282 | GT-AG | 0 | 1.000000099473604e-05 | 3233 | rna-XM_034070145.1 21436520 | 21 | 21147470 | 21150702 | Melopsittacus undulatus 13146 | AGG\|GTGAGCTCAA...TTTTTCTCATCT/CTTTTTCTCATC...TTCAG\|AAG | 1 | 1 | 33.667 |
| 115633283 | GT-AG | 0 | 4.4531516187587226e-05 | 1340 | rna-XM_034070145.1 21436520 | 22 | 21150830 | 21152169 | Melopsittacus undulatus 13146 | AAG\|GTAAACTGAC...TATACCTTTGTT/CCCTTTTTCAAT...ATTAG\|TTT | 2 | 1 | 35.179 |
| 115633284 | GT-AG | 0 | 1.000000099473604e-05 | 393 | rna-XM_034070145.1 21436520 | 23 | 21152322 | 21152714 | Melopsittacus undulatus 13146 | CGG\|GTAAGTCTGC...TATGTATTGATA/TATGTATTGATA...GAAAG\|ATG | 1 | 1 | 36.989 |
| 115633285 | GT-AG | 0 | 1.000000099473604e-05 | 431 | rna-XM_034070145.1 21436520 | 24 | 21152811 | 21153241 | Melopsittacus undulatus 13146 | CAG\|GTGAGGACAT...TTTTTCTTTTTT/CATTTTTTGAGG...TACAG\|GTC | 1 | 1 | 38.133 |
| 115633286 | GT-AG | 0 | 1.0078303685912605e-05 | 329 | rna-XM_034070145.1 21436520 | 25 | 21153366 | 21153694 | Melopsittacus undulatus 13146 | TAA\|GTAAGGATTT...TGTGTTTTAACT/TGTGTTTTAACT...CCTAG\|GGA | 2 | 1 | 39.609 |
| 115633287 | GT-AG | 0 | 0.000710263073934 | 872 | rna-XM_034070145.1 21436520 | 26 | 21153793 | 21154664 | Melopsittacus undulatus 13146 | AAG\|GTACACCAGT...TGGTTTTTAACA/TGGTTTTTAACA...TTTAG\|CCG | 1 | 1 | 40.776 |
| 115633288 | GT-AG | 0 | 1.000000099473604e-05 | 419 | rna-XM_034070145.1 21436520 | 27 | 21154814 | 21155232 | Melopsittacus undulatus 13146 | CAG\|GTAAGGGTTT...TTGGTATTACTT/TTTGGTATTACT...TTTAG\|GAT | 0 | 1 | 42.551 |
| 115633289 | GT-AG | 0 | 1.000000099473604e-05 | 945 | rna-XM_034070145.1 21436520 | 28 | 21155325 | 21156269 | Melopsittacus undulatus 13146 | CAA\|GTAAGTATTG...CTCACCTTTTCT/TTGCAGCTCACC...TGTAG\|ACT | 2 | 1 | 43.647 |
| 115633290 | GC-AG | 0 | 1.000000099473604e-05 | 91 | rna-XM_034070145.1 21436520 | 29 | 21156417 | 21156507 | Melopsittacus undulatus 13146 | CAG\|GCAAGTTTAT...TATTTTTTAAAT/TATTTTTTAAAT...TACAG\|GGG | 2 | 1 | 45.397 |
| 115633291 | GT-AG | 0 | 1.000000099473604e-05 | 80 | rna-XM_034070145.1 21436520 | 30 | 21156627 | 21156706 | Melopsittacus undulatus 13146 | AAG\|GTAAGAGCTG...TCTTTCTTAAGT/TCTTTCTTAAGT...CTTAG\|ATT | 1 | 1 | 46.814 |
| 115633292 | GT-AG | 0 | 0.0007435523731383 | 100 | rna-XM_034070145.1 21436520 | 31 | 21156834 | 21156933 | Melopsittacus undulatus 13146 | CCC\|GTGTGTATTT...ATTTCCTTTCTT/ATAAAATTCATT...CCAAG\|TCT | 2 | 1 | 48.327 |
| 115633293 | GT-AG | 0 | 2.706620900569982e-05 | 225 | rna-XM_034070145.1 21436520 | 32 | 21157064 | 21157288 | Melopsittacus undulatus 13146 | TTG\|GTAAGCACTT...TGCCTCTAAATT/GTTGTAATGACC...TACAG\|CTC | 0 | 1 | 49.875 |
| 115633294 | GT-AG | 0 | 1.000000099473604e-05 | 536 | rna-XM_034070145.1 21436520 | 33 | 21157450 | 21157985 | Melopsittacus undulatus 13146 | AAA\|GTAAGTGAAG...TAACCTTTATTT/AGTTATTTAACC...CCTAG\|CAA | 2 | 1 | 51.792 |
| 115633295 | GT-AG | 0 | 1.000000099473604e-05 | 182 | rna-XM_034070145.1 21436520 | 34 | 21158226 | 21158407 | Melopsittacus undulatus 13146 | CAG\|GTAAAATTCA...CTACTTTTGATT/CTACTTTTGATT...TAAAG\|TTC | 2 | 1 | 54.65 |
| 115633296 | GT-AG | 0 | 1.000000099473604e-05 | 772 | rna-XM_034070145.1 21436520 | 35 | 21158532 | 21159303 | Melopsittacus undulatus 13146 | GAG\|GTTGGTTTTC...TTATCTCTGATA/TTATCTCTGATA...CCCAG\|GTT | 0 | 1 | 56.127 |
| 115633297 | GT-AG | 0 | 4.114587377368131e-05 | 1584 | rna-XM_034070145.1 21436520 | 36 | 21159416 | 21160999 | Melopsittacus undulatus 13146 | ATG\|GTACGTATAA...TTTTCCATATTC/CATATTCTAAAC...TGTAG\|GCA | 1 | 1 | 57.461 |
| 115633298 | GT-AG | 0 | 3.2444552379607566e-05 | 911 | rna-XM_034070145.1 21436520 | 37 | 21161132 | 21162042 | Melopsittacus undulatus 13146 | CAG\|GTACACCACA...TATTCGTTGTTT/TTGACAGTTATT...CACAG\|GAG | 1 | 1 | 59.033 |
| 115633299 | GC-AG | 0 | 1.000000099473604e-05 | 345 | rna-XM_034070145.1 21436520 | 38 | 21162301 | 21162645 | Melopsittacus undulatus 13146 | CAG\|GCAAGTCTGA...AATTCATTAATT/GATAAATTCATT...TTTAG\|CAA | 1 | 1 | 62.106 |
| 115633300 | GT-AG | 0 | 1.000000099473604e-05 | 1296 | rna-XM_034070145.1 21436520 | 39 | 21162915 | 21164210 | Melopsittacus undulatus 13146 | CAG\|GTAAGAACAT...GCATTCTTGTCA/TTACATCTCATT...TATAG\|AAT | 0 | 1 | 65.309 |
| 115633301 | GT-AG | 0 | 6.3492254276688e-05 | 344 | rna-XM_034070145.1 21436520 | 40 | 21164423 | 21164766 | Melopsittacus undulatus 13146 | AAG\|GTATAAATAT...CAATCCTTTTCT/TCTGTATTCAAT...ACTAG\|ACG | 2 | 1 | 67.834 |
| 115633302 | GT-AG | 0 | 1.000000099473604e-05 | 1812 | rna-XM_034070145.1 21436520 | 41 | 21164998 | 21166809 | Melopsittacus undulatus 13146 | AAG\|GTACTGAGCC...TGTGTCTTGTCT/TCTTGTCTAATG...TTAAG\|GAC | 2 | 1 | 70.585 |
| 115633303 | GT-AG | 0 | 1.000000099473604e-05 | 552 | rna-XM_034070145.1 21436520 | 42 | 21166884 | 21167435 | Melopsittacus undulatus 13146 | AAA\|GTGAGTGTAC...TTTGCTTTTATA/TTTGCTTTTATA...TTTAG\|GTG | 1 | 1 | 71.466 |
| 115633304 | GT-AG | 0 | 1.000000099473604e-05 | 99 | rna-XM_034070145.1 21436520 | 43 | 21167609 | 21167707 | Melopsittacus undulatus 13146 | CAG\|GTGACATCAG...CTTCTGTTAATT/ATATTATTTACC...TTCAG\|CCA | 0 | 1 | 73.526 |
| 115633305 | GT-AG | 0 | 8.422216988391768e-05 | 535 | rna-XM_034070145.1 21436520 | 44 | 21167822 | 21168356 | Melopsittacus undulatus 13146 | GAG\|GTAGATTTAA...AAAGCCTTATAG/TTAAATTTTACT...TTTAG\|GTT | 0 | 1 | 74.884 |
| 115633306 | GT-AG | 0 | 1.000000099473604e-05 | 629 | rna-XM_034070145.1 21436520 | 45 | 21168578 | 21169206 | Melopsittacus undulatus 13146 | CAG\|GTAGTTTAAG...CTGACTTTGCTT/TTTAAGCTGACT...GTCAG\|GCC | 2 | 1 | 77.516 |
| 115633307 | GT-AG | 0 | 0.1179546063830525 | 86 | rna-XM_034070145.1 21436520 | 46 | 21169328 | 21169413 | Melopsittacus undulatus 13146 | TCA\|GTATGTATTT...CTCATTTTGATT/CTCATTTTGATT...TTAAG\|ATC | 0 | 1 | 78.957 |
| 115633308 | GT-AG | 0 | 1.000000099473604e-05 | 245 | rna-XM_034070145.1 21436520 | 47 | 21169513 | 21169757 | Melopsittacus undulatus 13146 | GAG\|GTAAAAAATA...AACTCTTTTGTG/GTGGAGCTGAAA...ATTAG\|GTT | 0 | 1 | 80.136 |
| 115633309 | GT-AG | 0 | 1.000000099473604e-05 | 579 | rna-XM_034070145.1 21436520 | 48 | 21170002 | 21170580 | Melopsittacus undulatus 13146 | CAA\|GTAAGTGATA...GATGTTTAAACT/TTTAAACTTATG...TTCAG\|GTC | 1 | 1 | 83.042 |
| 115633310 | GT-AG | 0 | 1.000000099473604e-05 | 535 | rna-XM_034070145.1 21436520 | 49 | 21170693 | 21171227 | Melopsittacus undulatus 13146 | TAG\|GTAAGTGATA...TTTTTCTGAAAT/TTTTTTCTGAAA...TTAAG\|GGA | 2 | 1 | 84.375 |
| 115633311 | GT-AG | 0 | 1.1988782371976593e-05 | 288 | rna-XM_034070145.1 21436520 | 50 | 21171373 | 21171660 | Melopsittacus undulatus 13146 | CCA\|GTAAGTCTTA...TAATTCCTTATC/TAATTCCTTATC...CCCAG\|GCA | 0 | 1 | 86.102 |
| 115633312 | GT-AG | 0 | 1.000000099473604e-05 | 229 | rna-XM_034070145.1 21436520 | 51 | 21171792 | 21172020 | Melopsittacus undulatus 13146 | CAG\|GTAAAGTATC...CATTCCTTACTG/CCATTCCTTACT...TCAAG\|GGA | 2 | 1 | 87.662 |
| 115633313 | GT-AG | 0 | 1.000000099473604e-05 | 247 | rna-XM_034070145.1 21436520 | 52 | 21172082 | 21172328 | Melopsittacus undulatus 13146 | CAG\|GTAATGAATT...AGTATGTTAATT/GCTATTCTGACT...AACAG\|GAA | 0 | 1 | 88.389 |
| 115633314 | GT-AG | 0 | 0.0045759995009752 | 402 | rna-XM_034070145.1 21436520 | 53 | 21172505 | 21172906 | Melopsittacus undulatus 13146 | CAG\|GTATGCTACT...ATTTCTTTTGTG/TTTGTGTTGAAT...TTCAG\|AAT | 2 | 1 | 90.485 |
| 115633315 | GT-AG | 0 | 1.526098678289925e-05 | 1340 | rna-XM_034070145.1 21436520 | 54 | 21172992 | 21174331 | Melopsittacus undulatus 13146 | AAG\|GTAAACTGAC...GAGTCCTTCAAA/ATTACTTTCAAA...TGTAG\|GTG | 0 | 1 | 91.497 |
| 115633316 | GT-AG | 0 | 1.000000099473604e-05 | 366 | rna-XM_034070145.1 21436520 | 55 | 21174488 | 21174853 | Melopsittacus undulatus 13146 | CAG\|GTAAGCACAG...TTCTTTTTAAAT/TTCTTTTTAAAT...CATAG\|GTT | 0 | 1 | 93.355 |
| 115633317 | GT-AG | 0 | 1.3642339587293518e-05 | 1357 | rna-XM_034070145.1 21436520 | 56 | 21174968 | 21176324 | Melopsittacus undulatus 13146 | CAC\|GTAAGTCTTT...ATGACTATAATG/ATGATTATGACT...TTAAG\|GCA | 0 | 1 | 94.712 |
| 115633318 | GT-AG | 0 | 1.3028961264322393e-05 | 592 | rna-XM_034070145.1 21436520 | 57 | 21176470 | 21177061 | Melopsittacus undulatus 13146 | CAG\|GTATGAAATA...CGTTTTTTATTT/TCGTTTTTTATT...CACAG\|GAG | 1 | 1 | 96.439 |
| 115633319 | GT-AG | 0 | 0.0001884076353141 | 1067 | rna-XM_034070145.1 21436520 | 58 | 21177148 | 21178214 | Melopsittacus undulatus 13146 | CTT\|GTAAGTTGTC...CAGATTTTAGCC/ACCATTTTAAAT...TTCAG\|GTT | 0 | 1 | 97.463 |
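In the rows above, the `scored_motifs` strings appear to line up with `dinucleotide_pair`: the two bases right after the first `|` and the two bases right before the last `|` match the listed pair. The sketch below illustrates that reading in Python. It assumes that `|` marks the exon/intron boundaries and that the middle segment is the (abbreviated) intron sequence; this interpretation is inferred from the rows on this page, not documented here, and `splice_dinucleotides` is a hypothetical helper name.

```python
def splice_dinucleotides(scored_motifs: str) -> str:
    """Return 'XX-YY' built from the first and last two bases of the intron segment.

    Assumption: the string has the form exon|intron|exon, with plain '|'
    separators marking the splice boundaries.
    """
    parts = scored_motifs.split("|")
    if len(parts) != 3:
        raise ValueError(f"expected exactly two '|' separators: {scored_motifs!r}")
    intron = parts[1]
    return f"{intron[:2]}-{intron[-2:]}"


# Values copied from ordinal_index 1 and 13 in the table on this page,
# paired with the dinucleotide_pair listed for the same rows.
examples = {
    "CCG|GTGAGGGGCA...GTCTTCTAAACA/TGTCTTCTAAAC...TTCAG|ATT": "GT-AG",
    "CAG|GCAAGTGTAA...TTTCCCTCAGTA/GTTGTTCTCATA...TGTAG|GTT": "GC-AG",
}

results = {motif: splice_dinucleotides(motif) for motif in examples}
all_match = all(results[m] == expected for m, expected in examples.items())
print(all_match)
```

If the assumption about the separators holds, a mismatch between this derived pair and the stored `dinucleotide_pair` would point to a row worth inspecting.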
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL,
PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
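The index on `transcript_id` is what lets a filter such as `transcript_id = 21436520` avoid a full table scan. The following is a rough sketch, using Python's built-in `sqlite3`, of how one might confirm this with `EXPLAIN QUERY PLAN`. It assumes an empty in-memory table with only three of the columns; the plan chosen against the real, populated database could differ.

```python
import sqlite3

# Empty in-memory stand-in (assumption: reduced column set, no data).
conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE "introns" (
        "id" INTEGER,
        "transcript_id" INTEGER,
        "ordinal_index" INTEGER,
        PRIMARY KEY ([id])
    );
    CREATE INDEX [idx_introns_transcript_id]
        ON [introns] ([transcript_id]);
    """
)

# Ask SQLite how it would run the filter used on this page.
plan = conn.execute(
    "EXPLAIN QUERY PLAN "
    "SELECT * FROM introns WHERE transcript_id = ? ORDER BY ordinal_index",
    (21436520,),
).fetchall()

# The human-readable description is the fourth column of each plan row.
details = [row[3] for row in plan]
uses_index = any("idx_introns_transcript_id" in detail for detail in details)
for detail in details:
    print(detail)
```

The exact wording of the plan text varies between SQLite versions, so the check looks only for the index name rather than a full string match.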