introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
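As a rough illustration of how dinucleotide_pair relates to scored_motifs, the sketch below recovers the terminal dinucleotides from a scored_motifs string. The reading of the format is an assumption inferred from the rows on this page (flanking exon bases, then `|`, then intron sequence, then `|`, then flanking exon bases); the page itself does not document it, and the helper name `terminal_dinucleotides` is hypothetical.

```python
def terminal_dinucleotides(scored_motifs: str) -> str:
    """Return the first and last two intron bases as 'XX-YY'.

    Assumption (not documented on the page): the text between the
    first and last '|' is intron sequence, and the text outside the
    pipes is flanking exon sequence.
    """
    parts = scored_motifs.split("|")
    if len(parts) != 3:
        raise ValueError(f"unexpected scored_motifs layout: {scored_motifs!r}")
    intron = parts[1]
    return f"{intron[:2]}-{intron[-2:]}"


# scored_motifs value copied from the row with id 120104297.
example = "GCT|GTAAGAACAT...TTTTTTTTTTCC/TTTTTTTTCCCC...AACAG|AAA"
print(terminal_dinucleotides(example))
```

For that row the result agrees with the stored dinucleotide_pair value, GT-AG.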
35 rows where transcript_id = 22173145
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 120104297 | GT-AG | 0 | 1.000000099473604e-05 | 2364 | rna-XM_036389940.1 22173145 | 1 | 93650395 | 93652758 | Molothrus ater 84834 | GCT\|GTAAGAACAT...TTTTTTTTTTCC/TTTTTTTTCCCC...AACAG\|AAA | 2 | 1 | 1.583 |
| 120104298 | GT-AG | 0 | 1.000000099473604e-05 | 319 | rna-XM_036389940.1 22173145 | 2 | 93652946 | 93653264 | Molothrus ater 84834 | CAG\|GTGAGGAAGA...CCATTCTGAATG/ATGGTTATAACT...TGCAG\|GTC | 0 | 1 | 5.754 |
| 120104299 | GT-AG | 0 | 0.0003435490620444 | 728 | rna-XM_036389940.1 22173145 | 3 | 93653428 | 93654155 | Molothrus ater 84834 | TTG\|GTATGGATAG...AGAGCCTTGATT/CTTGATTTCATT...TGCAG\|TCA | 1 | 1 | 9.389 |
| 120104300 | GT-AG | 0 | 1.000000099473604e-05 | 308 | rna-XM_036389940.1 22173145 | 4 | 93654209 | 93654516 | Molothrus ater 84834 | ACA\|GTAAGTCAAG...GGTAACTTATCT/TGGTAACTTATC...CACAG\|CAG | 0 | 1 | 10.571 |
| 120104301 | GT-AG | 0 | 1.000000099473604e-05 | 670 | rna-XM_036389940.1 22173145 | 5 | 93654538 | 93655207 | Molothrus ater 84834 | AAG\|GTAAAGCTCC...AGTCTTTTTGCA/TCTTTTTGCACT...TGCAG\|GAC | 0 | 1 | 11.039 |
| 120104302 | GT-AG | 0 | 1.000000099473604e-05 | 426 | rna-XM_036389940.1 22173145 | 6 | 93655362 | 93655787 | Molothrus ater 84834 | ATG\|GTAAGAGTCT...TTTCTCTTGCTT/TCTATTCTCACT...CACAG\|TGC | 1 | 1 | 14.474 |
| 120104303 | GT-AG | 0 | 1.000000099473604e-05 | 555 | rna-XM_036389940.1 22173145 | 7 | 93655873 | 93656427 | Molothrus ater 84834 | CAT\|GTGAGCTGAC...TGTTTCTGGGTG/TTCTGGGTGAGT...GCCAG\|GTA | 2 | 1 | 16.369 |
| 120104304 | GT-AG | 0 | 1.000000099473604e-05 | 111 | rna-XM_036389940.1 22173145 | 8 | 93656549 | 93656659 | Molothrus ater 84834 | GAG\|GTGAGCACGG...GCCTTCCTGATT/TACAATTTCATA...CCCAG\|ACA | 0 | 1 | 19.068 |
| 120104305 | GT-AG | 0 | 1.000000099473604e-05 | 862 | rna-XM_036389940.1 22173145 | 10 | 93656777 | 93657638 | Molothrus ater 84834 | CAG\|GTAGAAAAAA...CCTGCTTAGACA/ACTCTACTCACC...CCCAG\|GAG | 1 | 1 | 21.632 |
| 120104306 | GT-AG | 0 | 1.000000099473604e-05 | 349 | rna-XM_036389940.1 22173145 | 11 | 93657749 | 93658097 | Molothrus ater 84834 | AAT\|GTGAGTGAAA...AACTCTCTGACT/TGGATTTTCAAT...TACAG\|CTG | 0 | 1 | 24.086 |
| 120104307 | GT-AG | 0 | 1.000000099473604e-05 | 448 | rna-XM_036389940.1 22173145 | 12 | 93658269 | 93658716 | Molothrus ater 84834 | CAG\|GTTAGGAACC...AAATCCTCAGAC/GAAATCCTCAGA...CACAG\|GCA | 0 | 1 | 27.899 |
| 120104308 | GT-AG | 0 | 1.000000099473604e-05 | 125 | rna-XM_036389940.1 22173145 | 13 | 93658942 | 93659066 | Molothrus ater 84834 | CTG\|GTGAGTGCGG...TGTTCATTACCC/GTACTGTTCATT...TTTAG\|GTT | 0 | 1 | 32.917 |
| 120104309 | GT-AG | 0 | 1.000000099473604e-05 | 507 | rna-XM_036389940.1 22173145 | 14 | 93659131 | 93659637 | Molothrus ater 84834 | CAG\|GTAAGGGCAT...GCTTTTTTAATT/GCTTTTTTAATT...TTCAG\|GTC | 1 | 1 | 34.344 |
| 120104310 | GT-AG | 0 | 5.058141177534356e-05 | 158 | rna-XM_036389940.1 22173145 | 15 | 93659784 | 93659941 | Molothrus ater 84834 | CAT\|GTAAGCAGTG...CTTTCCTTCGTC/TCGTCTCTCAGT...TTCAG\|GTG | 0 | 1 | 37.6 |
| 120104311 | GT-AG | 0 | 1.000000099473604e-05 | 825 | rna-XM_036389940.1 22173145 | 16 | 93660092 | 93660916 | Molothrus ater 84834 | AGT\|GTGAGTCACT...AAAATCTTCACA/ACTGTTTTCATT...TACAG\|GTG | 0 | 1 | 40.946 |
| 120104312 | GT-AG | 0 | 1.000000099473604e-05 | 668 | rna-XM_036389940.1 22173145 | 17 | 93661115 | 93661782 | Molothrus ater 84834 | AAG\|GTAAATCTAA...TATGCCTTCTTT/AACCCCTCCACA...TCCAG\|AAT | 0 | 1 | 45.361 |
| 120104313 | GT-AG | 0 | 0.024124859591552 | 610 | rna-XM_036389940.1 22173145 | 18 | 93661871 | 93662480 | Molothrus ater 84834 | TGT\|GTACATCTCC...CTGCCTTTGAAA/CCTGGTTTCACA...AACAG\|CTT | 1 | 1 | 47.324 |
| 120104314 | GT-AG | 0 | 1.000000099473604e-05 | 785 | rna-XM_036389940.1 22173145 | 19 | 93662635 | 93663419 | Molothrus ater 84834 | CAA\|GTGAGTAATT...ACCTCTTTTACT/ACCTCTTTTACT...CTCAG\|TGA | 2 | 1 | 50.758 |
| 120104315 | GT-AG | 0 | 1.000000099473604e-05 | 362 | rna-XM_036389940.1 22173145 | 20 | 93663649 | 93664010 | Molothrus ater 84834 | GCG\|GTGAGTGGCT...GTATCCTGAACT/TTTTGTTTCACC...TCTAG\|CTC | 0 | 1 | 55.865 |
| 120104316 | GT-AG | 0 | 1.000000099473604e-05 | 180 | rna-XM_036389940.1 22173145 | 21 | 93664135 | 93664314 | Molothrus ater 84834 | TAG\|GTAAGGGCTC...GATGTTTTGCTT/AGCCTCCTGATG...TTTAG\|GGA | 1 | 1 | 58.631 |
| 120104317 | GT-AG | 0 | 1.000000099473604e-05 | 406 | rna-XM_036389940.1 22173145 | 23 | 93664438 | 93664843 | Molothrus ater 84834 | AAG\|GTCAGGCTGC...CTGATCTTACAG/TCTGATCTTACA...ACCAG\|GCA | 0 | 1 | 61.351 |
| 120104318 | GT-AG | 0 | 1.000000099473604e-05 | 395 | rna-XM_036389940.1 22173145 | 24 | 93664896 | 93665290 | Molothrus ater 84834 | AAG\|GTAGGTGCAG...ATGTTTTTGCTG/GTTGTTCTGATG...TGCAG\|GAA | 1 | 1 | 62.511 |
| 120104319 | GT-AG | 0 | 1.000000099473604e-05 | 361 | rna-XM_036389940.1 22173145 | 25 | 93665375 | 93665735 | Molothrus ater 84834 | TAG\|GTGAGTTTTT...GCTCCCAAAGCT/ATGGCTCCCAAA...CACAG\|GTG | 1 | 1 | 64.384 |
| 120104320 | GT-AG | 0 | 1.000000099473604e-05 | 304 | rna-XM_036389940.1 22173145 | 26 | 93665913 | 93666216 | Molothrus ater 84834 | GCG\|GTGAGTGAAT...AGCATCTTCACT/AGCATCTTCACT...TGCAG\|GAT | 1 | 1 | 68.332 |
| 120104321 | GT-AG | 0 | 1.000000099473604e-05 | 862 | rna-XM_036389940.1 22173145 | 27 | 93666299 | 93667160 | Molothrus ater 84834 | TTG\|GTAAGCAACA...TCCTCCTTCCTT/TGACAATTCAGT...TCTAG\|GCT | 2 | 1 | 70.161 |
| 120104322 | GT-AG | 0 | 1.3316862676276867e-05 | 592 | rna-XM_036389940.1 22173145 | 28 | 93667318 | 93667909 | Molothrus ater 84834 | ATG\|GTATGACATG...TCTCCCTTCTCA/TCCCTTCTCATA...TGCAG\|GGT | 0 | 1 | 73.662 |
| 120104323 | GT-AG | 0 | 1.000000099473604e-05 | 156 | rna-XM_036389940.1 22173145 | 29 | 93667985 | 93668140 | Molothrus ater 84834 | TCC\|GTGAGTATCG...CTGCCCATAACG/AGTGCTTTCAAT...CCTAG\|GAC | 0 | 1 | 75.335 |
| 120104324 | GT-AG | 0 | 0.0002561715967667 | 476 | rna-XM_036389940.1 22173145 | 30 | 93668313 | 93668788 | Molothrus ater 84834 | CAG\|GTACCACAGA...ACTCCATTAATG/AATGTTCCCACT...TCTAG\|ATG | 1 | 1 | 79.17 |
| 120104325 | GT-AG | 0 | 1.000000099473604e-05 | 276 | rna-XM_036389940.1 22173145 | 31 | 93669004 | 93669279 | Molothrus ater 84834 | CAG\|GTAATCCCCT...CTTTTCCTGATA/CTTTTCCTGATA...CTCAG\|GAC | 0 | 1 | 83.965 |
| 120104326 | GT-AG | 0 | 1.000000099473604e-05 | 781 | rna-XM_036389940.1 22173145 | 32 | 93669496 | 93670276 | Molothrus ater 84834 | CAG\|GTAAGCCCAG...AGCTTCTTTCTG/TTCTCTGCCATC...GGCAG\|GCC | 0 | 1 | 88.782 |
| 120104327 | GT-AG | 0 | 1.000000099473604e-05 | 654 | rna-XM_036389940.1 22173145 | 33 | 93670393 | 93671046 | Molothrus ater 84834 | TCG\|GTAGGAGACA...GTAACATTAGTG/TCATGGCTCAGA...TGCAG\|CTA | 2 | 1 | 91.369 |
| 120104328 | GT-AG | 0 | 1.000000099473604e-05 | 283 | rna-XM_036389940.1 22173145 | 34 | 93671138 | 93671420 | Molothrus ater 84834 | GAG\|GTGAGATCGT...TGCTTTTTATTA/CTGCTTTTTATT...CTCAG\|CTA | 0 | 1 | 93.399 |
| 120104329 | GT-AG | 0 | 1.8338142714352512e-05 | 1213 | rna-XM_036389940.1 22173145 | 35 | 93671490 | 93672702 | Molothrus ater 84834 | GAA\|GTAAGTGTAA...TTGTTTTTAATC/TTGTTTTTAATC...CTTAG\|CTG | 0 | 1 | 94.938 |
| 120104330 | GT-AG | 0 | 1.000000099473604e-05 | 682 | rna-XM_036389940.1 22173145 | 36 | 93672806 | 93673487 | Molothrus ater 84834 | CAG\|GTGAGCTCAG...GGGGTTTTGACA/ATGTGTCTCACC...TTCAG\|AAG | 1 | 1 | 97.235 |
| 120112044 | GT-AG | 0 | 1.000000099473604e-05 | 1047 | rna-XM_036389940.1 22173145 | 37 | 93673530 | 93674576 | Molothrus ater 84834 | GAG\|GTGAGGGCCC...TTAGTTTTAATA/TTAGTTTTAATA...CACAG\|GAA | | 0 | 98.171 |
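Two properties of the listing can be checked with simple arithmetic. The sketch below uses values copied from the table; the variable and function names are illustrative only. It confirms that, for the sampled rows, length equals end - start + 1 (which suggests inclusive coordinates), and it finds which ordinal_index values are absent, which accounts for 35 rows under a maximum ordinal_index of 37. The page does not say why those ordinals are missing.

```python
# (ordinal_index, start, end, length) for the first three rows of the table.
SAMPLE = [
    (1, 93650395, 93652758, 2364),
    (2, 93652946, 93653264, 319),
    (3, 93653428, 93654155, 728),
]


def inclusive_length(start: int, end: int) -> int:
    """Length of a span whose start and end positions are both included."""
    return end - start + 1


lengths_ok = all(inclusive_length(s, e) == n for _, s, e, n in SAMPLE)

# ordinal_index values as they appear in the table: 1-8, 10-21, 23-37.
ORDINALS = list(range(1, 9)) + list(range(10, 22)) + list(range(23, 38))
missing = sorted(set(range(1, max(ORDINALS) + 1)) - set(ORDINALS))

print(lengths_ok)     # True
print(len(ORDINALS))  # 35
print(missing)        # [9, 22]
```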
CREATE TABLE "introns" (
    "id" INTEGER,
    "dinucleotide_pair" TEXT,
    "is_minor" INTEGER,
    "score" REAL,
    "length" INTEGER,
    "transcript_id" INTEGER,
    "ordinal_index" INTEGER,
    "start" INTEGER,
    "end" INTEGER,
    "taxonomy_id" INTEGER,
    "scored_motifs" TEXT,
    "phase" INTEGER,
    "in_cds" INTEGER,
    "relative_position" REAL,
    PRIMARY KEY ([id]),
    FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
    FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
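The filter behind this page (transcript_id = 22173145) can be reproduced against a local copy of the schema. The following is a minimal sketch using Python's standard sqlite3 module under several assumptions: the database is in-memory, only two rows copied from the table are loaded, only the transcript_id index is recreated, and the referenced transcripts and genomes tables are not created (SQLite does not enforce foreign keys unless `PRAGMA foreign_keys` is switched on, so the inserts still succeed).

```python
import sqlite3

# Schema from the page; only one of the seven indexes is recreated here.
SCHEMA = """
CREATE TABLE "introns" (
    "id" INTEGER,
    "dinucleotide_pair" TEXT,
    "is_minor" INTEGER,
    "score" REAL,
    "length" INTEGER,
    "transcript_id" INTEGER,
    "ordinal_index" INTEGER,
    "start" INTEGER,
    "end" INTEGER,
    "taxonomy_id" INTEGER,
    "scored_motifs" TEXT,
    "phase" INTEGER,
    "in_cds" INTEGER,
    "relative_position" REAL,
    PRIMARY KEY ([id]),
    FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
    FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
"""

# Two rows copied from the table (ids 120104297 and 120104298).
ROWS = [
    (120104297, "GT-AG", 0, 1.000000099473604e-05, 2364, 22173145, 1,
     93650395, 93652758, 84834,
     "GCT|GTAAGAACAT...TTTTTTTTTTCC/TTTTTTTTCCCC...AACAG|AAA", 2, 1, 1.583),
    (120104298, "GT-AG", 0, 1.000000099473604e-05, 319, 22173145, 2,
     93652946, 93653264, 84834,
     "CAG|GTGAGGAAGA...CCATTCTGAATG/ATGGTTATAACT...TGCAG|GTC", 0, 1, 5.754),
]

conn = sqlite3.connect(":memory:")
conn.executescript(SCHEMA)
conn.executemany(
    "INSERT INTO introns VALUES (?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?)",
    ROWS,
)

QUERY = (
    "SELECT id, ordinal_index, length FROM introns "
    "WHERE transcript_id = ? ORDER BY ordinal_index"
)
matches = conn.execute(QUERY, (22173145,)).fetchall()

# The last column of each EXPLAIN QUERY PLAN row is a human-readable detail.
plan = conn.execute("EXPLAIN QUERY PLAN " + QUERY, (22173145,)).fetchall()
plan_text = " ".join(row[-1] for row in plan)

print(matches)
print(plan_text)
```

The plan text is expected to name idx_introns_transcript_id, meaning the equality filter is answered through that index rather than a full table scan.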