introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, probability (0-100%) that the intron is minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
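The column descriptions above can be mirrored in a small typed record. The following is a minimal sketch, not part of the database: the `Intron` class and its `validate` method are hypothetical helpers, and the range checks only encode what the descriptions state (0/1 flags, a 0-100 score, a phase of 0, 1, 2 or null). The assumption that ordinal_index starts at 1 is inferred from the "3 for the third intron" example.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Intron:
    """One row of the introns table (hypothetical helper for illustration)."""
    id: int
    dinucleotide_pair: str
    is_minor: int
    score: float
    length: int
    transcript_id: int
    ordinal_index: int
    start: int
    end: int
    taxonomy_id: int
    scored_motifs: str
    phase: Optional[int]  # None for introns outside the coding sequence
    in_cds: int
    relative_position: float

    def validate(self) -> None:
        """Raise ValueError if a field falls outside its documented range."""
        if self.is_minor not in (0, 1):
            raise ValueError("is_minor must be 0 or 1")
        if self.in_cds not in (0, 1):
            raise ValueError("in_cds must be 0 or 1")
        if self.phase not in (None, 0, 1, 2):
            raise ValueError("phase must be 0, 1, 2 or None")
        if not 0.0 <= self.score <= 100.0:
            raise ValueError("score must lie between 0 and 100")
        if self.ordinal_index < 1:
            # Assumption: ordinal positions are 1-based.
            raise ValueError("ordinal_index must be at least 1")


# Values copied from the row with id 120102789 on this page.
example = Intron(
    id=120102789,
    dinucleotide_pair="GT-AG",
    is_minor=0,
    score=7.249301457430677e-05,
    length=3577,
    transcript_id=22173095,
    ordinal_index=2,
    start=20586052,
    end=20589628,
    taxonomy_id=84834,
    scored_motifs="CAG|GTATGGTATT...TAGTACTTGAAG/GTATTTTCCATC...TTTAG|ACT",
    phase=0,
    in_cds=1,
    relative_position=5.344,
)
example.validate()  # passes silently for a well-formed row
```

Keeping phase as `Optional[int]` preserves the distinction between phase 0 and a null phase, which a plain integer default would blur.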
44 rows where transcript_id = 22173095
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 120102789 | GT-AG | 0 | 7.249301457430677e-05 | 3577 | rna-XM_036386669.1 22173095 | 2 | 20586052 | 20589628 | Molothrus ater 84834 | CAG\|GTATGGTATT...TAGTACTTGAAG/GTATTTTCCATC...TTTAG\|ACT | 0 | 1 | 5.344 |
| 120102790 | GT-AG | 0 | 0.0002860871197153 | 303 | rna-XM_036386669.1 22173095 | 3 | 20585603 | 20585905 | Molothrus ater 84834 | CAG\|GTATTTTCAC...GATTTTTCAGAA/TGATTTTTCAGA...TATAG\|GCC | 2 | 1 | 7.171 |
| 120102791 | GT-AG | 0 | 0.0002199408114058 | 4052 | rna-XM_036386669.1 22173095 | 4 | 20581471 | 20585522 | Molothrus ater 84834 | AAG\|GTAACTGAAA...TCTTTCTTATTT/TTCTTTCTTATT...AATAG\|GCC | 1 | 1 | 8.173 |
| 120102792 | GT-AG | 0 | 2.754347676962964e-05 | 332 | rna-XM_036386669.1 22173095 | 5 | 20581026 | 20581357 | Molothrus ater 84834 | CAT\|GTGAGTTTTT...ACATTCTTATTT/TTTTTTTTTATG...AACAG\|AGA | 0 | 1 | 9.587 |
| 120102793 | GT-AG | 0 | 1.000000099473604e-05 | 1181 | rna-XM_036386669.1 22173095 | 6 | 20579626 | 20580806 | Molothrus ater 84834 | AAG\|GTGTGAGAGC...TATTTCTAAATG/ATGGTACTTAGT...TACAG\|GGT | 0 | 1 | 12.328 |
| 120102794 | GT-AG | 0 | 0.0012096432652643 | 258 | rna-XM_036386669.1 22173095 | 7 | 20579252 | 20579509 | Molothrus ater 84834 | CAA\|GTAAGCTTTT...TTTTTCTTTGTT/TGACTATTGATT...TTTAG\|GCC | 2 | 1 | 13.78 |
| 120102795 | GT-AG | 0 | 1.000000099473604e-05 | 77 | rna-XM_036386669.1 22173095 | 8 | 20578923 | 20578999 | Molothrus ater 84834 | TAG\|GTAAGATGCT...CATGTCTTACAT/ACATGTCTTACA...CACAG\|GTT | 2 | 1 | 16.934 |
| 120102796 | GT-AG | 0 | 1.000000099473604e-05 | 1493 | rna-XM_036386669.1 22173095 | 9 | 20577291 | 20578783 | Molothrus ater 84834 | GCA\|GTAAGTGCTC...TTTCTTTTTATA/TTTCTTTTTATA...TTCAG\|GAA | 0 | 1 | 18.673 |
| 120102797 | GT-AG | 0 | 1.000000099473604e-05 | 3681 | rna-XM_036386669.1 22173095 | 10 | 20573457 | 20577137 | Molothrus ater 84834 | CAG\|GTAAATAATT...TGCTGCTTGACT/GCTTGACTTATT...GACAG\|GCA | 0 | 1 | 20.588 |
| 120102798 | GT-AG | 0 | 0.0001181876506601 | 1468 | rna-XM_036386669.1 22173095 | 11 | 20571884 | 20573351 | Molothrus ater 84834 | AAG\|GTAATTTTGT...TTTTTTTTAAAT/TTTTTTTTAAAT...CTTAG\|GCA | 0 | 1 | 21.902 |
| 120102799 | GT-AG | 0 | 1.000000099473604e-05 | 830 | rna-XM_036386669.1 22173095 | 12 | 20570847 | 20571676 | Molothrus ater 84834 | CAG\|GTGAGTAAAC...TTGATTTTGATT/TTGATTTTGATT...TTAAG\|GAT | 0 | 1 | 24.493 |
| 120102800 | GT-AG | 0 | 1.000000099473604e-05 | 5077 | rna-XM_036386669.1 22173095 | 13 | 20565633 | 20570709 | Molothrus ater 84834 | GAG\|GTAAGAATGG...CTTCCTCTAACA/TTGTGTTTTAAT...TTCAG\|TCA | 2 | 1 | 26.208 |
| 120102801 | GT-AG | 0 | 1.000000099473604e-05 | 4251 | rna-XM_036386669.1 22173095 | 14 | 20561248 | 20565498 | Molothrus ater 84834 | AAG\|GTGAGTTTTA...CTTTTCTGAGTG/AAATGTTTTATT...TGCAG\|AGC | 1 | 1 | 27.885 |
| 120102802 | GT-AG | 0 | 1.548742170117344e-05 | 1558 | rna-XM_036386669.1 22173095 | 15 | 20559602 | 20561159 | Molothrus ater 84834 | ACG\|GTGAGCTTTA...GTGCTGTTAATT/GTGCTGTTAATT...TCAAG\|ATT | 2 | 1 | 28.986 |
| 120102803 | GT-AG | 0 | 1.000000099473604e-05 | 1415 | rna-XM_036386669.1 22173095 | 16 | 20557844 | 20559258 | Molothrus ater 84834 | AGG\|GTGAGTTAAT...CAAGTTTTAAGA/TTTTTTCCAATT...GTTAG\|GTT | 0 | 1 | 33.279 |
| 120102804 | GT-AG | 0 | 1.000000099473604e-05 | 1589 | rna-XM_036386669.1 22173095 | 17 | 20556159 | 20557747 | Molothrus ater 84834 | CAG\|GTAAGAAAAA...AATTTCTTAATG/CAATTTCTTAAT...ATTAG\|GTA | 0 | 1 | 34.481 |
| 120102805 | GT-AG | 0 | 5.634930553273332e-05 | 1541 | rna-XM_036386669.1 22173095 | 18 | 20554406 | 20555946 | Molothrus ater 84834 | AAG\|GTATGTGTGA...ATAACTTTTGCT/TTTCTATTTATT...TTCAG\|AGC | 2 | 1 | 37.134 |
| 120102806 | GT-AG | 0 | 1.000000099473604e-05 | 208 | rna-XM_036386669.1 22173095 | 19 | 20553957 | 20554164 | Molothrus ater 84834 | ACG\|GTTAGTGTTG...AGAATCTTAATT/AGAATCTTAATT...TTCAG\|CTA | 0 | 1 | 40.15 |
| 120102807 | GT-AG | 0 | 0.0006104104040863 | 103 | rna-XM_036386669.1 22173095 | 20 | 20553704 | 20553806 | Molothrus ater 84834 | GTG\|GTAAGCATTT...ATTTTTTTAATT/ATTTTTTTAATT...TTAAG\|ATC | 0 | 1 | 42.028 |
| 120102808 | GT-AG | 0 | 1.000000099473604e-05 | 464 | rna-XM_036386669.1 22173095 | 21 | 20553119 | 20553582 | Molothrus ater 84834 | CAG\|GTAAAACTTA...AGTACTTTTGCT/AAAATGCTTATA...TGCAG\|ATA | 1 | 1 | 43.542 |
| 120102809 | GT-AG | 0 | 1.000000099473604e-05 | 258 | rna-XM_036386669.1 22173095 | 22 | 20552730 | 20552987 | Molothrus ater 84834 | GAG\|GTTAGTGATG...TTTCCATTAGTT/CATTAGTTAACT...TACAG\|GTA | 0 | 1 | 45.181 |
| 120102810 | GT-AG | 0 | 1.000000099473604e-05 | 3149 | rna-XM_036386669.1 22173095 | 23 | 20549302 | 20552450 | Molothrus ater 84834 | CCG\|GTAATATAAA...TTTTTTTTTTCC/TTTTTTCCCATG...TCCAG\|ATA | 0 | 1 | 48.673 |
| 120102811 | GT-AG | 0 | 1.000000099473604e-05 | 2008 | rna-XM_036386669.1 22173095 | 24 | 20547168 | 20549175 | Molothrus ater 84834 | GAG\|GTTGGTGAAT...GTTGCTTTAGTT/CTTTAGTTAATT...TTCAG\|GCT | 0 | 1 | 50.25 |
| 120102812 | GT-AG | 0 | 1.000000099473604e-05 | 2904 | rna-XM_036386669.1 22173095 | 25 | 20544138 | 20547041 | Molothrus ater 84834 | AAG\|GTAAGAATTC...TTTTCTGTACCC/TCTGTACCCACT...CACAG\|ACA | 0 | 1 | 51.827 |
| 120102813 | GT-AG | 0 | 9.263445026788178e-05 | 3323 | rna-XM_036386669.1 22173095 | 26 | 20540648 | 20543970 | Molothrus ater 84834 | CAA\|GTAAGTTTTG...TTGAATTTAATG/TTTCTATTCAAT...TCCAG\|AAC | 2 | 1 | 53.917 |
| 120102814 | GT-AG | 0 | 0.0001419194583003 | 583 | rna-XM_036386669.1 22173095 | 27 | 20539956 | 20540538 | Molothrus ater 84834 | GGG\|GTAAGTTTTC...AATTCTTTGCTT/CAGTTATAAATT...CTCAG\|AGC | 0 | 1 | 55.282 |
| 120102815 | GT-AG | 0 | 1.955981893789643e-05 | 129 | rna-XM_036386669.1 22173095 | 28 | 20539680 | 20539808 | Molothrus ater 84834 | AGG\|GTAAGTTTCA...AGTGTCCTAACT/AGTGTCCTAACT...GGCAG\|GAT | 0 | 1 | 57.121 |
| 120102816 | GT-AG | 0 | 5.1882995162245127e-05 | 809 | rna-XM_036386669.1 22173095 | 29 | 20538724 | 20539532 | Molothrus ater 84834 | AAA\|GTAAGTTCAA...ATTGTCTTATTT/TATTGTCTTATT...TTCAG\|GAG | 0 | 1 | 58.961 |
| 120102817 | GT-AG | 0 | 0.0045484584672294 | 2292 | rna-XM_036386669.1 22173095 | 30 | 20536209 | 20538500 | Molothrus ater 84834 | CTA\|GTATGTAATA...AGTTCCCTGACC/CTTTATTTTATA...CATAG\|CTT | 1 | 1 | 61.752 |
| 120102818 | GT-AG | 0 | 1.000000099473604e-05 | 406 | rna-XM_036386669.1 22173095 | 31 | 20535582 | 20535987 | Molothrus ater 84834 | GAG\|GTAAATTTAA...AATTTTCTGCTT/ATGGAAGTAAAT...TTCAG\|AGC | 0 | 1 | 64.518 |
| 120102819 | GT-AG | 0 | 1.000000099473604e-05 | 421 | rna-XM_036386669.1 22173095 | 32 | 20534970 | 20535390 | Molothrus ater 84834 | CAG\|GTAAGTAGAA...TGTGCTTTCATT/TGTGCTTTCATT...TTCAG\|ACT | 2 | 1 | 66.909 |
| 120102820 | GT-AG | 0 | 1.000000099473604e-05 | 1884 | rna-XM_036386669.1 22173095 | 33 | 20532912 | 20534795 | Molothrus ater 84834 | TCG\|GTAAGTGTTG...TACCTCTTCATT/GCCTTTCTCATA...GCTAG\|GTA | 2 | 1 | 69.086 |
| 120102821 | GT-AG | 0 | 1.000000099473604e-05 | 175 | rna-XM_036386669.1 22173095 | 34 | 20532595 | 20532769 | Molothrus ater 84834 | AAG\|GTAATGATTC...GAGCTGGTAACT/TAACTGTTCACG...TTTAG\|GTG | 0 | 1 | 70.864 |
| 120102822 | GT-AG | 0 | 1.000000099473604e-05 | 506 | rna-XM_036386669.1 22173095 | 35 | 20531338 | 20531843 | Molothrus ater 84834 | CAG\|GTTAGTATGG...TTGCTTTTACTT/AGTATTCTGATT...TTTAG\|GAC | 1 | 1 | 80.263 |
| 120102823 | GT-AG | 0 | 0.0226483672555286 | 1255 | rna-XM_036386669.1 22173095 | 36 | 20529959 | 20531213 | Molothrus ater 84834 | TTG\|GTATTTTGTC...TTGCTCATAACT/CATTTGCTCATA...TGCAG\|GTA | 2 | 1 | 81.815 |
| 120102824 | GT-AG | 0 | 4.210545418830863e-05 | 1022 | rna-XM_036386669.1 22173095 | 37 | 20528711 | 20529732 | Molothrus ater 84834 | CAG\|GTATTTAATT...CAGCCACTAACA/GATAGATTCAGC...TGTAG\|GCT | 0 | 1 | 84.643 |
| 120102825 | GT-AG | 0 | 1.000000099473604e-05 | 1628 | rna-XM_036386669.1 22173095 | 38 | 20526953 | 20528580 | Molothrus ater 84834 | TAG\|GTAAGCAAAT...TTTCCCTTCCCT/AATATTTTCAGT...TAAAG\|GTG | 1 | 1 | 86.27 |
| 120102826 | GT-AG | 0 | 3.304443582680591e-05 | 1677 | rna-XM_036386669.1 22173095 | 39 | 20525090 | 20526766 | Molothrus ater 84834 | ATG\|GTACAGTGAA...CACCTTTTAATC/CACCTTTTAATC...TCCAG\|GTA | 1 | 1 | 88.598 |
| 120102827 | GT-AG | 0 | 1.000000099473604e-05 | 1051 | rna-XM_036386669.1 22173095 | 40 | 20523818 | 20524868 | Molothrus ater 84834 | CAG\|GTAAAAAAAA...TGCATTTTCACT/TGCATTTTCACT...TTTAG\|GTT | 0 | 1 | 91.364 |
| 120102828 | GT-AG | 0 | 1.000000099473604e-05 | 2321 | rna-XM_036386669.1 22173095 | 41 | 20521408 | 20523728 | Molothrus ater 84834 | CAG\|GTCAGTATTT...AACTCTGTAATA/AGTTTTGTCATT...TCTAG\|GAT | 2 | 1 | 92.478 |
| 120102829 | GT-AG | 0 | 1.000000099473604e-05 | 397 | rna-XM_036386669.1 22173095 | 42 | 20520854 | 20521250 | Molothrus ater 84834 | CAG\|GTGATACCTT...ATAATTGTGACT/GACTTGTTCATT...TGCAG\|AGC | 0 | 1 | 94.443 |
| 120102830 | GT-AG | 0 | 1.000000099473604e-05 | 443 | rna-XM_036386669.1 22173095 | 43 | 20520198 | 20520640 | Molothrus ater 84834 | GAG\|GTAAAGAATT...ATGTTTTTAAGA/ATGTTTTTAAGA...TTTAG\|GAA | 0 | 1 | 97.109 |
| 120102831 | GT-AG | 0 | 1.000000099473604e-05 | 1439 | rna-XM_036386669.1 22173095 | 44 | 20518663 | 20520101 | Molothrus ater 84834 | CAG\|GTAAAAGCTC...ATCCCTTTGAAA/ACAGGGTTAAAT...TGCAG\|AAT | 0 | 1 | 98.31 |
| 120112008 | GT-AG | 0 | 1.000000099473604e-05 | 23585 | rna-XM_036386669.1 22173095 | 1 | 20589883 | 20613467 | Molothrus ater 84834 | GGG\|GTGAGTGCGG...TTGTCTTTTTTT/CATATGTTAATA...CTCAG\|ATT |  | 0 | 4.143 |
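Two regularities appear to hold in the rows above, although the page does not state either of them: length equals end - start + 1 (inclusive coordinates), and the text between the two `|` characters in scored_motifs begins and ends with the dinucleotides reported in dinucleotide_pair. The sketch below checks both on one row. The function names are hypothetical, and reading `|` as the exon/intron boundary is an assumption drawn only from these rows.

```python
def inclusive_length(start: int, end: int) -> int:
    """Length of a span whose start and end positions are both included."""
    return end - start + 1


def terminal_dinucleotides(scored_motifs: str) -> str:
    """Return the first and last two intron bases, e.g. 'GT-AG'.

    Assumes the layout 'EXON|INTRON...MOTIFS...INTRON|EXON', where the
    two pipes mark the exon/intron boundaries.
    """
    parts = scored_motifs.split("|")
    if len(parts) != 3:
        raise ValueError("expected exactly two '|' separators")
    intron_part = parts[1]
    return f"{intron_part[:2]}-{intron_part[-2:]}"


# Values copied from the row with id 120102790.
row = {
    "dinucleotide_pair": "GT-AG",
    "length": 303,
    "start": 20585603,
    "end": 20585905,
    "scored_motifs": "CAG|GTATTTTCAC...GATTTTTCAGAA/TGATTTTTCAGA...TATAG|GCC",
}

length_ok = inclusive_length(row["start"], row["end"]) == row["length"]
pair_ok = terminal_dinucleotides(row["scored_motifs"]) == row["dinucleotide_pair"]
print(length_ok, pair_ok)
```

Checks like these are only a sanity test for an import or export step; they say nothing about how the database itself derives these columns.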
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
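As a rough illustration of how this schema and the transcript_id index serve the filter shown on this page, the sketch below rebuilds the table in an in-memory SQLite database using Python's standard `sqlite3` module, loads three of the rows listed above (a subset of columns only), and runs the same `transcript_id = 22173095` filter. The in-memory database and the variable names are assumptions made for the example. The referenced transcripts and genomes tables are not created, so foreign key enforcement is switched off explicitly.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# The parent tables are absent in this sketch, so do not enforce foreign keys.
conn.execute("PRAGMA foreign_keys = OFF")
conn.executescript(
    """
    CREATE TABLE "introns" (
      "id" INTEGER,
      "dinucleotide_pair" TEXT,
      "is_minor" INTEGER,
      "score" REAL,
      "length" INTEGER,
      "transcript_id" INTEGER,
      "ordinal_index" INTEGER,
      "start" INTEGER,
      "end" INTEGER,
      "taxonomy_id" INTEGER,
      "scored_motifs" TEXT,
      "phase" INTEGER,
      "in_cds" INTEGER,
      "relative_position" REAL,
      PRIMARY KEY ([id]),
      FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
      FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
    );
    CREATE INDEX [idx_introns_transcript_id]
      ON [introns] ([transcript_id]);
    """
)

# (id, transcript_id, ordinal_index, start, end, length, taxonomy_id),
# copied from three rows of the table on this page.
rows = [
    (120102789, 22173095, 2, 20586052, 20589628, 3577, 84834),
    (120102790, 22173095, 3, 20585603, 20585905, 303, 84834),
    (120112008, 22173095, 1, 20589883, 20613467, 23585, 84834),
]
conn.executemany(
    'INSERT INTO introns (id, transcript_id, ordinal_index, "start", "end",'
    " length, taxonomy_id) VALUES (?, ?, ?, ?, ?, ?, ?)",
    rows,
)

# Same filter as the page, but ordered by position in the transcript
# rather than by id.
ordered = conn.execute(
    "SELECT id, ordinal_index FROM introns"
    " WHERE transcript_id = ? ORDER BY ordinal_index",
    (22173095,),
).fetchall()

# Ask SQLite how it would execute the filter; the last field of each
# plan row is the human-readable detail string.
plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT id FROM introns WHERE transcript_id = ?",
    (22173095,),
).fetchall()
plan_text = " ".join(str(step[-1]) for step in plan)

print(ordered)
print(plan_text)
```

Ordering by ordinal_index rather than id matters here: the first intron of this transcript has the largest id (120112008), so the default id ordering lists it last.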