introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, estimated probability (0-100%) that the intron is minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
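The rows shown on this page suggest that start and end are inclusive coordinates, so that length = end - start + 1. The sketch below checks that relation against three rows copied from the table on this page. The helper name `intron_length` is hypothetical, and the inclusive-coordinate convention is an inference from these rows, not something the field descriptions state.

```python
def intron_length(start: int, end: int) -> int:
    """Length in base pairs, ASSUMING start and end are both inclusive."""
    return end - start + 1


# (id, start, end, length) copied from rows shown on this page
rows = [
    (197656830, 35826441, 35827143, 703),
    (197656831, 35826028, 35826142, 115),
    (197656840, 35822997, 35823286, 290),
]

# Collect the ids of any rows where the stored length disagrees
mismatches = [
    row_id
    for row_id, start, end, length in rows
    if intron_length(start, end) != length
]
print(mismatches)
```

An empty list means the three sampled rows are consistent with the assumption; it does not prove the convention holds for every row in the database.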
50 rows where transcript_id = 35103464
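The filter above corresponds to a simple `WHERE transcript_id = ?` query. As a minimal sketch, assuming a local SQLite copy of the introns table, the following builds a tiny in-memory stand-in (only a few columns, two rows copied from this page plus one made-up row for a different transcript) and runs the same filter. The function name `introns_for_transcript` is hypothetical.

```python
import sqlite3

# In-memory stand-in for the real database; only a subset of columns is kept.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE introns ("
    "id INTEGER PRIMARY KEY, transcript_id INTEGER, "
    "ordinal_index INTEGER, dinucleotide_pair TEXT, length INTEGER)"
)
conn.executemany(
    "INSERT INTO introns VALUES (?, ?, ?, ?, ?)",
    [
        (197656830, 35103464, 1, "GT-AG", 703),   # copied from this page
        (197656840, 35103464, 11, "GC-AG", 290),  # copied from this page
        (1, 999, 1, "GT-AG", 100),                # made-up row, other transcript
    ],
)


def introns_for_transcript(conn, transcript_id):
    """Return the introns of one transcript, ordered by position in it."""
    cur = conn.execute(
        "SELECT id, ordinal_index, dinucleotide_pair, length "
        "FROM introns WHERE transcript_id = ? ORDER BY ordinal_index",
        (transcript_id,),
    )
    return cur.fetchall()


result = introns_for_transcript(conn, 35103464)
print(len(result))
```

The parameterized `?` placeholder keeps the transcript id out of the SQL string, which is the usual way to pass values with Python's `sqlite3` module.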
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 197656830 | GT-AG | 0 | 0.0002556852284741 | 703 | rna-XM_007052123.2 35103464 | 1 | 35826441 | 35827143 | Theobroma cacao 3641 | CAG\|GTTCTTTCTT...ATTTTCTTAATG/AATTTTCTTAAT...GGCAG\|TTC | 0 | 1 | 1.367 |
| 197656831 | GT-AG | 0 | 1.000000099473604e-05 | 115 | rna-XM_007052123.2 35103464 | 2 | 35826028 | 35826142 | Theobroma cacao 3641 | GCG\|GTAAGTTGCA...ACACTGTTGATC/ACACTGTTGATC...TTTAG\|GGA | 1 | 1 | 5.482 |
| 197656832 | GT-AG | 0 | 2.3261867636480636e-05 | 166 | rna-XM_007052123.2 35103464 | 3 | 35825633 | 35825798 | Theobroma cacao 3641 | AAG\|GTACTGATTT...TGTTCCTTTTCT/CATCAACTTAAA...TGCAG\|GAA | 2 | 1 | 8.644 |
| 197656833 | GT-AG | 0 | 1.000000099473604e-05 | 108 | rna-XM_007052123.2 35103464 | 4 | 35825185 | 35825292 | Theobroma cacao 3641 | CTT\|GTGAGTTATG...CTGTTTTTAGAA/ACTGTTTTTAGA...TTCAG\|GCT | 0 | 1 | 13.339 |
| 197656834 | GT-AG | 0 | 0.0005713537148676 | 114 | rna-XM_007052123.2 35103464 | 5 | 35824957 | 35825070 | Theobroma cacao 3641 | CAG\|GTATTATTGT...ATTTGTTTAATT/TTATTTTTCATT...TGCAG\|GAA | 0 | 1 | 14.913 |
| 197656835 | GT-AG | 0 | 1.000000099473604e-05 | 83 | rna-XM_007052123.2 35103464 | 6 | 35824745 | 35824827 | Theobroma cacao 3641 | CTG\|GTAAGAGCTC...TTCTTCTTCTCG/ATACTGTTCAAT...TGCAG\|GCC | 0 | 1 | 16.694 |
| 197656836 | GT-AG | 0 | 0.0011601316155007 | 98 | rna-XM_007052123.2 35103464 | 7 | 35824455 | 35824552 | Theobroma cacao 3641 | AAT\|GTACATATCT...GCATCTTTGTTT/TTTGGGTTGATA...TGCAG\|ACG | 0 | 1 | 19.345 |
| 197656837 | GT-AG | 0 | 0.0001362805256685 | 267 | rna-XM_007052123.2 35103464 | 8 | 35823993 | 35824259 | Theobroma cacao 3641 | AAG\|GTAAATTTGG...TGTATCTTATCA/TTGTATCTTATC...TGCAG\|ATT | 0 | 1 | 22.038 |
| 197656838 | GT-AG | 0 | 1.000000099473604e-05 | 101 | rna-XM_007052123.2 35103464 | 9 | 35823754 | 35823854 | Theobroma cacao 3641 | GAG\|GTAATATCGA...GAGCTATTAGTT/TTTGGGTTGACA...TTCAG\|GAG | 0 | 1 | 23.944 |
| 197656839 | GT-AG | 0 | 0.0053868215525492 | 90 | rna-XM_007052123.2 35103464 | 10 | 35823541 | 35823630 | Theobroma cacao 3641 | AAG\|GTATTATTTC...ATGTTTTTATCT/CATGTTTTTATC...TTCAG\|GTC | 0 | 1 | 25.642 |
| 197656840 | GC-AG | 0 | 1.000000099473604e-05 | 290 | rna-XM_007052123.2 35103464 | 11 | 35822997 | 35823286 | Theobroma cacao 3641 | AAG\|GCATGTTGTT...GGATGTTTAATA/GGATGTTTAATA...GACAG\|GGA | 2 | 1 | 29.149 |
| 197656841 | GT-AG | 0 | 1.000000099473604e-05 | 503 | rna-XM_007052123.2 35103464 | 12 | 35822390 | 35822892 | Theobroma cacao 3641 | TTG\|GTGAGTTATT...TTCTCTTTGAAC/TTTGAACTTATG...TTCAG\|GCT | 1 | 1 | 30.585 |
| 197656842 | GT-AG | 0 | 1.000000099473604e-05 | 126 | rna-XM_007052123.2 35103464 | 13 | 35822169 | 35822294 | Theobroma cacao 3641 | AAA\|GTTAGTGTTA...AATTCTCTAATT/AATTCTCTAATT...TACAG\|ATG | 0 | 1 | 31.897 |
| 197656843 | GT-AG | 0 | 1.000000099473604e-05 | 515 | rna-XM_007052123.2 35103464 | 14 | 35821417 | 35821931 | Theobroma cacao 3641 | GAG\|GTAAAACCAT...TTTTTTTTATCT/TTTTTTTTTATC...TGCAG\|TTA | 0 | 1 | 35.17 |
| 197656844 | GT-AG | 0 | 0.0206213383729518 | 76 | rna-XM_007052123.2 35103464 | 15 | 35821152 | 35821227 | Theobroma cacao 3641 | AAA\|GTATGCCTGG...ATATTATTACCA/CTAGAACTAATA...TGCAG\|CTG | 0 | 1 | 37.78 |
| 197656845 | GT-AG | 0 | 3.781740705219119e-05 | 143 | rna-XM_007052123.2 35103464 | 16 | 35820927 | 35821069 | Theobroma cacao 3641 | CTA\|GTAATAATTT...TTTTTTTTAAAT/TTTTTTTTAAAT...TACAG\|GAT | 1 | 1 | 38.912 |
| 197656846 | GT-AG | 0 | 0.0038832959309143 | 865 | rna-XM_007052123.2 35103464 | 17 | 35819994 | 35820858 | Theobroma cacao 3641 | GAG\|GTATTTAATG...CTTCTCTTAATC/CTTCTCTTAATC...TACAG\|GCT | 0 | 1 | 39.851 |
| 197656847 | GT-AG | 0 | 1.000000099473604e-05 | 115 | rna-XM_007052123.2 35103464 | 18 | 35819733 | 35819847 | Theobroma cacao 3641 | AAG\|GTCAAACTTT...ATTAACTTAATC/ATTGAATTAACT...TGCAG\|AGC | 2 | 1 | 41.867 |
| 197656848 | GT-AG | 0 | 0.0033234741122315 | 89 | rna-XM_007052123.2 35103464 | 19 | 35819535 | 35819623 | Theobroma cacao 3641 | AAG\|GTTTCTGTCT...ATATATTTGACC/AAAATATTTATT...GTTAG\|GTT | 0 | 1 | 43.372 |
| 197656849 | GT-AG | 0 | 0.0006237376938812 | 150 | rna-XM_007052123.2 35103464 | 20 | 35819238 | 35819387 | Theobroma cacao 3641 | GAG\|GTACTTGTTT...TTGTTCTTATTT/TTTGTTCTTATT...TCCAG\|GCA | 0 | 1 | 45.402 |
| 197656850 | GT-AG | 0 | 0.0003819493419297 | 81 | rna-XM_007052123.2 35103464 | 21 | 35819109 | 35819189 | Theobroma cacao 3641 | AAG\|GTATGATTAT...TTGTTCTGAACT/CTTGTTCTGAAC...TGCAG\|ATT | 0 | 1 | 46.065 |
| 197656851 | GT-AG | 0 | 2.6782989360885656e-05 | 127 | rna-XM_007052123.2 35103464 | 22 | 35818853 | 35818979 | Theobroma cacao 3641 | GAG\|GTATTGACAT...TGTGTTTTATTG/CTGTGTTTTATT...TTTAG\|GTA | 0 | 1 | 47.846 |
| 197656852 | GT-AG | 0 | 0.0002429657016225 | 81 | rna-XM_007052123.2 35103464 | 23 | 35818532 | 35818612 | Theobroma cacao 3641 | CAG\|GTTTGTTTCT...TTTTCTTTGAGT/TTTTCTTTGAGT...TTTAG\|TAT | 0 | 1 | 51.16 |
| 197656853 | GT-AG | 0 | 6.416170590385922e-05 | 96 | rna-XM_007052123.2 35103464 | 24 | 35818295 | 35818390 | Theobroma cacao 3641 | CAG\|GTTGTCTTTT...GAATTATTAACT/TTTTTGTTTATT...TGTAG\|CTT | 0 | 1 | 53.107 |
| 197656854 | GT-AG | 0 | 1.000000099473604e-05 | 89 | rna-XM_007052123.2 35103464 | 25 | 35818120 | 35818208 | Theobroma cacao 3641 | GAG\|GTTTGATGCC...TTGGTCTTGGCG/GTTGTATACATT...TGCAG\|GGT | 2 | 1 | 54.294 |
| 197656855 | GT-AG | 0 | 1.116903856471134e-05 | 483 | rna-XM_007052123.2 35103464 | 26 | 35817531 | 35818013 | Theobroma cacao 3641 | AAG\|GTCTGTAAAG...CTATTCTTAGCT/ATCATTTTAATC...TGTAG\|GAT | 0 | 1 | 55.758 |
| 197656856 | GT-AG | 0 | 1.000000099473604e-05 | 128 | rna-XM_007052123.2 35103464 | 27 | 35817307 | 35817434 | Theobroma cacao 3641 | AAG\|GTTTTGTCAC...ATGGTTTTAATC/ATGGTTTTAATC...TCTAG\|GAA | 0 | 1 | 57.084 |
| 197656857 | GT-AG | 0 | 1.8350440570490347e-05 | 428 | rna-XM_007052123.2 35103464 | 28 | 35816726 | 35817153 | Theobroma cacao 3641 | AAG\|GTCAGCTTGT...TTCCCTTTGACT/CTGGTTCTAACT...TATAG\|GCA | 0 | 1 | 59.196 |
| 197656858 | GT-AG | 0 | 0.0031962225703904 | 88 | rna-XM_007052123.2 35103464 | 29 | 35816464 | 35816551 | Theobroma cacao 3641 | GAG\|GTATTCATAT...TTAGTTTTGAGA/TTAGTTTTGAGA...TTTAG\|GAC | 0 | 1 | 61.599 |
| 197656859 | GT-AG | 0 | 1.000000099473604e-05 | 87 | rna-XM_007052123.2 35103464 | 30 | 35816162 | 35816248 | Theobroma cacao 3641 | GAG\|GTGTTACATT...TCTATTTGAACT/TTGTTGCTGATC...TTCAG\|TGC | 2 | 1 | 64.568 |
| 197656860 | GT-AG | 0 | 1.000000099473604e-05 | 84 | rna-XM_007052123.2 35103464 | 31 | 35815942 | 35816025 | Theobroma cacao 3641 | AAA\|GTAAGAAATT...TGCTTTTTACTA/TTTTTACTAATA...GCTAG\|GAA | 0 | 1 | 66.446 |
| 197656861 | GT-AG | 0 | 1.000000099473604e-05 | 528 | rna-XM_007052123.2 35103464 | 32 | 35815288 | 35815815 | Theobroma cacao 3641 | AAG\|GTTGAATATG...ATGTTTCTGACT/ATGTTTCTGACT...TTCAG\|TTG | 0 | 1 | 68.186 |
| 197656862 | GT-AG | 0 | 0.3463242752350556 | 242 | rna-XM_007052123.2 35103464 | 33 | 35815001 | 35815242 | Theobroma cacao 3641 | CAG\|GTATGCTTTA...TAATTCTTGACA/TGTTATTTAATT...TGTAG\|GGA | 0 | 1 | 68.807 |
| 197656863 | GT-AG | 0 | 0.0024382912445282 | 95 | rna-XM_007052123.2 35103464 | 34 | 35814828 | 35814922 | Theobroma cacao 3641 | AAG\|GTGTTCTAAT...TCATCCTTGACT/ATGATTCTCATC...TACAG\|GTT | 0 | 1 | 69.884 |
| 197656864 | GT-AG | 0 | 0.0027133028165441 | 418 | rna-XM_007052123.2 35103464 | 35 | 35814293 | 35814710 | Theobroma cacao 3641 | TGG\|GTATGTTCTA...TCTGTGTTGATT/TCTGTGTTGATT...TACAG\|GTG | 0 | 1 | 71.5 |
| 197656865 | GT-AG | 0 | 0.0195274250924256 | 76 | rna-XM_007052123.2 35103464 | 36 | 35814090 | 35814165 | Theobroma cacao 3641 | ATA\|GTATGTATTA...AAATTCTAAATA/AAAATTCTAAAT...TGCAG\|AAG | 1 | 1 | 73.253 |
| 197656866 | GT-AG | 0 | 1.5929483704946932e-05 | 126 | rna-XM_007052123.2 35103464 | 37 | 35813863 | 35813988 | Theobroma cacao 3641 | AAG\|GTTTGTCATC...TCTCTCTTATTT/CTTCTGTTCATT...TGCAG\|GTT | 0 | 1 | 74.648 |
| 197656867 | GT-AG | 0 | 1.000000099473604e-05 | 81 | rna-XM_007052123.2 35103464 | 38 | 35813644 | 35813724 | Theobroma cacao 3641 | AAG\|GTCAATGGTT...TTGTTCTTAAGT/TTTCTTCTGACC...AGCAG\|GTA | 0 | 1 | 76.553 |
| 197656868 | GT-AG | 0 | 0.0002442625486425 | 750 | rna-XM_007052123.2 35103464 | 39 | 35812810 | 35813559 | Theobroma cacao 3641 | CAG\|GTGACCTGAA...AAATTCTTAGTC/TTCTTAGTCATT...TGTAG\|GTT | 0 | 1 | 77.713 |
| 197656869 | GT-AG | 0 | 1.000000099473604e-05 | 110 | rna-XM_007052123.2 35103464 | 40 | 35812553 | 35812662 | Theobroma cacao 3641 | ACA\|GTGAATCTCT...CATTTTTTGTTT/GCCAAACTCATT...GTTAG\|GAA | 0 | 1 | 79.743 |
| 197656870 | GT-AG | 0 | 0.0003950922372057 | 100 | rna-XM_007052123.2 35103464 | 41 | 35812321 | 35812420 | Theobroma cacao 3641 | AAG\|GTCTTGTTAT...TTTTTCTTAATT/TTTTTCTTAATT...TTTAG\|TAT | 0 | 1 | 81.566 |
| 197656871 | GT-AG | 0 | 1.000000099473604e-05 | 71 | rna-XM_007052123.2 35103464 | 42 | 35812202 | 35812272 | Theobroma cacao 3641 | AAG\|GTGAATATTC...TATGTCTTGATT/TATGTCTTGATT...AACAG\|ATT | 0 | 1 | 82.229 |
| 197656872 | GT-AG | 0 | 1.000000099473604e-05 | 72 | rna-XM_007052123.2 35103464 | 43 | 35811983 | 35812054 | Theobroma cacao 3641 | CAG\|GTAGGTTCAT...TTGTCATTATTG/CTAATTCTCATT...TGCAG\|ATT | 0 | 1 | 84.258 |
| 197656873 | GT-AG | 0 | 1.000000099473604e-05 | 589 | rna-XM_007052123.2 35103464 | 44 | 35811332 | 35811920 | Theobroma cacao 3641 | CAG\|GTAAGAATTG...CTTATGTTGATT/CTTATGTTGATT...TGTAG\|CTT | 2 | 1 | 85.115 |
| 197656874 | GT-AG | 0 | 0.0003757921650923 | 95 | rna-XM_007052123.2 35103464 | 45 | 35811083 | 35811177 | Theobroma cacao 3641 | CCG\|GTTTGCCTCT...TTATTCTTTCTT/ATCCCACTTATT...TGTAG\|GTT | 0 | 1 | 87.241 |
| 197656875 | GT-AG | 0 | 0.00074883340088 | 76 | rna-XM_007052123.2 35103464 | 46 | 35810815 | 35810890 | Theobroma cacao 3641 | AAG\|GTGTCAATTG...GATTGTTTAATT/GATTGTTTAATT...CGCAG\|ATT | 0 | 1 | 89.892 |
| 197656876 | GT-AG | 0 | 0.0072102130767594 | 241 | rna-XM_007052123.2 35103464 | 47 | 35810469 | 35810709 | Theobroma cacao 3641 | AAG\|GTACCTCAAT...CCTTCCTAAATT/TGTTTGCTCAAA...TGCAG\|ACA | 0 | 1 | 91.342 |
| 197656877 | GT-AG | 0 | 0.0066448772216664 | 267 | rna-XM_007052123.2 35103464 | 48 | 35810064 | 35810330 | Theobroma cacao 3641 | CAG\|GTATTCTAGA...CACTGTTTAATT/ATTAACTTCACT...TTTAG\|GCC | 0 | 1 | 93.248 |
| 197656878 | GT-AG | 0 | 1.000000099473604e-05 | 93 | rna-XM_007052123.2 35103464 | 49 | 35809734 | 35809826 | Theobroma cacao 3641 | CAG\|GTGAGAATTT...TATACTTTGCTT/CTGCTGCTCATT...ATCAG\|GAA | 0 | 1 | 96.52 |
| 197656879 | GT-AG | 0 | 1.000000099473604e-05 | 84 | rna-XM_007052123.2 35103464 | 50 | 35809548 | 35809631 | Theobroma cacao 3641 | AAG\|GTAAAAATTA...GTCCCCTTGAGG/AGGTTGCTCATG...TTCAG\|AAT | 0 | 1 | 97.929 |
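In the rows above, the scored_motifs value contains two `|` characters, and the first two and last two letters of the text between them match dinucleotide_pair (GT...AG for most rows, GC...AG for intron 11). The sketch below recovers the pair from a scored_motifs string on that basis. Reading the `|` characters as exon/intron boundaries is an assumption drawn from these rows, and the function name `terminal_dinucleotides` is hypothetical.

```python
def terminal_dinucleotides(scored_motifs: str) -> str:
    """Return e.g. 'GT-AG' from a scored_motifs string.

    ASSUMES the layout seen on this page: three '|'-separated parts,
    with the middle part starting and ending on the intron termini.
    """
    parts = scored_motifs.split("|")
    if len(parts) != 3:
        raise ValueError(f"unexpected scored_motifs layout: {scored_motifs!r}")
    intron_part = parts[1]
    return f"{intron_part[:2]}-{intron_part[-2:]}"


# Values copied from the first and eleventh rows of the table
first = "CAG|GTTCTTTCTT...ATTTTCTTAATG/AATTTTCTTAAT...GGCAG|TTC"
eleventh = "AAG|GCATGTTGTT...GGATGTTTAATA/GGATGTTTAATA...GACAG|GGA"

print(terminal_dinucleotides(first))
print(terminal_dinucleotides(eleventh))
```

The elided `...` stretches are left untouched; only the two characters at each end of the middle part are read.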
CREATE TABLE "introns" (
    "id" INTEGER,
    "dinucleotide_pair" TEXT,
    "is_minor" INTEGER,
    "score" REAL,
    "length" INTEGER,
    "transcript_id" INTEGER,
    "ordinal_index" INTEGER,
    "start" INTEGER,
    "end" INTEGER,
    "taxonomy_id" INTEGER,
    "scored_motifs" TEXT,
    "phase" INTEGER,
    "in_cds" INTEGER,
    "relative_position" REAL,
    PRIMARY KEY ([id]),
    FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
    FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
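The index on transcript_id is what lets a per-transcript filter avoid a full table scan. As a rough sketch, the following recreates a cut-down version of the schema in an in-memory SQLite database (only three columns and two of the indexes, with the foreign keys omitted) and asks SQLite for its query plan. The exact wording of the plan text varies between SQLite versions, so the code only looks for the index name.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Cut-down schema: a subset of the columns and indexes listed above.
conn.executescript(
    """
    CREATE TABLE introns (
        id INTEGER PRIMARY KEY,
        transcript_id INTEGER,
        taxonomy_id INTEGER
    );
    CREATE INDEX idx_introns_transcript_id ON introns (transcript_id);
    CREATE INDEX idx_introns_taxonomy_id ON introns (taxonomy_id);
    """
)

plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT id FROM introns WHERE transcript_id = ?",
    (35103464,),
).fetchall()

# Each plan row has four columns; the last one is the human-readable detail.
details = [row[3] for row in plan]
uses_index = any("idx_introns_transcript_id" in d for d in details)
print(details)
print(uses_index)
```

Running the same `EXPLAIN QUERY PLAN` against the real database would show whether the planner makes the same choice there, which can depend on the statistics SQLite has for the table.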