introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
21 rows where transcript_id = 35103462
This data as json, CSV (advanced)
Suggested facets: score, length, phase, in_cds
| id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 197656804 | GT-AG | 0 | 0.4680644544628896 | 1020 | rna-XM_018124049.1 35103462 | 1 | 30236293 | 30237312 | Theobroma cacao 3641 | CGG|GTATGCTTTG...TGATTTTTACCT/ATGATTTTTACC...TGCAG|CTG | 1 | 1 | 0.621 |
| 197656805 | GT-AG | 0 | 1.000000099473604e-05 | 390 | rna-XM_018124049.1 35103462 | 2 | 30237394 | 30237783 | Theobroma cacao 3641 | TAG|GTAAGCAATG...CTTTTCTTATAA/TCTTTTCTTATA...TGTAG|GAT | 1 | 1 | 1.647 |
| 197656806 | GT-AG | 0 | 1.000000099473604e-05 | 91 | rna-XM_018124049.1 35103462 | 3 | 30237944 | 30238034 | Theobroma cacao 3641 | GAT|GTAAGAATAT...CTTCTTTTGATA/CTTCTTTTGATA...TGCAG|AAT | 2 | 1 | 3.675 |
| 197656807 | GT-AG | 0 | 1.000000099473604e-05 | 982 | rna-XM_018124049.1 35103462 | 4 | 30238193 | 30239174 | Theobroma cacao 3641 | TAG|GTTTGTAAGT...ATCTCCATACTT/GTGAAACTGATA...GAAAG|GTT | 1 | 1 | 5.677 |
| 197656808 | GT-AG | 0 | 1.000000099473604e-05 | 123 | rna-XM_018124049.1 35103462 | 5 | 30239332 | 30239454 | Theobroma cacao 3641 | TAG|GTGATTGCCT...ATACCTTTAGTA/TAGTAGCTTACG...TCAAG|GTT | 2 | 1 | 7.667 |
| 197656809 | GT-AG | 0 | 0.0006337182386217 | 698 | rna-XM_018124049.1 35103462 | 6 | 30239642 | 30240339 | Theobroma cacao 3641 | ATG|GTATGTACTA...TGCTATTTAAAT/TGCTATTTAAAT...TGCAG|CTA | 0 | 1 | 10.037 |
| 197656810 | GT-AG | 0 | 0.0001038366932388 | 622 | rna-XM_018124049.1 35103462 | 7 | 30240456 | 30241077 | Theobroma cacao 3641 | AAG|GTAACACACT...GTTGTTTTAATT/GTTGTTTTAATT...TCCAG|ATC | 2 | 1 | 11.507 |
| 197656811 | GT-AG | 0 | 0.0003396358602602 | 101 | rna-XM_018124049.1 35103462 | 8 | 30241163 | 30241263 | Theobroma cacao 3641 | CCA|GTATGATCGC...TATTTATCAATA/TTATTTATCAAT...GACAG|GTA | 0 | 1 | 12.584 |
| 197656812 | GT-AG | 0 | 1.5232164107634577e-05 | 101 | rna-XM_018124049.1 35103462 | 9 | 30241480 | 30241580 | Theobroma cacao 3641 | AAG|GTACTGCTCT...ATGTTTTTATCA/TATGTTTTTATC...TGCAG|GAT | 0 | 1 | 15.321 |
| 197656813 | GT-AG | 0 | 2.675335475024229e-05 | 391 | rna-XM_018124049.1 35103462 | 10 | 30241759 | 30242149 | Theobroma cacao 3641 | AAG|GTATGAAACT...TACACTTTAATC/CACTTTCTTATG...TACAG|TGG | 1 | 1 | 17.577 |
| 197656814 | GT-AG | 0 | 0.0157150907047568 | 187 | rna-XM_018124049.1 35103462 | 11 | 30242279 | 30242465 | Theobroma cacao 3641 | ATG|GTATGTTTTA...ATGTTCTGACCT/CAATTTCTAATT...TTCAG|TGA | 1 | 1 | 19.212 |
| 197656815 | GT-AG | 0 | 0.0008548387735421 | 137 | rna-XM_018124049.1 35103462 | 12 | 30242640 | 30242776 | Theobroma cacao 3641 | TAG|GTATGCATAA...AACATTTTCTCT/AACAAACTAAAC...TGCAG|GTC | 1 | 1 | 21.417 |
| 197656816 | GT-AG | 0 | 1.6070340480410663 | 602 | rna-XM_018124049.1 35103462 | 13 | 30243011 | 30243612 | Theobroma cacao 3641 | ATG|GTATCATTCC...GTGTACTTAATT/AATTGTTTTATG...TTCAG|AGA | 1 | 1 | 24.382 |
| 197656817 | GT-AG | 0 | 0.0004560718719572 | 393 | rna-XM_018124049.1 35103462 | 14 | 30245164 | 30245556 | Theobroma cacao 3641 | TGG|GTATTACTCT...TGCACTTTAAAG/TTCAATCTAATC...TGCAG|GAC | 1 | 1 | 44.038 |
| 197656818 | GT-AG | 0 | 1.000000099473604e-05 | 158 | rna-XM_018124049.1 35103462 | 15 | 30248043 | 30248200 | Theobroma cacao 3641 | ATG|GTAAGTCTAT...GTTATCTAAATT/AAATTTCTAACT...TGCAG|ATC | 0 | 1 | 75.542 |
| 197656819 | GT-AG | 0 | 1.000000099473604e-05 | 1472 | rna-XM_018124049.1 35103462 | 16 | 30248597 | 30250068 | Theobroma cacao 3641 | AAA|GTAAGTCCCG...TGATTTTTAGAG/TTCTTGCTGATT...TTCAG|CTG | 0 | 1 | 80.56 |
| 197656820 | GT-AG | 0 | 1.000000099473604e-05 | 870 | rna-XM_018124049.1 35103462 | 17 | 30250228 | 30251097 | Theobroma cacao 3641 | CTG|GTAAGGGACT...TATACCTTGATT/TATACCTTGATT...GCCAG|ATG | 0 | 1 | 82.575 |
| 197656821 | GT-AG | 0 | 0.2171182943587509 | 266 | rna-XM_018124049.1 35103462 | 18 | 30251608 | 30251873 | Theobroma cacao 3641 | GAG|GTACCCAGGA...CTATTTTTGACA/CTATTTTTGACA...AACAG|GTA | 0 | 1 | 89.038 |
| 197656822 | GT-AG | 0 | 0.0020716055618242 | 101 | rna-XM_018124049.1 35103462 | 19 | 30252019 | 30252119 | Theobroma cacao 3641 | CCC|GTAAGTTGTG...TTTTCTTTATTA/TTATTACTTATT...ATCAG|AGG | 1 | 1 | 90.876 |
| 197656823 | GT-AG | 0 | 4.717530861015112e-05 | 114 | rna-XM_018124049.1 35103462 | 20 | 30252265 | 30252378 | Theobroma cacao 3641 | TAG|GTAATGTTTT...AGTTCCTTGATA/ATTTCATTCATT...TGTAG|GTT | 2 | 1 | 92.713 |
| 197671344 | GT-AG | 0 | 0.0001125564889304 | 87 | rna-XM_018124049.1 35103462 | 21 | 30252554 | 30252640 | Theobroma cacao 3641 | TCC|GTAAGTGTTT...TGTACCTGAACC/ATTGAACTGATT...GCCAG|GAA | 0 | 94.931 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL
,PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);