introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
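Because `phase` is null for introns outside the coding sequence, a filter on it needs `IS NULL` rather than `= NULL`. The following is a minimal sketch using Python's built-in `sqlite3` and an in-memory table with only three of the columns. The two coding rows reuse ids and phases from this page; the third row (id 1, a UTR intron) is invented purely for illustration and does not come from the database.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    'CREATE TABLE introns ("id" INTEGER PRIMARY KEY, "phase" INTEGER, "in_cds" INTEGER)'
)
conn.executemany(
    "INSERT INTO introns VALUES (?, ?, ?)",
    [
        (68111043, 0, 1),  # id and phase taken from this page
        (68111044, 2, 1),  # id and phase taken from this page
        (1, None, 0),      # hypothetical UTR intron, NOT from the database
    ],
)

# "= NULL" never matches in SQL; IS NULL finds introns without a phase.
wrong = conn.execute("SELECT id FROM introns WHERE phase = NULL").fetchall()
right = conn.execute("SELECT id FROM introns WHERE phase IS NULL").fetchall()

# GROUP BY keeps NULL as its own group, so non-coding introns are tallied separately.
by_phase = conn.execute(
    "SELECT phase, COUNT(*) FROM introns GROUP BY phase ORDER BY phase"
).fetchall()

print(wrong, right, by_phase)
```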
30 rows where transcript_id = 12801882
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 68111043 | GT-AG | 0 | 1.000000099473604e-05 | 6005 | rna-XM_029507516.1 12801882 | 1 | 22412804 | 22418808 | Echeneis naucrates 173247 | GAG\|GTGAGTGAGG...AGCACCTTCACG/AGCACCTTCACG...CCAAG\|GCT | 0 | 1 | 2.234 |
| 68111044 | GT-AG | 0 | 0.0001907018753054 | 118 | rna-XM_029507516.1 12801882 | 2 | 22418880 | 22418997 | Echeneis naucrates 173247 | CGA\|GTAAGCTAGT...CAAGTCTAAATA/TCAAGTCTAAAT...CACAG\|GCT | 2 | 1 | 3.589 |
| 68111045 | GT-AG | 0 | 1.000000099473604e-05 | 101 | rna-XM_029507516.1 12801882 | 3 | 22419068 | 22419168 | Echeneis naucrates 173247 | CTG\|GTAAGGAGAC...TAAACCCTGACT/TAAACCCTGACT...TCTAG\|GAC | 0 | 1 | 4.926 |
| 68111046 | GT-AG | 0 | 2.853361562954884e-05 | 88 | rna-XM_029507516.1 12801882 | 4 | 22419276 | 22419363 | Echeneis naucrates 173247 | GAA\|GTAAATATGC...TCTGATTTAATC/ACATGTCTGATT...ATCAG\|GGA | 2 | 1 | 6.968 |
| 68111047 | GT-AG | 0 | 1.000000099473604e-05 | 102 | rna-XM_029507516.1 12801882 | 5 | 22419395 | 22419496 | Echeneis naucrates 173247 | AAT\|GTGAGTTGTT...GCATCTTTGTTT/TTAGATCTCATG...ACCAG\|GTG | 0 | 1 | 7.56 |
| 68111048 | GT-AG | 0 | 1.000000099473604e-05 | 641 | rna-XM_029507516.1 12801882 | 6 | 22419608 | 22420248 | Echeneis naucrates 173247 | GTG\|GTGAGTTAGC...AAAACCTAAGTT/AAAAACCTAAGT...TTTAG\|GAG | 0 | 1 | 9.679 |
| 68111049 | GT-AG | 0 | 1.000000099473604e-05 | 212 | rna-XM_029507516.1 12801882 | 7 | 22420377 | 22420588 | Echeneis naucrates 173247 | CAG\|GTGAGAGACG...CTTTTCTTTTCT/GATTAAATAAGT...GACAG\|TCT | 2 | 1 | 12.123 |
| 68111050 | GT-AG | 0 | 1.000000099473604e-05 | 939 | rna-XM_029507516.1 12801882 | 8 | 22420660 | 22421598 | Echeneis naucrates 173247 | ATG\|GTTAGTTCTC...TTGCTCTTGATT/TACTTTCTAACC...CCCAG\|GGC | 1 | 1 | 13.478 |
| 68111051 | GT-AG | 0 | 3.4614923263664886e-05 | 166 | rna-XM_029507516.1 12801882 | 9 | 22421801 | 22421966 | Echeneis naucrates 173247 | CAG\|GTAACGGACA...CATCCCTTATCT/TAAAGTTTCATC...TCTAG\|TTG | 2 | 1 | 17.335 |
| 68111052 | GT-AG | 0 | 1.000000099473604e-05 | 491 | rna-XM_029507516.1 12801882 | 10 | 22422165 | 22422655 | Echeneis naucrates 173247 | CAG\|GTCAGAGAGG...GGATTTTTGACC/GGATTTTTGACC...TCCAG\|CCC | 2 | 1 | 21.115 |
| 68111053 | GT-AG | 0 | 1.000000099473604e-05 | 203 | rna-XM_029507516.1 12801882 | 11 | 22422962 | 22423164 | Echeneis naucrates 173247 | CAG\|GTTAGCTGCG...TGATGTGTAACA/TGATGTGTAACA...GACAG\|TGA | 2 | 1 | 26.957 |
| 68111054 | GT-AG | 0 | 0.0004267618050167 | 1504 | rna-XM_029507516.1 12801882 | 12 | 22423435 | 22424938 | Echeneis naucrates 173247 | CAG\|GTGCCCAACA...GTGTCCTCATTT/TGTGTCCTCATT...TACAG\|TGA | 2 | 1 | 32.111 |
| 68111055 | GT-AG | 0 | 2.3627092658093206e-05 | 993 | rna-XM_029507516.1 12801882 | 13 | 22425197 | 22426189 | Echeneis naucrates 173247 | CAG\|GTAACGACAA...TTATCCTTCATT/TTATCCTTCATT...TACAG\|CCA | 2 | 1 | 37.037 |
| 68111056 | GT-AG | 0 | 1.000000099473604e-05 | 109 | rna-XM_029507516.1 12801882 | 14 | 22426550 | 22426658 | Echeneis naucrates 173247 | CAG\|GTGGGCTGCA...TGTGTTTTACAT/TGTTTAGTTATT...TATAG\|GAG | 2 | 1 | 43.91 |
| 68111057 | GT-AG | 0 | 0.0018360462089756 | 1059 | rna-XM_029507516.1 12801882 | 15 | 22426998 | 22428056 | Echeneis naucrates 173247 | AAG\|GTATTTAATG...TACACCTTAGTC/CTTAGTCTGAGT...TTTAG\|CAG | 2 | 1 | 50.382 |
| 68111058 | GT-AG | 0 | 1.000000099473604e-05 | 243 | rna-XM_029507516.1 12801882 | 16 | 22428390 | 22428632 | Echeneis naucrates 173247 | AAT\|GTGAGATGCC...TTTCCCTTACCT/TTTCCTCTCATT...TTTAG\|GAT | 2 | 1 | 56.739 |
| 68111059 | GT-AG | 0 | 4.124466815004531e-05 | 982 | rna-XM_029507516.1 12801882 | 17 | 22428672 | 22429653 | Echeneis naucrates 173247 | AGA\|GTAAGTCTTT...TATGCCTTTTCT/CCTTTTCTGATT...TCTAG\|GAG | 2 | 1 | 57.484 |
| 68111060 | GT-AG | 0 | 1.000000099473604e-05 | 127 | rna-XM_029507516.1 12801882 | 18 | 22429784 | 22429910 | Echeneis naucrates 173247 | GAT\|GTAAGAAACT...CATTATTTGACA/CATTATTTGACA...GGCAG\|GAT | 0 | 1 | 59.966 |
| 68111061 | GT-AG | 0 | 1.000000099473604e-05 | 116 | rna-XM_029507516.1 12801882 | 19 | 22430069 | 22430184 | Echeneis naucrates 173247 | GAA\|GTAAGGATGC...ACTACCATAACA/TCTCTGCTTATT...TCTAG\|GGA | 2 | 1 | 62.982 |
| 68111062 | GT-AG | 0 | 1.000000099473604e-05 | 125 | rna-XM_029507516.1 12801882 | 20 | 22430846 | 22430970 | Echeneis naucrates 173247 | AAG\|GTGAGGGTGT...TGATCCTTATCA/GTGATCCTTATC...CTCAG\|ATG | 0 | 1 | 75.601 |
| 68111063 | GT-AG | 0 | 1.000000099473604e-05 | 1078 | rna-XM_029507516.1 12801882 | 21 | 22431304 | 22432381 | Echeneis naucrates 173247 | CAG\|GTAAGGGTTA...AAAGTGTTACTC/TTCTGGTTTACA...TTTAG\|GTC | 0 | 1 | 81.959 |
| 68111064 | GT-AG | 0 | 1.999753978186752e-05 | 177 | rna-XM_029507516.1 12801882 | 22 | 22432485 | 22432661 | Echeneis naucrates 173247 | CAG\|GTAAATCTGT...TTTCTTTTAGCA/TTTTAATTTATT...CACAG\|ACG | 1 | 1 | 83.925 |
| 68111065 | GT-AG | 0 | 1.000000099473604e-05 | 183 | rna-XM_029507516.1 12801882 | 23 | 22432688 | 22432870 | Echeneis naucrates 173247 | ACT\|GTGAGTATCT...GTCCTTTTGGTG/TTGGTGCGTATA...GTTAG\|ATT | 0 | 1 | 84.422 |
| 68111066 | GT-AG | 0 | 0.0012304785562252 | 153 | rna-XM_029507516.1 12801882 | 24 | 22432985 | 22433137 | Echeneis naucrates 173247 | CAG\|GTACACTCGT...GTGTGTTTAACT/GTGTGTTTAACT...TGCAG\|GAG | 0 | 1 | 86.598 |
| 68111067 | GT-AG | 0 | 1.000000099473604e-05 | 122 | rna-XM_029507516.1 12801882 | 25 | 22433231 | 22433352 | Echeneis naucrates 173247 | GAG\|GTGCAAGCAC...TTTTCTGTACTT/CATAATATTATA...TCCAG\|GAA | 0 | 1 | 88.373 |
| 68111068 | GT-AG | 0 | 1.000000099473604e-05 | 111 | rna-XM_029507516.1 12801882 | 26 | 22433549 | 22433659 | Echeneis naucrates 173247 | AGG\|GTGGGCTTGC...TATTCCTTTTTG/ATTTGTTTGATG...TACAG\|ATT | 1 | 1 | 92.115 |
| 68111069 | GT-AG | 0 | 1.000000099473604e-05 | 348 | rna-XM_029507516.1 12801882 | 27 | 22433760 | 22434107 | Echeneis naucrates 173247 | AAG\|GTGTGTGTGT...TTCTTTTTATTT/TTTCTTTTTATT...TGCAG\|GTC | 2 | 1 | 94.024 |
| 68111070 | GT-AG | 0 | 7.420550833529969e-05 | 108 | rna-XM_029507516.1 12801882 | 28 | 22434183 | 22434290 | Echeneis naucrates 173247 | CAG\|GTAATCCGCA...TGTATCTTAATT/TGTATCTTAATT...TGTAG\|GTG | 2 | 1 | 95.456 |
| 68111071 | GT-AG | 0 | 0.0003360245099999 | 274 | rna-XM_029507516.1 12801882 | 29 | 22434400 | 22434673 | Echeneis naucrates 173247 | AAG\|GTATTGTCTT...CTTTCTATGATT/TGGTTGTTCACT...CATAG\|TGT | 0 | 1 | 97.537 |
| 68111072 | GT-AG | 0 | 1.000000099473604e-05 | 162 | rna-XM_029507516.1 12801882 | 30 | 22434777 | 22434938 | Echeneis naucrates 173247 | GGG\|GTGAGAACAC...CTGATTTTCGTG/GAAACGCTGATT...TGCAG\|GCA | 1 | 1 | 99.504 |
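In all 30 rows above, `length` equals `end - start + 1`, which suggests 1-based inclusive coordinates, and `dinucleotide_pair` matches the first and last two intron bases shown in `scored_motifs`. The Python sketch below reproduces that check for one row. The reading of `scored_motifs` (exon bases outside the two `|` marks, intron bases between them) is an assumption inferred from these rows rather than a documented format, and both helper names are hypothetical.

```python
def intron_length(start: int, end: int) -> int:
    """Intron length, assuming 1-based inclusive start/end coordinates."""
    return end - start + 1


def terminal_dinucleotides(scored_motifs: str) -> str:
    """Extract the terminal dinucleotides from a scored_motifs string.

    Assumed layout (inferred, not documented):
    EXON|INTRON_START...BRANCH_REGION/BRANCH_REGION...INTRON_END|EXON
    """
    first_bar = scored_motifs.index("|")
    last_bar = scored_motifs.rindex("|")
    five_prime = scored_motifs[first_bar + 1:first_bar + 3]
    three_prime = scored_motifs[last_bar - 2:last_bar]
    return f"{five_prime}-{three_prime}"


# Values copied from the row with id 68111043.
motifs = "GAG|GTGAGTGAGG...AGCACCTTCACG/AGCACCTTCACG...CCAAG|GCT"
print(intron_length(22412804, 22418808))  # 6005, matching the length column
print(terminal_dinucleotides(motifs))     # GT-AG, matching dinucleotide_pair
```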
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL
,PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
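The `idx_introns_transcript_id` index is what lets a per-transcript lookup, such as the `transcript_id = 12801882` filter on this page, avoid a full table scan. The sketch below uses Python's built-in `sqlite3` to illustrate this. As a simplification it builds an in-memory table with only four of the columns and no foreign keys, loads two rows from this page, and asks SQLite for its query plan. The plan wording differs between SQLite versions, so no output is shown.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Reduced version of the schema: four columns, no foreign keys (simplification).
conn.executescript("""
CREATE TABLE "introns" (
  "id" INTEGER,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  PRIMARY KEY ([id])
);
CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
""")
conn.executemany(
    "INSERT INTO introns VALUES (?, ?, ?, ?)",
    [(68111043, 6005, 12801882, 1), (68111044, 118, 12801882, 2)],
)

query = (
    "SELECT id, ordinal_index, length FROM introns "
    "WHERE transcript_id = ? ORDER BY ordinal_index"
)
rows = conn.execute(query, (12801882,)).fetchall()

# The last column of each EXPLAIN QUERY PLAN row is a human-readable description.
plan = [r[-1] for r in conn.execute("EXPLAIN QUERY PLAN " + query, (12801882,))]

print(rows)
print(plan)
```

If the plan mentions `idx_introns_transcript_id`, SQLite is resolving the filter through the index rather than scanning every intron.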