introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
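
The column descriptions above can be mirrored as a typed record when working with rows in code. The following is a minimal sketch, not part of the database itself: the `Intron` class and the `intron_from_row` helper are hypothetical names introduced for illustration, and the example values are copied from the row with id 68112245 in the table for this transcript. It assumes rows arrive as tuples in the table's column order, converts the 0/1 flags to booleans, and keeps `phase` optional because it is null outside the coding sequence.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Intron:
    """Hypothetical typed view of one `introns` row (illustrative only)."""
    id: int
    dinucleotide_pair: str
    is_minor: bool            # stored as INTEGER 1/0
    score: float              # probability of being minor, 0-100
    length: int
    transcript_id: int
    ordinal_index: int
    start: int
    end: int
    taxonomy_id: int
    scored_motifs: str
    phase: Optional[int]      # 0, 1, 2, or None outside the coding sequence
    in_cds: bool              # stored as INTEGER 1/0
    relative_position: float


def intron_from_row(row: tuple) -> Intron:
    """Convert a raw tuple in table column order into an Intron."""
    (id_, pair, is_minor, score, length, transcript_id, ordinal_index,
     start, end, taxonomy_id, motifs, phase, in_cds, rel_pos) = row
    if phase is not None and phase not in (0, 1, 2):
        raise ValueError(f"unexpected phase: {phase!r}")
    return Intron(id_, pair, bool(is_minor), score, length, transcript_id,
                  ordinal_index, start, end, taxonomy_id, motifs,
                  phase, bool(in_cds), rel_pos)


# Values copied from the row with id 68112245.
example = intron_from_row((
    68112245, "GT-AG", 0, 1.000000099473604e-05, 8251, 12801930, 2,
    14132717, 14140967, 173247,
    "AAG|GTAGGTCTGG...TTTTTGTTACTT/AAATGTCTCATT...CATAG|ATG",
    0, 1, 5.251,
))
print(example.is_minor, example.in_cds, example.phase)
```
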
32 rows where transcript_id = 12801930
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 68112245 | GT-AG | 0 | 1.000000099473604e-05 | 8251 | rna-XM_029497433.1 12801930 | 2 | 14132717 | 14140967 | Echeneis naucrates 173247 | AAG\|GTAGGTCTGG...TTTTTGTTACTT/AAATGTCTCATT...CATAG\|ATG | 0 | 1 | 5.251 |
| 68112246 | GT-AG | 0 | 1.000000099473604e-05 | 427 | rna-XM_029497433.1 12801930 | 3 | 14141172 | 14141598 | Echeneis naucrates 173247 | AAG\|GTCAGGAACT...GTTAACTTAACC/GTTAACTTAACC...TCCAG\|GAC | 0 | 1 | 10.377 |
| 68112247 | GT-AG | 0 | 1.000000099473604e-05 | 405 | rna-XM_029497433.1 12801930 | 4 | 14141686 | 14142090 | Echeneis naucrates 173247 | GAA\|GTAAGGCACT...GAATTATTGACC/GAATTATTGACC...CATAG\|AAC | 0 | 1 | 12.563 |
| 68112248 | GT-AG | 0 | 1.000000099473604e-05 | 596 | rna-XM_029497433.1 12801930 | 5 | 14142189 | 14142784 | Echeneis naucrates 173247 | CAA\|GTAAGGCTCT...TTTGTCTTAGTC/CTTTGTCTTAGT...TGCAG\|TAA | 2 | 1 | 15.025 |
| 68112249 | GT-AG | 0 | 1.000000099473604e-05 | 297 | rna-XM_029497433.1 12801930 | 6 | 14142912 | 14143208 | Echeneis naucrates 173247 | AAG\|GTAAGACATA...GTGTCATTAATT/ATTGTACTCATT...AACAG\|AAT | 0 | 1 | 18.216 |
| 68112250 | GT-AG | 0 | 1.000000099473604e-05 | 220 | rna-XM_029497433.1 12801930 | 7 | 14143290 | 14143509 | Echeneis naucrates 173247 | AAT\|GTGAGTTACA...TTTGTCTTTTCA/TGTCTTTTCATT...ATCAG\|TTC | 0 | 1 | 20.251 |
| 68112251 | GT-AG | 0 | 1.000000099473604e-05 | 196 | rna-XM_029497433.1 12801930 | 8 | 14143651 | 14143846 | Echeneis naucrates 173247 | AAG\|GTAAATCAAT...AAAATGTTAGCT/TAATGTTTGAAT...TGCAG\|AGC | 0 | 1 | 23.794 |
| 68112252 | GT-AG | 0 | 1.000000099473604e-05 | 544 | rna-XM_029497433.1 12801930 | 9 | 14143963 | 14144506 | Echeneis naucrates 173247 | ACG\|GTGGGTTACA...CTGAATTTATTA/GCTGAATTTATT...TTCAG\|GTT | 2 | 1 | 26.709 |
| 68112253 | GT-AG | 0 | 1.2716405324881536e-05 | 131 | rna-XM_029497433.1 12801930 | 10 | 14144602 | 14144732 | Echeneis naucrates 173247 | CAG\|GTGTGTGTTT...CAATCTTTGAAC/ATTGTGGTTACA...ATTAG\|AAT | 1 | 1 | 29.095 |
| 68112254 | GT-AG | 0 | 1.000000099473604e-05 | 298 | rna-XM_029497433.1 12801930 | 11 | 14144879 | 14145176 | Echeneis naucrates 173247 | AGG\|GTGAGACTTT...TTACTCTTGTCC/TCCTTTCTGATC...TATAG\|TGG | 0 | 1 | 32.764 |
| 68112255 | GT-AG | 0 | 1.000000099473604e-05 | 314 | rna-XM_029497433.1 12801930 | 12 | 14145329 | 14145642 | Echeneis naucrates 173247 | CAA\|GTGAGAACAG...GTGTTTTTCGCA/GCACAATTAATT...TACAG\|GTT | 2 | 1 | 36.583 |
| 68112256 | GT-AG | 0 | 1.000000099473604e-05 | 110 | rna-XM_029497433.1 12801930 | 13 | 14145757 | 14145866 | Echeneis naucrates 173247 | CAA\|GTGAGTAAAC...TTTGTCTTAATG/TCTTACCTCATA...TTCAG\|GGC | 2 | 1 | 39.447 |
| 68112257 | GT-AG | 0 | 1.9081371349101235e-05 | 122 | rna-XM_029497433.1 12801930 | 14 | 14145949 | 14146070 | Echeneis naucrates 173247 | GTG\|GTGCGCTCAC...GTATTCATGACT/TAATTATTCAAC...AATAG\|TCT | 0 | 1 | 41.508 |
| 68112258 | GT-AG | 0 | 1.0287518684991225e-05 | 146 | rna-XM_029497433.1 12801930 | 15 | 14146120 | 14146265 | Echeneis naucrates 173247 | CCA\|GTAAGTTGGA...ATCTGCTTGGTA/ATTAAGTTAAAC...TACAG\|AGA | 1 | 1 | 42.739 |
| 68112259 | GT-AG | 0 | 1.000000099473604e-05 | 427 | rna-XM_029497433.1 12801930 | 16 | 14146406 | 14146832 | Echeneis naucrates 173247 | GTG\|GTGAGAGCAC...TGGTTTTTAGCA/TTGGTTTTTAGC...TGCAG\|AGG | 0 | 1 | 46.256 |
| 68112260 | GT-AG | 0 | 1.000000099473604e-05 | 446 | rna-XM_029497433.1 12801930 | 17 | 14146949 | 14147394 | Echeneis naucrates 173247 | CAG\|GTTAGATTTT...TTAATGTTAGAT/TTAAAATTAATG...TTCAG\|TGC | 2 | 1 | 49.171 |
| 68112261 | GT-AG | 0 | 1.000000099473604e-05 | 194 | rna-XM_029497433.1 12801930 | 18 | 14147501 | 14147694 | Echeneis naucrates 173247 | AAG\|GTACATGACA...ATTATGTTAAAC/ATTATGTTAAAC...TCTAG\|GTC | 0 | 1 | 51.834 |
| 68112262 | GT-AG | 0 | 1.000000099473604e-05 | 268 | rna-XM_029497433.1 12801930 | 19 | 14147856 | 14148123 | Echeneis naucrates 173247 | ATC\|GTAAGAGAAA...CCTGCTTTGTTT/CTTTGTTTAAGT...ATCAG\|GAC | 2 | 1 | 55.879 |
| 68112263 | GT-AG | 0 | 1.000000099473604e-05 | 195 | rna-XM_029497433.1 12801930 | 20 | 14148248 | 14148442 | Echeneis naucrates 173247 | GAG\|GTAGGAAACC...ATTTTTTTGTTC/TTTTGCTTCACC...TTCAG\|CCA | 0 | 1 | 58.995 |
| 68112264 | GT-AG | 0 | 0.0001849908871838 | 207 | rna-XM_029497433.1 12801930 | 21 | 14148602 | 14148808 | Echeneis naucrates 173247 | AGA\|GTATGAGCTG...ATCCCTTTGTCT/GAAAGTGTGACA...TCCAG\|TCA | 0 | 1 | 62.99 |
| 68112265 | GT-AG | 0 | 1.000000099473604e-05 | 195 | rna-XM_029497433.1 12801930 | 22 | 14148878 | 14149072 | Echeneis naucrates 173247 | AAG\|GTAAGAGGCA...CTTTCTCTGATT/CTTTCTCTGATT...CACAG\|GTG | 0 | 1 | 64.724 |
| 68112266 | GT-AG | 0 | 1.000000099473604e-05 | 178 | rna-XM_029497433.1 12801930 | 23 | 14149210 | 14149387 | Echeneis naucrates 173247 | CAG\|GTAGGATGCC...AATTTCTTCCCC/AAATGTTTCAAA...TGCAG\|TAA | 2 | 1 | 68.166 |
| 68112267 | GT-AG | 0 | 1.000000099473604e-05 | 90 | rna-XM_029497433.1 12801930 | 24 | 14149512 | 14149601 | Echeneis naucrates 173247 | AAT\|GTCAGTCCCA...GGCTTTTTAACA/GGCTTTTTAACA...CACAG\|GAT | 0 | 1 | 71.281 |
| 68112268 | GT-AG | 0 | 1.000000099473604e-05 | 540 | rna-XM_029497433.1 12801930 | 25 | 14149807 | 14150346 | Echeneis naucrates 173247 | AGG\|GTTGGTGTGC...AGACCCTGGACA/AAATTTGTTATT...CTCAG\|AAA | 1 | 1 | 76.432 |
| 68112269 | GT-AG | 0 | 1.000000099473604e-05 | 159 | rna-XM_029497433.1 12801930 | 26 | 14150462 | 14150620 | Echeneis naucrates 173247 | AGA\|GTGAGTGCTC...GTTATGTTGATG/GTTATGTTGATG...TGTAG\|GTG | 2 | 1 | 79.322 |
| 68112270 | GT-AG | 0 | 0.0002120886156841 | 255 | rna-XM_029497433.1 12801930 | 27 | 14150754 | 14151008 | Echeneis naucrates 173247 | GAG\|GTTTACACAC...ATTTCCTTTCTT/ACAACTGTAACC...TACAG\|AAG | 0 | 1 | 82.663 |
| 68112271 | GC-AG | 0 | 1.000000099473604e-05 | 184 | rna-XM_029497433.1 12801930 | 28 | 14151129 | 14151312 | Echeneis naucrates 173247 | AAG\|GCAATATAAA...TATTTTTTATTT/TTTTTATTTATT...AACAG\|GAC | 0 | 1 | 85.678 |
| 68112272 | GT-AG | 0 | 1.000000099473604e-05 | 268 | rna-XM_029497433.1 12801930 | 29 | 14151376 | 14151643 | Echeneis naucrates 173247 | AAG\|GTAAATACAA...AAAGCATGAATC/ACCTCTCCCATC...TCAAG\|CAG | 0 | 1 | 87.261 |
| 68112273 | GT-AG | 0 | 1.000000099473604e-05 | 76 | rna-XM_029497433.1 12801930 | 30 | 14151799 | 14151874 | Echeneis naucrates 173247 | CAG\|GTAATCACAC...TCTGTCTCAACT/GTCTGTCTCAAC...TGCAG\|TAA | 2 | 1 | 91.156 |
| 68112274 | GT-AG | 0 | 1.000000099473604e-05 | 90 | rna-XM_029497433.1 12801930 | 31 | 14151963 | 14152052 | Echeneis naucrates 173247 | AAG\|GTCAGAGATG...TTTACTGTACCG/TCAGTGTTTACT...TTCAG\|GAT | 0 | 1 | 93.367 |
| 68112275 | GT-AG | 0 | 1.000000099473604e-05 | 792 | rna-XM_029497433.1 12801930 | 32 | 14152313 | 14153104 | Echeneis naucrates 173247 | GAG\|GTGAGACTGG...TGATCTTTGATC/TGATCTTTGATC...TTCAG\|GTA | 2 | 1 | 99.899 |
| 68120331 | GT-AG | 0 | 1.000000099473604e-05 | 946 | rna-XM_029497433.1 12801930 | 1 | 14131618 | 14132563 | Echeneis naucrates 173247 | GAG\|GTAAATACTA...TGTTTTCTGACC/TGTTTTCTGACC...CATAG\|AGG | | 0 | 2.387 |
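
In the rows listed for this transcript, `length` equals `end - start + 1`, which is consistent with 1-based inclusive coordinates. This is an observation from the listed values rather than a documented guarantee. The sketch below checks that relationship for the first four introns (values copied from the table) and computes the gap between consecutive introns. Reading those gaps as exon lengths assumes that `ordinal_index` follows increasing genomic coordinates, as it does in these rows. The helper names `span_length` and `gaps_between` are hypothetical.

```python
# (ordinal_index, start, end, length) copied from the rows for this transcript.
rows = [
    (1, 14131618, 14132563, 946),
    (2, 14132717, 14140967, 8251),
    (3, 14141172, 14141598, 427),
    (4, 14141686, 14142090, 405),
]


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive interval."""
    return end - start + 1


def gaps_between(intron_rows):
    """Bases between each intron's end and the next intron's start."""
    ordered = sorted(intron_rows)  # sorts by ordinal_index first
    return [nxt[1] - prev[2] - 1 for prev, nxt in zip(ordered, ordered[1:])]


# Every listed length matches the inclusive span of its coordinates.
mismatches = [r for r in rows if span_length(r[1], r[2]) != r[3]]
print(mismatches)          # []
print(gaps_between(rows))  # [153, 204, 87]
```
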
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL
,PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
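
As a rough illustration of how this schema might be queried, the sketch below builds an in-memory SQLite copy of the `introns` table with Python's standard `sqlite3` module, loads three rows copied from the table for transcript 12801930, and retrieves them in transcript order. It makes several simplifying assumptions: the foreign key clauses are omitted because the `transcripts` and `genomes` tables are not shown here, only the `transcript_id` index is created, and columns not needed for the query are left null. The helper name `introns_for_transcript` is hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE introns (
    "id" INTEGER PRIMARY KEY,
    "dinucleotide_pair" TEXT,
    "is_minor" INTEGER,
    "score" REAL,
    "length" INTEGER,
    "transcript_id" INTEGER,
    "ordinal_index" INTEGER,
    "start" INTEGER,
    "end" INTEGER,
    "taxonomy_id" INTEGER,
    "scored_motifs" TEXT,
    "phase" INTEGER,
    "in_cds" INTEGER,
    "relative_position" REAL
);
CREATE INDEX idx_introns_transcript_id ON introns ("transcript_id");
""")

# (id, dinucleotide_pair, is_minor, length, transcript_id, ordinal_index,
#  start, end, taxonomy_id) copied from the rows for transcript 12801930.
sample_rows = [
    (68112245, "GT-AG", 0, 8251, 12801930, 2, 14132717, 14140967, 173247),
    (68112246, "GT-AG", 0, 427, 12801930, 3, 14141172, 14141598, 173247),
    (68120331, "GT-AG", 0, 946, 12801930, 1, 14131618, 14132563, 173247),
]
conn.executemany(
    'INSERT INTO introns ("id", "dinucleotide_pair", "is_minor", "length", '
    '"transcript_id", "ordinal_index", "start", "end", "taxonomy_id") '
    "VALUES (?, ?, ?, ?, ?, ?, ?, ?, ?)",
    sample_rows,
)


def introns_for_transcript(connection, transcript_id):
    """Return (id, ordinal_index, start, end) in transcript order."""
    return connection.execute(
        'SELECT "id", "ordinal_index", "start", "end" FROM introns '
        'WHERE "transcript_id" = ? ORDER BY "ordinal_index"',
        (transcript_id,),
    ).fetchall()


result = introns_for_transcript(conn, 12801930)
print([row[1] for row in result])  # [1, 2, 3]
```

Ordering by `ordinal_index` rather than `id` matters for this transcript: the first intron (id 68120331) has a higher id than the others, so a listing sorted by id places it last. The column name `end` is quoted because END is an SQL keyword.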