introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron (e.g., GT-AG)
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, probability (0-100%) that the intron is minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
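As a rough illustration of how the flag and score columns described above can be queried, the sketch below loads three rows from this page into an in-memory SQLite table and filters on is_minor. The reduced column set and the in-memory database are assumptions made for the example; the real table has more columns and is served by the site itself.

```python
import sqlite3

# In-memory stand-in for the introns table, reduced to a few columns
# (an assumption for this sketch; the real table has fourteen columns).
conn = sqlite3.connect(":memory:")
conn.execute(
    """
    CREATE TABLE introns (
        id INTEGER PRIMARY KEY,
        dinucleotide_pair TEXT,
        is_minor INTEGER,
        score REAL,
        phase INTEGER,
        in_cds INTEGER
    )
    """
)

# Three rows copied from the listing on this page.
rows = [
    (182503815, "GT-AG", 0, 1.000000099473604e-05, 0, 1),
    (182503816, "AT-AC", 1, 99.99999999982283, 2, 1),
    (182503817, "GT-AG", 0, 9.20884907032358e-05, 2, 1),
]
conn.executemany("INSERT INTO introns VALUES (?, ?, ?, ?, ?, ?)", rows)

# is_minor is a 0/1 flag, so the filter is a plain equality test.
minor = conn.execute(
    "SELECT id, dinucleotide_pair, score, phase "
    "FROM introns WHERE is_minor = 1 ORDER BY id"
).fetchall()
print(minor)
```

Of the three sample rows, only the AT-AC intron is flagged as minor, and its score is close to 100.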
24 rows where transcript_id = 32672004
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 182503815 | GT-AG | 0 | 1.000000099473604e-05 | 1398 | rna-XM_030239074.1 32672004 | 1 | 5433719 | 5435116 | Serinus canaria 9135 | AAG\|GTTAAAACTT...GAACCCTTTTTT/TTTTAATTCATT...AATAG\|ACT | 0 | 1 | 5.153 |
| 182503816 | AT-AC | 1 | 99.99999999982283 | 518 | rna-XM_030239074.1 32672004 | 2 | 5435236 | 5435753 | Serinus canaria 9135 | TTC\|ATATCCTTTC...TTCACCTTGACT/TGGTTTTTCACC...TTCAC\|AGT | 2 | 1 | 7.399 |
| 182503817 | GT-AG | 0 | 9.20884907032358e-05 | 1076 | rna-XM_030239074.1 32672004 | 3 | 5435844 | 5436919 | Serinus canaria 9135 | TGA\|GTAAGTACCT...ATATTTTTATTT/GATATTTTTATT...CTCAG\|ACA | 2 | 1 | 9.098 |
| 182503818 | GT-AG | 0 | 1.000000099473604e-05 | 2032 | rna-XM_030239074.1 32672004 | 4 | 5437049 | 5439080 | Serinus canaria 9135 | AAT\|GTAAGTAAAA...CCATCTTTGTCA/ATCTTTGTCATT...TACAG\|GTA | 2 | 1 | 11.533 |
| 182503819 | GT-AG | 0 | 1.000000099473604e-05 | 825 | rna-XM_030239074.1 32672004 | 5 | 5439173 | 5439997 | Serinus canaria 9135 | CAG\|GTAAGGAACA...TGCATCTTATTT/CCTTTTCTCACT...CCTAG\|GTC | 1 | 1 | 13.269 |
| 182503820 | GT-AG | 0 | 0.0069123436281934 | 1240 | rna-XM_030239074.1 32672004 | 6 | 5440239 | 5441478 | Serinus canaria 9135 | TCT\|GTGTATCTTC...TATTGTTTAACA/TATTGTTTAACA...GTTAG\|AAG | 2 | 1 | 17.818 |
| 182503821 | GT-AG | 0 | 1.000000099473604e-05 | 1206 | rna-XM_030239074.1 32672004 | 7 | 5441624 | 5442829 | Serinus canaria 9135 | CAA\|GTGGGTGCTT...CCACCTTTACCT/TTGGAATTTATT...CACAG\|ACC | 0 | 1 | 20.555 |
| 182503822 | GT-AG | 0 | 1.849587954054485e-05 | 1076 | rna-XM_030239074.1 32672004 | 8 | 5443025 | 5444100 | Serinus canaria 9135 | CAG\|GTTTGTAAAT...TTTGCCTTGAAT/CTTGAATTAACA...TCCAG\|ATG | 0 | 1 | 24.236 |
| 182503823 | GT-AG | 0 | 1.000000099473604e-05 | 3569 | rna-XM_030239074.1 32672004 | 9 | 5444293 | 5447861 | Serinus canaria 9135 | CTT\|GTAAGTGCTT...TATATCCTAATG/TATATCCTAATG...CAAAG\|AGG | 0 | 1 | 27.86 |
| 182503824 | GT-AG | 0 | 1.0901136393979044e-05 | 375 | rna-XM_030239074.1 32672004 | 10 | 5447965 | 5448339 | Serinus canaria 9135 | CAA\|GTAAGCCATT...GTCACTTTTTCT/GGATTATTCATA...GATAG\|AGT | 1 | 1 | 29.804 |
| 182503825 | GT-AG | 0 | 1.000000099473604e-05 | 122 | rna-XM_030239074.1 32672004 | 11 | 5448570 | 5448691 | Serinus canaria 9135 | CAG\|GTAAGTTAAG...ATTTTCTGAATT/CATTTTCTGAAT...TCCAG\|ATC | 0 | 1 | 34.145 |
| 182503826 | GT-AG | 0 | 6.159004901328276e-05 | 310 | rna-XM_030239074.1 32672004 | 12 | 5448845 | 5449154 | Serinus canaria 9135 | CTG\|GTAATTATTT...GCTGTTTTAACT/CTGTGTTTTATT...TACAG\|CTC | 0 | 1 | 37.033 |
| 182503827 | GT-AG | 0 | 1.000000099473604e-05 | 746 | rna-XM_030239074.1 32672004 | 13 | 5449509 | 5450254 | Serinus canaria 9135 | GTG\|GTGAGTTTCA...CTTCTGTTAGCA/CTCCAACTTACT...TTAAG\|GTG | 0 | 1 | 43.715 |
| 182503828 | GT-AG | 0 | 1.000000099473604e-05 | 1773 | rna-XM_030239074.1 32672004 | 14 | 5450708 | 5452480 | Serinus canaria 9135 | CAG\|GTAAGGAATA...AATTCCTTAACT/CTTTTCTTAAAT...TGCAG\|AAA | 0 | 1 | 52.265 |
| 182503829 | GT-AG | 0 | 1.000000099473604e-05 | 695 | rna-XM_030239074.1 32672004 | 15 | 5452596 | 5453290 | Serinus canaria 9135 | AGA\|GTAGGAACCC...TTTCCATTGAAC/CCATATCTCAAC...TATAG\|GTT | 1 | 1 | 54.436 |
| 182503830 | GT-AG | 0 | 0.0001162034299623 | 2571 | rna-XM_030239074.1 32672004 | 16 | 5453446 | 5456016 | Serinus canaria 9135 | TTG\|GTAAATTTAA...CCTGTTTTAAAA/AAACTATTGACC...TCCAG\|GCA | 0 | 1 | 57.361 |
| 182503831 | CT-TG | 0 | 0.2593984361940185 | 1426 | rna-XM_030239074.1 32672004 | 17 | 5456198 | 5457623 | Serinus canaria 9135 | GTA\|CTATGCTCAG...ATTATATTAATT/ATTATATTAATT...TATTG\|TTA | 1 | 1 | 60.778 |
| 182503832 | GT-CG | 0 | 0.5458287100409238 | 178 | rna-XM_030239074.1 32672004 | 18 | 5457653 | 5457830 | Serinus canaria 9135 | TGT\|GTATTTTCTT...CTGTCCTTTGTT/CTGTTTCTGAAG...CACCG\|TGT | 0 | 1 | 61.325 |
| 182503833 | GT-AG | 0 | 1.000000099473604e-05 | 160 | rna-XM_030239074.1 32672004 | 19 | 5457918 | 5458077 | Serinus canaria 9135 | AAG\|GTAAAGATAT...TTTCCTTTATTT/TTTTCCTTTATT...TTCAG\|GTT | 0 | 1 | 62.967 |
| 182503834 | GT-AG | 0 | 8.979856027845873e-05 | 913 | rna-XM_030239074.1 32672004 | 20 | 5458333 | 5459245 | Serinus canaria 9135 | GTG\|GTAAGTTCTA...TTTTTCTTACTT/TTTTTTCTTACT...TTCAG\|GCA | 0 | 1 | 67.78 |
| 182503835 | GT-AG | 0 | 1.000000099473604e-05 | 893 | rna-XM_030239074.1 32672004 | 21 | 5459300 | 5460192 | Serinus canaria 9135 | GAG\|GTGTGTCACT...TCATTTTTACCA/ATTTTTCTGATT...AACAG\|ATT | 0 | 1 | 68.8 |
| 182503836 | GT-AG | 0 | 3.6294650235438296e-05 | 1946 | rna-XM_030239074.1 32672004 | 22 | 5460335 | 5462280 | Serinus canaria 9135 | TAA\|GTACTACATG...ATTTATTTGACT/ATTTATTTGACT...CTTAG\|GTG | 1 | 1 | 71.48 |
| 182503837 | GT-AG | 0 | 1.000000099473604e-05 | 788 | rna-XM_030239074.1 32672004 | 23 | 5462382 | 5463169 | Serinus canaria 9135 | TCG\|GTGAGGTCAC...TAGTTCTTCTCT/CTGGGTCTCAGA...CACAG\|AAT | 0 | 1 | 73.386 |
| 182503838 | GT-AG | 0 | 8.390072965752748e-05 | 1307 | rna-XM_030239074.1 32672004 | 24 | 5463435 | 5464741 | Serinus canaria 9135 | TGA\|GTAAGTATCA...TTTTCCTTTTCC/ATTTGACTGACT...CAAAG\|GTC | 1 | 1 | 78.388 |
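The rows above can be sanity-checked with a short script. The sketch below copies ordinal_index, dinucleotide_pair, length, start, and end from the 24 rows into an in-memory SQLite table, then checks whether length equals end - start + 1 and tallies the terminal dinucleotides. That length relationship (inclusive coordinates) is an observation about these rows, not a documented guarantee of the database, and the reduced table layout is an assumption made for the example.

```python
import sqlite3

# (ordinal_index, dinucleotide_pair, length, start, end), copied from the table above.
ROWS = [
    (1, "GT-AG", 1398, 5433719, 5435116),
    (2, "AT-AC", 518, 5435236, 5435753),
    (3, "GT-AG", 1076, 5435844, 5436919),
    (4, "GT-AG", 2032, 5437049, 5439080),
    (5, "GT-AG", 825, 5439173, 5439997),
    (6, "GT-AG", 1240, 5440239, 5441478),
    (7, "GT-AG", 1206, 5441624, 5442829),
    (8, "GT-AG", 1076, 5443025, 5444100),
    (9, "GT-AG", 3569, 5444293, 5447861),
    (10, "GT-AG", 375, 5447965, 5448339),
    (11, "GT-AG", 122, 5448570, 5448691),
    (12, "GT-AG", 310, 5448845, 5449154),
    (13, "GT-AG", 746, 5449509, 5450254),
    (14, "GT-AG", 1773, 5450708, 5452480),
    (15, "GT-AG", 695, 5452596, 5453290),
    (16, "GT-AG", 2571, 5453446, 5456016),
    (17, "CT-TG", 1426, 5456198, 5457623),
    (18, "GT-CG", 178, 5457653, 5457830),
    (19, "GT-AG", 160, 5457918, 5458077),
    (20, "GT-AG", 913, 5458333, 5459245),
    (21, "GT-AG", 893, 5459300, 5460192),
    (22, "GT-AG", 1946, 5460335, 5462280),
    (23, "GT-AG", 788, 5462382, 5463169),
    (24, "GT-AG", 1307, 5463435, 5464741),
]

conn = sqlite3.connect(":memory:")
# "start" and "end" are quoted because END is an SQL keyword.
conn.execute(
    'CREATE TABLE introns (ordinal_index INTEGER, dinucleotide_pair TEXT, '
    'length INTEGER, "start" INTEGER, "end" INTEGER)'
)
conn.executemany("INSERT INTO introns VALUES (?, ?, ?, ?, ?)", ROWS)

# Count rows whose length disagrees with inclusive start/end coordinates.
mismatches = conn.execute(
    'SELECT COUNT(*) FROM introns WHERE length != "end" - "start" + 1'
).fetchone()[0]

# Tally introns by terminal dinucleotide pair.
pair_counts = dict(
    conn.execute(
        "SELECT dinucleotide_pair, COUNT(*) FROM introns GROUP BY dinucleotide_pair"
    )
)
print(mismatches, pair_counts)
```

For this transcript, every row satisfies the length check, and 21 of the 24 introns are GT-AG, with one each of AT-AC, CT-TG, and GT-CG.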
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL,
PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
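To see how the indexes relate to the row filter used on this page, the sketch below recreates the schema in an empty in-memory SQLite database, adds only idx_introns_transcript_id, and asks SQLite for its query plan for a transcript_id lookup. The empty local database is an assumption for illustration, and the exact wording of the plan text varies between SQLite versions, so the code only looks for the index name.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Schema as shown above; the referenced tables are not needed to create it.
conn.executescript(
    """
    CREATE TABLE "introns" (
        "id" INTEGER,
        "dinucleotide_pair" TEXT,
        "is_minor" INTEGER,
        "score" REAL,
        "length" INTEGER,
        "transcript_id" INTEGER,
        "ordinal_index" INTEGER,
        "start" INTEGER,
        "end" INTEGER,
        "taxonomy_id" INTEGER,
        "scored_motifs" TEXT,
        "phase" INTEGER,
        "in_cds" INTEGER,
        "relative_position" REAL,
        PRIMARY KEY ([id]),
        FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
        FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
    );
    CREATE INDEX [idx_introns_transcript_id]
        ON [introns] ([transcript_id]);
    """
)

# Ask SQLite how it would run the filter shown on this page.
plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT * FROM introns WHERE transcript_id = ?",
    (32672004,),
).fetchall()

# The human-readable description is the last column of each plan row.
plan_details = [row[-1] for row in plan]
uses_index = any("idx_introns_transcript_id" in detail for detail in plan_details)
print(plan_details, uses_index)
```

If the plan mentions the index, the lookup is an index search rather than a full table scan, which matters for a table whose ids run into the hundreds of millions.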