introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, probability (0-100%) that the intron is minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
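The column definitions above map naturally onto a typed record. The following is a minimal sketch, assuming rows arrive as plain dictionaries keyed by column name (for example from a JSON export). The `Intron` class and its `from_row` helper are hypothetical names introduced for illustration; they are not part of the database.

```python
from dataclasses import dataclass
from typing import Any, Mapping, Optional


@dataclass(frozen=True)
class Intron:
    """Hypothetical typed view of one row of the introns table."""
    id: int
    dinucleotide_pair: str
    is_minor: bool            # stored as INTEGER 1/0
    score: float              # probability of being minor, 0-100 (%)
    length: int               # base pairs
    transcript_id: int        # references transcripts(id)
    ordinal_index: int        # 1 for the first intron, 2 for the second, ...
    start: int
    end: int
    taxonomy_id: int          # references genomes(taxonomy_id)
    scored_motifs: str
    phase: Optional[int]      # 0, 1, 2, or None outside the coding sequence
    in_cds: bool              # stored as INTEGER 1/0
    relative_position: float  # percentage of coding length

    @classmethod
    def from_row(cls, row: Mapping[str, Any]) -> "Intron":
        """Build an Intron from a dict keyed by column name."""
        values = dict(row)
        # Convert the 1/0 integer flags to booleans.
        values["is_minor"] = bool(values["is_minor"])
        values["in_cds"] = bool(values["in_cds"])
        return cls(**values)


# Values of the first row listed on this page, using the raw integer keys.
row = {
    "id": 195793270, "dinucleotide_pair": "GT-AG", "is_minor": 0,
    "score": 1.000000099473604e-05, "length": 284,
    "transcript_id": 34753777, "ordinal_index": 1,
    "start": 36346, "end": 36629, "taxonomy_id": 121530,
    "scored_motifs": "GCG|GTAAGTGCTA...AAATACTTAAAT/ACTTAAATCATT...CTTAG|ATT",
    "phase": 0, "in_cds": 1, "relative_position": 3.325,
}
intron = Intron.from_row(row)
```

Keeping `phase` as `Optional[int]` preserves the distinction between phase 0 and a null phase, which a plain integer default would lose.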
17 rows where transcript_id = 34753777
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 195793270 | GT-AG | 0 | 1.000000099473604e-05 | 284 | rna-XM_009991378.1 34753777 | 1 | 36346 | 36629 | Tauraco erythrolophus 121530 | GCG\|GTAAGTGCTA...AAATACTTAAAT/ACTTAAATCATT...CTTAG\|ATT | 0 | 1 | 3.325 |
| 195793271 | GT-AG | 0 | 0.0002590296897543 | 1185 | rna-XM_009991378.1 34753777 | 2 | 35029 | 36213 | Tauraco erythrolophus 121530 | ACA\|GTAAGTTTCT...TTCCCCTTTTTC/CTTTTTCTAATG...TCTAG\|ACG | 0 | 1 | 9.099 |
| 195793272 | GT-AG | 0 | 0.0015692247050839 | 1981 | rna-XM_009991378.1 34753777 | 3 | 32965 | 34945 | Tauraco erythrolophus 121530 | CAG\|GTATGTATTC...TCTTTCTTCTCT/GCTGTATTAATT...AATAG\|CCA | 2 | 1 | 12.73 |
| 195793273 | GT-AG | 0 | 8.348268810031444e-05 | 2259 | rna-XM_009991378.1 34753777 | 4 | 30609 | 32867 | Tauraco erythrolophus 121530 | CAG\|GTATTGAATT...TTTCTTTTAATT/TTTCTTTTAATT...TTTAG\|GGT | 0 | 1 | 16.973 |
| 195793274 | GT-AG | 0 | 1.000000099473604e-05 | 2460 | rna-XM_009991378.1 34753777 | 5 | 28047 | 30506 | Tauraco erythrolophus 121530 | ATG\|GTAATACAGT...TTTATTTTAATG/ATTTCTTTTATT...ATCAG\|TTG | 0 | 1 | 21.435 |
| 195793275 | GT-AG | 0 | 1.000000099473604e-05 | 2775 | rna-XM_009991378.1 34753777 | 6 | 25165 | 27939 | Tauraco erythrolophus 121530 | TAG\|GTAAGTGTGA...AGAAACTTAAAT/AATGGTTTCATC...CTCAG\|CTT | 2 | 1 | 26.115 |
| 195793276 | GT-AG | 0 | 0.0002152864355814 | 5454 | rna-XM_009991378.1 34753777 | 7 | 19464 | 24917 | Tauraco erythrolophus 121530 | GAA\|GTAAGTTTGG...TGTTCCTTTTTT/TCTAATGTCAAC...TGCAG\|CCG | 0 | 1 | 36.92 |
| 195793277 | GT-AG | 0 | 1.000000099473604e-05 | 2924 | rna-XM_009991378.1 34753777 | 8 | 16202 | 19125 | Tauraco erythrolophus 121530 | GGG\|GTAAGGAGCG...ATTTCATTAACC/CTCAATTTCATT...AACAG\|GAT | 2 | 1 | 51.706 |
| 195793278 | GT-AG | 0 | 0.0001533486652101 | 676 | rna-XM_009991378.1 34753777 | 9 | 15457 | 16132 | Tauraco erythrolophus 121530 | CAG\|GTAAGCTTCA...AGATCTCTAACT/AGGATTCTAACC...TTCAG\|AAA | 2 | 1 | 54.724 |
| 195793279 | GT-AG | 0 | 2.020393154636304e-05 | 904 | rna-XM_009991378.1 34753777 | 10 | 14465 | 15368 | Tauraco erythrolophus 121530 | CAG\|GTAAACGTCA...TACCCCTTGTTT/ATATGGTTTACC...TGTAG\|CCA | 0 | 1 | 58.574 |
| 195793280 | GT-AG | 0 | 0.0002080562741574 | 91 | rna-XM_009991378.1 34753777 | 11 | 14262 | 14352 | Tauraco erythrolophus 121530 | GAG\|GTATTGTGTT...AATGTATTGAAT/AAGAAATTAATT...TTTAG\|ATT | 1 | 1 | 63.473 |
| 195793281 | GT-AG | 0 | 1.000000099473604e-05 | 199 | rna-XM_009991378.1 34753777 | 12 | 13965 | 14163 | Tauraco erythrolophus 121530 | ATG\|GTGGGTAAAA...ACTCTTTTGACA/TGTGTACTAATA...TGCAG\|GGA | 0 | 1 | 67.76 |
| 195793282 | GT-AG | 0 | 1.000000099473604e-05 | 977 | rna-XM_009991378.1 34753777 | 13 | 12900 | 13876 | Tauraco erythrolophus 121530 | ATG\|GTAAGTATCT...ATCATCTTATTT/AATCATCTTATT...TATAG\|GAA | 1 | 1 | 71.61 |
| 195793283 | GT-AG | 0 | 0.0072240926230774 | 4088 | rna-XM_009991378.1 34753777 | 14 | 8662 | 12749 | Tauraco erythrolophus 121530 | TTG\|GTATGGTTTT...TTTTTTTTATTT/TTTTTTTTTATT...CCTAG\|GTA | 1 | 1 | 78.171 |
| 195793284 | GT-AG | 0 | 1.1295204415287527e-05 | 4915 | rna-XM_009991378.1 34753777 | 15 | 3550 | 8464 | Tauraco erythrolophus 121530 | AAG\|GTGTGTATAC...TATATCTTAATG/CTTAATGTCATT...TGCAG\|GCT | 0 | 1 | 86.789 |
| 195793285 | GT-AG | 0 | 1.000000099473604e-05 | 708 | rna-XM_009991378.1 34753777 | 16 | 2710 | 3417 | Tauraco erythrolophus 121530 | CTG\|GTAAGCAGAA...TAATGGTTAGTC/GGAAGACTAATC...TGAAG\|GAG | 0 | 1 | 92.563 |
| 195793286 | GT-AG | 0 | 1.000000099473604e-05 | 295 | rna-XM_009991378.1 34753777 | 17 | 2316 | 2610 | Tauraco erythrolophus 121530 | CAG\|GTAAAGTGTT...AAGTGCTTAATG/AACTAACTTATT...TTTAG\|TGG | 0 | 1 | 96.894 |
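The coordinates in the rows above can be cross-checked against the `length` column. The sketch below assumes, based only on the values listed here, that `start` and `end` are 1-based inclusive positions, so that `length = end - start + 1`; the page does not state this convention explicitly. It also checks that `start` falls as `ordinal_index` rises, which would be consistent with a transcript annotated on the reverse strand, although the strand is not given on this page. The helper name `length_matches_coordinates` is hypothetical.

```python
# (ordinal_index, start, end, length) copied from the 17 rows listed above.
ROWS = [
    (1, 36346, 36629, 284), (2, 35029, 36213, 1185), (3, 32965, 34945, 1981),
    (4, 30609, 32867, 2259), (5, 28047, 30506, 2460), (6, 25165, 27939, 2775),
    (7, 19464, 24917, 5454), (8, 16202, 19125, 2924), (9, 15457, 16132, 676),
    (10, 14465, 15368, 904), (11, 14262, 14352, 91), (12, 13965, 14163, 199),
    (13, 12900, 13876, 977), (14, 8662, 12749, 4088), (15, 3550, 8464, 4915),
    (16, 2710, 3417, 708), (17, 2316, 2610, 295),
]


def length_matches_coordinates(start: int, end: int, length: int) -> bool:
    """True when length equals the inclusive span end - start + 1."""
    return end - start + 1 == length


# Check every row against the assumed inclusive-coordinate convention.
all_consistent = all(
    length_matches_coordinates(start, end, length)
    for _, start, end, length in ROWS
)

# Check that start positions strictly decrease as ordinal_index increases.
starts = [start for _, start, _, _ in sorted(ROWS)]
starts_descending = all(a > b for a, b in zip(starts, starts[1:]))

print(all_consistent, starts_descending)
```

For example, the first intron spans 36346 to 36629, and 36629 - 36346 + 1 = 284, which matches its `length`.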
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
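To illustrate how the schema and its indexes behave, the following sketch loads the DDL above into an in-memory SQLite database with Python's standard `sqlite3` module, inserts the first two rows listed on this page, and runs the same filter the page shows (`transcript_id = 34753777`). It is an illustration under stated assumptions, not the way the database is actually built or served: the referenced `transcripts` and `genomes` tables are not created, so foreign-key enforcement is switched off explicitly.

```python
import sqlite3

SCHEMA = """
CREATE TABLE "introns" (
  "id" INTEGER, "dinucleotide_pair" TEXT, "is_minor" INTEGER,
  "score" REAL, "length" INTEGER, "transcript_id" INTEGER,
  "ordinal_index" INTEGER, "start" INTEGER, "end" INTEGER,
  "taxonomy_id" INTEGER, "scored_motifs" TEXT, "phase" INTEGER,
  "in_cds" INTEGER, "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id] ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase] ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor] ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair] ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score] ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds] ON [introns] ([in_cds]);
"""

conn = sqlite3.connect(":memory:")
# Assumption: the parent tables are absent here, so keep FK checks off.
conn.execute("PRAGMA foreign_keys = OFF")
conn.executescript(SCHEMA)

# First two rows from the listing, with raw integer foreign keys.
sample_rows = [
    (195793270, "GT-AG", 0, 1.000000099473604e-05, 284, 34753777, 1,
     36346, 36629, 121530,
     "GCG|GTAAGTGCTA...AAATACTTAAAT/ACTTAAATCATT...CTTAG|ATT", 0, 1, 3.325),
    (195793271, "GT-AG", 0, 0.0002590296897543, 1185, 34753777, 2,
     35029, 36213, 121530,
     "ACA|GTAAGTTTCT...TTCCCCTTTTTC/CTTTTTCTAATG...TCTAG|ACG", 0, 1, 9.099),
]
placeholders = ", ".join("?" for _ in range(14))
conn.executemany(f"INSERT INTO introns VALUES ({placeholders})", sample_rows)

# Same filter as the page: all introns of one transcript, ordered by id.
rows = conn.execute(
    "SELECT id, ordinal_index, length FROM introns "
    "WHERE transcript_id = ? ORDER BY id",
    (34753777,),
).fetchall()

# Ask SQLite how it would execute the lookup; the last column of each
# plan row is the human-readable detail string.
plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT * FROM introns WHERE transcript_id = ?",
    (34753777,),
).fetchall()
plan_text = " ".join(str(step[-1]) for step in plan)
print(rows)
print(plan_text)
```

The query plan is printed rather than asserted in the text because its exact wording varies between SQLite versions; the point of interest is whether it names `idx_introns_transcript_id`, the index that serves per-transcript lookups like the one on this page.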