introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
23 rows where transcript_id = 21436581
This data as json, CSV (advanced)
Suggested facets: score, phase, in_cds
| id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 115635030 | GT-AG | 0 | 1.000000099473604e-05 | 841 | rna-XM_034065672.1 21436581 | 4 | 34238048 | 34238888 | Melopsittacus undulatus 13146 | CAG|GTTAGACCAT...TGACACTAAATA/GTGACACTAAAT...TCCAG|CCC | 2 | 1 | 10.83 |
| 115635031 | GT-AG | 0 | 1.000000099473604e-05 | 22438 | rna-XM_034065672.1 21436581 | 5 | 34239062 | 34261499 | Melopsittacus undulatus 13146 | AAG|GTAAGAAGAC...TATCACTTAGCA/TTTTGTATCACT...TGCAG|AAA | 1 | 1 | 14.413 |
| 115635032 | GT-AG | 0 | 1.000000099473604e-05 | 4448 | rna-XM_034065672.1 21436581 | 6 | 34261604 | 34266051 | Melopsittacus undulatus 13146 | GAG|GTGAAGAGCA...TTCTCTTTCTCT/CTCTCCCTCATT...TTTAG|GCC | 0 | 1 | 16.567 |
| 115635033 | GT-AG | 0 | 0.0046995437190865 | 1401 | rna-XM_034065672.1 21436581 | 7 | 34266230 | 34267630 | Melopsittacus undulatus 13146 | TAG|GTATATATCT...CAGGTTTTATCT/TTTTATCTCACT...CTTAG|TGA | 1 | 1 | 20.253 |
| 115635034 | GT-AG | 0 | 1.000000099473604e-05 | 894 | rna-XM_034065672.1 21436581 | 8 | 34267817 | 34268710 | Melopsittacus undulatus 13146 | AAG|GTAATTACTG...AGAATCTTCCTC/AAATTACTGAAG...TTAAG|ATT | 1 | 1 | 24.104 |
| 115635035 | GT-AG | 0 | 0.0004548637319221 | 89 | rna-XM_034065672.1 21436581 | 9 | 34268813 | 34268901 | Melopsittacus undulatus 13146 | AAG|GTATTGTGAA...GGTTTTTTAATC/GGTTTTTTAATC...ATCAG|GCA | 1 | 1 | 26.217 |
| 115635036 | GT-AG | 0 | 1.000000099473604e-05 | 3379 | rna-XM_034065672.1 21436581 | 10 | 34269043 | 34272421 | Melopsittacus undulatus 13146 | AAG|GTTAGGATTC...TTCTCCTTAGCT/CTTAGCTTAATC...TGCAG|TGC | 1 | 1 | 29.136 |
| 115635037 | GT-AG | 0 | 0.0001475484301181 | 2495 | rna-XM_034065672.1 21436581 | 11 | 34272570 | 34275064 | Melopsittacus undulatus 13146 | CAG|GTAATCCTGA...GTCATTTTAGCT/TTTTAGCTAATG...CATAG|AGA | 2 | 1 | 32.201 |
| 115635038 | GT-AG | 0 | 1.000000099473604e-05 | 9718 | rna-XM_034065672.1 21436581 | 12 | 34276335 | 34286052 | Melopsittacus undulatus 13146 | CAG|GTAAGCAAAT...TGTTTTTTTAAT/TGTTTTTTTAAT...TGCAG|CAA | 0 | 1 | 58.501 |
| 115635039 | GT-AG | 0 | 1.000000099473604e-05 | 3011 | rna-XM_034065672.1 21436581 | 13 | 34286264 | 34289274 | Melopsittacus undulatus 13146 | GCA|GTGAGTCCAG...TTTTTCTTTGCT/ATTCTTGTAACT...TCTAG|CTT | 1 | 1 | 62.87 |
| 115635040 | GT-AG | 0 | 1.000000099473604e-05 | 2080 | rna-XM_034065672.1 21436581 | 14 | 34289485 | 34291564 | Melopsittacus undulatus 13146 | CAG|GTAAGATGAC...GTGTTCTGAACC/GGTGTTCTGAAC...TGCAG|GAA | 1 | 1 | 67.219 |
| 115635041 | GT-AG | 0 | 2.936810234839769e-05 | 538 | rna-XM_034065672.1 21436581 | 15 | 34291781 | 34292318 | Melopsittacus undulatus 13146 | TGG|GTAAGTTGTA...AGATTCTTATAT/TAGATTCTTATA...ATTAG|GGC | 1 | 1 | 71.692 |
| 115635042 | GT-AG | 0 | 1.000000099473604e-05 | 2481 | rna-XM_034065672.1 21436581 | 16 | 34292449 | 34294929 | Melopsittacus undulatus 13146 | CAG|GTGAGTCTTA...AGCTGTTTACCA/GCTGTGTTCATG...CACAG|GCA | 2 | 1 | 74.384 |
| 115635043 | GT-AG | 0 | 0.0010544865168247 | 1906 | rna-XM_034065672.1 21436581 | 17 | 34295096 | 34297001 | Melopsittacus undulatus 13146 | CAG|GTATATTGGG...AAAGCCATATCC/CTATTGTTCATG...TGCAG|AGC | 0 | 1 | 77.821 |
| 115635044 | GT-AG | 0 | 1.000000099473604e-05 | 859 | rna-XM_034065672.1 21436581 | 18 | 34297281 | 34298139 | Melopsittacus undulatus 13146 | CAG|GTAAAGCCCC...TTTTCCTTCTAA/TTCCTTCTAAAA...TTTAG|AAT | 0 | 1 | 83.599 |
| 115635045 | GT-AG | 0 | 1.000000099473604e-05 | 522 | rna-XM_034065672.1 21436581 | 19 | 34298218 | 34298739 | Melopsittacus undulatus 13146 | CAG|GTAAAAGAAA...CTGGCCTTAGGA/GGGTTATTGATA...TCTAG|GGC | 0 | 1 | 85.214 |
| 115635046 | GT-AG | 0 | 8.152823112087733e-05 | 1348 | rna-XM_034065672.1 21436581 | 20 | 34298975 | 34300322 | Melopsittacus undulatus 13146 | ACG|GTACTGGTTC...CTTGTTTTAACT/AAATTTTTCATT...TTCAG|GTA | 1 | 1 | 90.081 |
| 115635047 | GT-AG | 0 | 1.6615228566971142e-05 | 2253 | rna-XM_034065672.1 21436581 | 21 | 34300501 | 34302753 | Melopsittacus undulatus 13146 | CAG|GTACATCTAG...GCATTCTAAGTC/CTAAGTCTGACA...TGTAG|CAT | 2 | 1 | 93.767 |
| 115635048 | GT-AG | 0 | 1.000000099473604e-05 | 5521 | rna-XM_034065672.1 21436581 | 22 | 34302953 | 34308473 | Melopsittacus undulatus 13146 | CAG|GTGAGAATTT...TACTTTTTAATT/TTTTAATTCATT...AATAG|GTT | 0 | 1 | 97.888 |
| 115635049 | GT-AG | 0 | 1.000000099473604e-05 | 4640 | rna-XM_034065672.1 21436581 | 23 | 34308564 | 34313203 | Melopsittacus undulatus 13146 | CGG|GTAAGAAATT...TTTCTCTTTTTT/GAAAAATTAATT...TGTAG|AAA | 0 | 1 | 99.752 |
| 115645529 | GT-AG | 0 | 0.000288807584343 | 9576 | rna-XM_034065672.1 21436581 | 1 | 34121549 | 34131124 | Melopsittacus undulatus 13146 | CCG|GTAACGGCTC...TCTTCCTTAAAA/ATTTCTTTCAAT...CCTAG|CAT | 0 | 2.381 | |
| 115645530 | GT-AG | 0 | 1.000000099473604e-05 | 61380 | rna-XM_034065672.1 21436581 | 2 | 34131205 | 34192584 | Melopsittacus undulatus 13146 | TTG|GTGAGTTGTC...TACAATTTAATA/TACAATTTAATA...CACAG|GAC | 0 | 4.038 | |
| 115645531 | GT-AG | 0 | 1.000000099473604e-05 | 45116 | rna-XM_034065672.1 21436581 | 3 | 34192830 | 34237945 | Melopsittacus undulatus 13146 | TAG|GTAAGTTGCT...CTTGCCTTTTCT/AGAGAGCTCATG...TTCAG|TTG | 0 | 9.112 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL
,PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);