introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
25 rows where transcript_id = 15236048
This data as json, CSV (advanced)
Suggested facets: score, phase
| id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 82574917 | GT-AG | 0 | 1.000000099473604e-05 | 1180 | rna-XM_041504256.1 15236048 | 1 | 32695328 | 32696507 | Gigantopelta aegis 1735272 | AAA|GTAAATTAAA...ACTGTTTTCACG/ACTGTTTTCACG...TTTAG|GAT | 1 | 1 | 5.349 |
| 82574918 | GT-AG | 0 | 1.000000099473604e-05 | 2682 | rna-XM_041504256.1 15236048 | 2 | 32696621 | 32699302 | Gigantopelta aegis 1735272 | GAT|GTAAGACTTA...AACTCTTTTACT/CTTTTACTGACA...TTCAG|ATT | 0 | 1 | 8.096 |
| 82574919 | GT-AG | 0 | 1.000000099473604e-05 | 1580 | rna-XM_041504256.1 15236048 | 3 | 32699371 | 32700950 | Gigantopelta aegis 1735272 | CAC|GTAAGTCAGT...TTGTTTTTATTC/TTTATTCTAATT...TTCAG|CTC | 2 | 1 | 9.75 |
| 82574920 | GT-AG | 0 | 0.0562369946146679 | 3060 | rna-XM_041504256.1 15236048 | 4 | 32701280 | 32704339 | Gigantopelta aegis 1735272 | GGG|GTATGTTTGT...TTTTCTTTACTT/ATTTTCTTTACT...CATAG|GTT | 1 | 1 | 17.749 |
| 82574921 | GT-AG | 0 | 0.0001449953418914 | 1424 | rna-XM_041504256.1 15236048 | 5 | 32704453 | 32705876 | Gigantopelta aegis 1735272 | GAG|GTATTTAAAC...TCATTTGTAACC/CCATTACTAATC...TTCAG|ATC | 0 | 1 | 20.496 |
| 82574922 | GT-AG | 0 | 1.000000099473604e-05 | 2985 | rna-XM_041504256.1 15236048 | 6 | 32705945 | 32708929 | Gigantopelta aegis 1735272 | TGG|GTAAGATATA...CAGACGTTCACC/CAGACGTTCACC...TGCAG|TAA | 2 | 1 | 22.149 |
| 82574923 | GT-AG | 0 | 1.000000099473604e-05 | 787 | rna-XM_041504256.1 15236048 | 7 | 32709244 | 32710030 | Gigantopelta aegis 1735272 | TCG|GTAATGGCTG...AACACCCTGGCT/CCAATGCTAACA...TACAG|ATT | 1 | 1 | 29.784 |
| 82574924 | GT-AG | 0 | 1.000000099473604e-05 | 1194 | rna-XM_041504256.1 15236048 | 8 | 32710144 | 32711337 | Gigantopelta aegis 1735272 | GAC|GTGAGTTTTC...TTGCTTGTGAAT/TTGCTTGTGAAT...TTCAG|ATA | 0 | 1 | 32.531 |
| 82574925 | GT-AG | 0 | 1.0996031195680358e-05 | 1422 | rna-XM_041504256.1 15236048 | 9 | 32711406 | 32712827 | Gigantopelta aegis 1735272 | GGG|GTAAGTATTG...ATTTTCTTATAG/TATTTTCTTATA...TTCAG|GAG | 2 | 1 | 34.184 |
| 82574926 | GT-AG | 0 | 1.000000099473604e-05 | 2253 | rna-XM_041504256.1 15236048 | 10 | 32713142 | 32715394 | Gigantopelta aegis 1735272 | TAG|GTAATAAACG...AAATGTTTGACT/AAATGTTTGACT...TTCAG|ATT | 1 | 1 | 41.819 |
| 82574927 | GT-AG | 0 | 1.000000099473604e-05 | 4757 | rna-XM_041504256.1 15236048 | 11 | 32715508 | 32720264 | Gigantopelta aegis 1735272 | GAC|GTAAGTCAAA...GTTTTCGTACTT/TGTTTTCGTACT...TTCAG|ATC | 0 | 1 | 44.566 |
| 82574928 | GT-AG | 0 | 1.000000099473604e-05 | 1225 | rna-XM_041504256.1 15236048 | 12 | 32720333 | 32721557 | Gigantopelta aegis 1735272 | TGG|GTAAGTTATT...TTGTATTTAAAT/TTGTATTTAAAT...GACAG|GAA | 2 | 1 | 46.219 |
| 82574929 | GT-AG | 0 | 7.338486424823991e-05 | 3289 | rna-XM_041504256.1 15236048 | 13 | 32721860 | 32725148 | Gigantopelta aegis 1735272 | AAG|GTAACGTATG...ACTTGTTTAAAA/ACTTGTTTAAAA...TTCAG|AAT | 1 | 1 | 53.562 |
| 82574930 | GT-AG | 0 | 1.000000099473604e-05 | 1854 | rna-XM_041504256.1 15236048 | 14 | 32725262 | 32727115 | Gigantopelta aegis 1735272 | GAC|GTAAGTAATT...TCGTTCTTATGT/TTCGTTCTTATG...TGTAG|ATC | 0 | 1 | 56.309 |
| 82574931 | GT-AG | 0 | 1.000000099473604e-05 | 629 | rna-XM_041504256.1 15236048 | 15 | 32727184 | 32727812 | Gigantopelta aegis 1735272 | CAA|GTTAGTAATC...ACTTTTGTAACA/ACTTTTGTAACA...TGCAG|TGA | 2 | 1 | 57.963 |
| 82574932 | GT-AG | 0 | 1.000000099473604e-05 | 3316 | rna-XM_041504256.1 15236048 | 16 | 32728097 | 32731412 | Gigantopelta aegis 1735272 | AAT|GTAAGACTGG...TTTTTTTTTTCT/GTTTTCCTCACG...TCCAG|TTA | 1 | 1 | 64.867 |
| 82574933 | GT-AG | 0 | 0.0019383580474274 | 1645 | rna-XM_041504256.1 15236048 | 17 | 32731526 | 32733170 | Gigantopelta aegis 1735272 | GAC|GTAATTATCA...TTTATCTTAACA/TTTTTATTTATT...TTCAG|ATA | 0 | 1 | 67.615 |
| 82574934 | GT-AG | 0 | 0.0001816800150911 | 6170 | rna-XM_041504256.1 15236048 | 18 | 32733242 | 32739411 | Gigantopelta aegis 1735272 | CCA|GTACGTATAC...AAAACTTTACAA/AATGATCTCACG...TCCAG|GCC | 2 | 1 | 69.341 |
| 82574935 | GT-AG | 0 | 1.000000099473604e-05 | 2377 | rna-XM_041504256.1 15236048 | 19 | 32739636 | 32742012 | Gigantopelta aegis 1735272 | ATG|GTGAGCATGT...TGTTCCTTTATG/TTATGTATCATT...TCCAG|ATG | 1 | 1 | 74.787 |
| 82574936 | GT-AG | 0 | 8.940889158228022e-05 | 1540 | rna-XM_041504256.1 15236048 | 20 | 32742126 | 32743665 | Gigantopelta aegis 1735272 | GGT|GTAAGTGTTG...GTTTTTTTATAT/AATGTTTTCATT...TCCAG|ATC | 0 | 1 | 77.535 |
| 82574937 | GT-AG | 0 | 4.738904581814139e-05 | 5716 | rna-XM_041504256.1 15236048 | 21 | 32743734 | 32749449 | Gigantopelta aegis 1735272 | TGG|GTAAGTTGTA...TTTTTCTTCATT/TTTTTCTTCATT...TTCAG|AAA | 2 | 1 | 79.188 |
| 82574938 | GT-AG | 0 | 0.0001175740580023 | 1987 | rna-XM_041504256.1 15236048 | 22 | 32749683 | 32751669 | Gigantopelta aegis 1735272 | CAG|GTTTGTTGTT...CAAACTTTACCT/ACAAACTTTACC...TTCAG|TTC | 1 | 1 | 84.853 |
| 82574939 | GT-AG | 0 | 1.000000099473604e-05 | 830 | rna-XM_041504256.1 15236048 | 23 | 32751786 | 32752615 | Gigantopelta aegis 1735272 | CAG|GTACGAATAC...TTACTATTATAT/CTGGTTTTCAGG...TACAG|ATG | 0 | 1 | 87.673 |
| 82574940 | GT-AG | 0 | 1.000000099473604e-05 | 1819 | rna-XM_041504256.1 15236048 | 24 | 32752678 | 32754496 | Gigantopelta aegis 1735272 | TAC|GTGAGTATCC...ATTTTTATGATT/ATTTTTATGATT...TTTAG|ACC | 2 | 1 | 89.181 |
| 82574941 | GT-AG | 0 | 0.0003398438625012 | 3356 | rna-XM_041504256.1 15236048 | 25 | 32754760 | 32758115 | Gigantopelta aegis 1735272 | CAG|GTATGATACA...TAATTTTTAATT/TAATTTTTAATT...CCCAG|GTT | 1 | 1 | 95.575 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL
,PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);