introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
25 rows where transcript_id = 6787102
This data as json, CSV (advanced)
Suggested facets: score, phase
| id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 35253060 | GT-AG | 0 | 9.131218274932328e-05 | 3302 | rna-XM_041188979.1 6787102 | 1 | 200972743 | 200976044 | Carcharodon carcharias 13397 | CAG|GTATGTGCAC...GTATTTTTAATC/GTATTTTTAATC...TTAAG|GAT | 0 | 1 | 1.384 |
| 35253061 | GT-AG | 0 | 1.000000099473604e-05 | 8793 | rna-XM_041188979.1 6787102 | 2 | 200976282 | 200985074 | Carcharodon carcharias 13397 | CAG|GTACTGCACC...TGACTTTTAATG/TGACTTTTAATG...TACAG|ATT | 0 | 1 | 8.218 |
| 35253062 | GT-AG | 0 | 1.119823049789451e-05 | 18074 | rna-XM_041188979.1 6787102 | 3 | 200985192 | 201003265 | Carcharodon carcharias 13397 | GAT|GTAAGCAAAT...TCTTCTCTATTC/AATGGGTTTATT...AACAG|GTT | 0 | 1 | 11.592 |
| 35253063 | GT-AG | 0 | 0.0011004244210775 | 5308 | rna-XM_041188979.1 6787102 | 4 | 201003446 | 201008753 | Carcharodon carcharias 13397 | GCA|GTAAGTTTTG...TGATCCTGAGCT/TTGATCCTGAGC...TTCAG|GAA | 0 | 1 | 16.782 |
| 35253064 | GT-AG | 0 | 8.675550258462061e-05 | 5535 | rna-XM_041188979.1 6787102 | 5 | 201008923 | 201014457 | Carcharodon carcharias 13397 | AAG|GTTTGTTATT...TGTATTTTGACA/TGTATTTTGACA...TTTAG|GTA | 1 | 1 | 21.655 |
| 35253065 | GT-AG | 0 | 1.000000099473604e-05 | 6103 | rna-XM_041188979.1 6787102 | 6 | 201014593 | 201020695 | Carcharodon carcharias 13397 | GAG|GTTTGTGCTG...TTGCCTTTTGCG/AGATTATTTACA...TTCAG|AAA | 1 | 1 | 25.548 |
| 35253066 | GT-AG | 0 | 0.0003711616882208 | 640 | rna-XM_041188979.1 6787102 | 7 | 201020860 | 201021499 | Carcharodon carcharias 13397 | AGG|GTATGATGCT...TGCGTTTTAACA/TGCGTTTTAACA...TTTAG|GGT | 0 | 1 | 30.277 |
| 35253067 | GT-AG | 0 | 1.000000099473604e-05 | 2619 | rna-XM_041188979.1 6787102 | 8 | 201021621 | 201024239 | Carcharodon carcharias 13397 | TGG|GTAAGTGAAG...TTTCTCTTACTT/TTACTTTTAATT...TTTAG|CTG | 1 | 1 | 33.766 |
| 35253068 | GT-AG | 0 | 1.000000099473604e-05 | 16211 | rna-XM_041188979.1 6787102 | 9 | 201024299 | 201040509 | Carcharodon carcharias 13397 | GTG|GTAAGACGAT...TTAACCTTTTTC/ATGTAATTTATT...CACAG|CCA | 0 | 1 | 35.467 |
| 35253069 | GT-AG | 0 | 1.000000099473604e-05 | 1104 | rna-XM_041188979.1 6787102 | 10 | 201040612 | 201041715 | Carcharodon carcharias 13397 | CAG|GTAAAGTAGC...TTTCCTTTTGTC/TTTTGTCTGATA...CCTAG|ATC | 0 | 1 | 38.408 |
| 35253070 | GT-AG | 0 | 3.7593503043430496e-05 | 1962 | rna-XM_041188979.1 6787102 | 11 | 201041832 | 201043793 | Carcharodon carcharias 13397 | CCA|GTAAGTTAAT...CACATTTTAAAA/AGCAATTTCACA...GATAG|TCT | 2 | 1 | 41.753 |
| 35253071 | GT-AG | 0 | 1.000000099473604e-05 | 3654 | rna-XM_041188979.1 6787102 | 12 | 201043972 | 201047625 | Carcharodon carcharias 13397 | GAA|GTAAGTCCAG...TTTGCTTTTGCA/ATTGAAGTAACA...TACAG|TGG | 0 | 1 | 46.886 |
| 35253072 | GT-AG | 0 | 1.000000099473604e-05 | 876 | rna-XM_041188979.1 6787102 | 13 | 201047749 | 201048624 | Carcharodon carcharias 13397 | GAG|GTAAAGACTT...CTATTTTTAATA/CTATTTTTAATA...TGCAG|GCA | 0 | 1 | 50.433 |
| 35253073 | GT-AG | 0 | 6.518367634616361e-05 | 2381 | rna-XM_041188979.1 6787102 | 14 | 201048774 | 201051154 | Carcharodon carcharias 13397 | ACT|GTAAGTCATT...TCTTCCTTTTTT/CTTTTTTTCATC...ACTAG|ACA | 2 | 1 | 54.729 |
| 35253074 | GT-AG | 0 | 1.000000099473604e-05 | 9600 | rna-XM_041188979.1 6787102 | 15 | 201051231 | 201060830 | Carcharodon carcharias 13397 | GGG|GTAAGGAGTT...ATACCCTCAGCA/GATACCCTCAGC...CTCAG|CAT | 0 | 1 | 56.92 |
| 35253075 | GT-AG | 0 | 0.0002877171289366 | 5825 | rna-XM_041188979.1 6787102 | 16 | 201060969 | 201066793 | Carcharodon carcharias 13397 | GAA|GTAAGCTGAT...TCCTTTTTAAAT/TCCTTTTTAAAT...TATAG|GTT | 0 | 1 | 60.9 |
| 35253076 | GT-AG | 0 | 1.000000099473604e-05 | 1772 | rna-XM_041188979.1 6787102 | 17 | 201066912 | 201068683 | Carcharodon carcharias 13397 | AAG|GTAAAGAGCT...CATTCTTTGTCA/TTTTGTGTAACA...CTTAG|TAA | 1 | 1 | 64.302 |
| 35253077 | GT-AG | 0 | 3.44990771064237e-05 | 2946 | rna-XM_041188979.1 6787102 | 18 | 201068839 | 201071784 | Carcharodon carcharias 13397 | AAG|GTATGTCAAA...TTGTTTTTGTTT/AGATTATTTAAA...CTTAG|GAT | 0 | 1 | 68.772 |
| 35253078 | GT-AG | 0 | 1.000000099473604e-05 | 1126 | rna-XM_041188979.1 6787102 | 19 | 201071949 | 201073074 | Carcharodon carcharias 13397 | TGG|GTAAGTGACA...ATCTTTTTAATT/ATCTTTTTAATT...TCTAG|GCT | 2 | 1 | 73.501 |
| 35253079 | GT-AG | 0 | 1.000000099473604e-05 | 1108 | rna-XM_041188979.1 6787102 | 20 | 201073228 | 201074335 | Carcharodon carcharias 13397 | TGC|GTAAGGATCT...GTAATTTTGAAT/ATTTTTTTCATT...TATAG|TTC | 2 | 1 | 77.912 |
| 35253080 | GT-AG | 0 | 1.000000099473604e-05 | 3179 | rna-XM_041188979.1 6787102 | 21 | 201074434 | 201077612 | Carcharodon carcharias 13397 | AAG|GTGAGGTCGC...TTTTTGTTACTT/GTTTTTGTTACT...TATAG|AAA | 1 | 1 | 80.738 |
| 35253081 | GT-AG | 0 | 1.000000099473604e-05 | 1580 | rna-XM_041188979.1 6787102 | 22 | 201077738 | 201079317 | Carcharodon carcharias 13397 | CCA|GTGAGTGTGG...GATCTTTTAGTT/ATAATATTAATT...TTCAG|ATT | 0 | 1 | 84.343 |
| 35253082 | GT-AG | 0 | 1.000000099473604e-05 | 12833 | rna-XM_041188979.1 6787102 | 23 | 201079408 | 201092240 | Carcharodon carcharias 13397 | AAG|GTAATGTTTG...TTTCTCTTTTTT/ACACAGTTTACC...ATTAG|ATG | 0 | 1 | 86.938 |
| 35253083 | GT-AG | 0 | 1.000000099473604e-05 | 4914 | rna-XM_041188979.1 6787102 | 24 | 201092442 | 201097355 | Carcharodon carcharias 13397 | AGG|GTGAGTATTT...CAGTTCTTGATT/TCTTGATTAATT...TACAG|AGC | 0 | 1 | 92.734 |
| 35253084 | GT-AG | 0 | 2.133115894857387e-05 | 6711 | rna-XM_041188979.1 6787102 | 25 | 201097458 | 201104168 | Carcharodon carcharias 13397 | AAG|GTATGAATGG...AATGTTTTATAG/TAATGTTTTATA...TACAG|ACT | 0 | 1 | 95.675 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL
,PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);