introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
23 rows where transcript_id = 14424093
This data as json, CSV (advanced)
Suggested facets: score, length, phase
| id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 77102386 | GT-AG | 0 | 1.000000099473604e-05 | 82 | rna-XM_024149487.1 14424093 | 1 | 1902626 | 1902707 | Eutrema salsugineum 72664 | AAG|GTCAACAAAA...GATTTCTTCTCT/CATGTTTTGAGC...TTCAG|GTT | 0 | 1 | 3.523 |
| 77102387 | GT-AG | 0 | 1.000000099473604e-05 | 86 | rna-XM_024149487.1 14424093 | 2 | 1902488 | 1902573 | Eutrema salsugineum 72664 | ATG|GTGAGCATTG...GTTTTCTGATTT/TGTTTTCTGATT...TCCAG|GAT | 1 | 1 | 4.978 |
| 77102388 | GT-AG | 0 | 4.09498526022224e-05 | 138 | rna-XM_024149487.1 14424093 | 3 | 1902302 | 1902439 | Eutrema salsugineum 72664 | ATG|GTAAGTTCTG...TATTTGTTAGTT/GAGAAACTTATC...AACAG|CTA | 1 | 1 | 6.32 |
| 77102389 | GT-AG | 0 | 1.000000099473604e-05 | 89 | rna-XM_024149487.1 14424093 | 4 | 1902058 | 1902146 | Eutrema salsugineum 72664 | AAG|GTATGGGATG...TGTGGTTTAAAA/TGTGGTTTAAAA...TCCAG|GGA | 0 | 1 | 10.654 |
| 77102390 | GT-AG | 0 | 0.0002090687095415 | 180 | rna-XM_024149487.1 14424093 | 5 | 1901820 | 1901999 | Eutrema salsugineum 72664 | CTG|GTACTGTGTC...TTGGTTTTGACC/TTGGTTTTGACC...TGCAG|TAA | 1 | 1 | 12.276 |
| 77102391 | GT-AG | 0 | 6.043246413835744e-05 | 120 | rna-XM_024149487.1 14424093 | 6 | 1901642 | 1901761 | Eutrema salsugineum 72664 | CAG|GTTCATTTCT...TTTTTCTTTGCA/AAATCATTTATT...GGAAG|CCC | 2 | 1 | 13.898 |
| 77102392 | GT-AG | 0 | 1.000000099473604e-05 | 66 | rna-XM_024149487.1 14424093 | 7 | 1901483 | 1901548 | Eutrema salsugineum 72664 | AAG|GTGGGTTCCT...AGATCCTTATGA/GTCGTTCTGATA...TACAG|TGA | 2 | 1 | 16.499 |
| 77102393 | GT-AG | 0 | 1.318655757500488e-05 | 97 | rna-XM_024149487.1 14424093 | 8 | 1901226 | 1901322 | Eutrema salsugineum 72664 | AAG|GTAGGTTACT...TGTTCGTTAACG/TTGAGCCTGACC...TGCAG|GCG | 0 | 1 | 20.973 |
| 77102394 | GT-AG | 0 | 25.462077169082537 | 105 | rna-XM_024149487.1 14424093 | 9 | 1901105 | 1901209 | Eutrema salsugineum 72664 | TAT|GTATCCACAT...ATGGCCTTCACG/ATGGCCTTCACG...GACAG|ATA | 1 | 1 | 21.421 |
| 77102395 | GT-AG | 0 | 5.5306554826178655e-05 | 80 | rna-XM_024149487.1 14424093 | 10 | 1900804 | 1900883 | Eutrema salsugineum 72664 | AAG|GTATTAGATA...ATATTTTTGATA/ATATTTTTGATA...GGCAG|AAT | 0 | 1 | 27.601 |
| 77102396 | GT-AG | 0 | 0.0105988239877942 | 81 | rna-XM_024149487.1 14424093 | 11 | 1900653 | 1900733 | Eutrema salsugineum 72664 | TTG|GTATGCCACG...TTATCCCTAATA/TAATTTTTTACG...CTTAG|AAC | 1 | 1 | 29.558 |
| 77102397 | GT-AG | 0 | 1.000000099473604e-05 | 120 | rna-XM_024149487.1 14424093 | 12 | 1900452 | 1900571 | Eutrema salsugineum 72664 | AAG|GTGCACAAAT...CAGCCTTTACCA/CTGCATCTTATG...TGCAG|AGT | 1 | 1 | 31.823 |
| 77102398 | GT-AG | 0 | 1.000000099473604e-05 | 98 | rna-XM_024149487.1 14424093 | 13 | 1899019 | 1899116 | Eutrema salsugineum 72664 | CAG|GTTCATACTA...AAGAAATTAACG/AAGAAATTAACG...TGCAG|AAC | 1 | 1 | 69.155 |
| 77102399 | GT-AG | 0 | 1.66185790213328e-05 | 207 | rna-XM_024149487.1 14424093 | 14 | 1898617 | 1898823 | Eutrema salsugineum 72664 | AAG|GTAATTATTA...TACTACTTATCT/TTACTACTTATC...CGCAG|ACT | 1 | 1 | 74.609 |
| 77102400 | GT-AG | 0 | 1.000000099473604e-05 | 98 | rna-XM_024149487.1 14424093 | 15 | 1898445 | 1898542 | Eutrema salsugineum 72664 | CAA|GTTAAGCAGA...GTATCTATAGTA/AGATAACTGAGT...GCCAG|GTT | 0 | 1 | 76.678 |
| 77102401 | GT-AG | 0 | 1.6115121036654465e-05 | 138 | rna-XM_024149487.1 14424093 | 16 | 1898106 | 1898243 | Eutrema salsugineum 72664 | CAG|GTTCATCCTA...TTGCTCTTAATA/TTGCTCTTAATA...CACAG|CAT | 0 | 1 | 82.299 |
| 77102402 | GT-AG | 0 | 1.000000099473604e-05 | 107 | rna-XM_024149487.1 14424093 | 17 | 1897954 | 1898060 | Eutrema salsugineum 72664 | AAG|GTAAGAGCCT...GTGCTTCTATCA/TATCAACTGACA...ACTAG|GTC | 0 | 1 | 83.557 |
| 77102403 | GT-AG | 0 | 0.0473033729456863 | 104 | rna-XM_024149487.1 14424093 | 18 | 1897796 | 1897899 | Eutrema salsugineum 72664 | TTG|GTATGTTTAA...TCTGTCTTGATG/TAATTATTGATT...AAAAG|GTA | 0 | 1 | 85.067 |
| 77102404 | GT-AG | 0 | 0.0005810128092723 | 109 | rna-XM_024149487.1 14424093 | 19 | 1897580 | 1897688 | Eutrema salsugineum 72664 | GAA|GTAAATTTTC...ATTCTCTGAATG/GATTCTCTGAAT...TGCAG|GAA | 2 | 1 | 88.059 |
| 77102405 | GT-AG | 0 | 1.000000099473604e-05 | 100 | rna-XM_024149487.1 14424093 | 20 | 1897431 | 1897530 | Eutrema salsugineum 72664 | GCG|GTTAGGCGCC...GGTCACTTGAAT/TCTCTGTTTACA...TGTAG|GTC | 0 | 1 | 89.43 |
| 77102406 | GT-AG | 0 | 0.0008966776642042 | 80 | rna-XM_024149487.1 14424093 | 21 | 1897291 | 1897370 | Eutrema salsugineum 72664 | CAG|GTACGCTCAG...TTTTCCTTTAAG/AAGATTCTGACT...ACCAG|CTT | 0 | 1 | 91.107 |
| 77102407 | GT-AG | 0 | 0.0087097569099999 | 110 | rna-XM_024149487.1 14424093 | 22 | 1897108 | 1897217 | Eutrema salsugineum 72664 | GAG|GTATATTCCT...GTCTTTTTATAC/CTTGGCTTCACT...CTAAG|GTG | 1 | 1 | 93.149 |
| 77102408 | GT-AG | 0 | 1.000000099473604e-05 | 73 | rna-XM_024149487.1 14424093 | 23 | 1896949 | 1897021 | Eutrema salsugineum 72664 | CAG|GTTAGTAAGA...TCATCTTTTATG/AAAGGATTCATC...TATAG|TGT | 0 | 1 | 95.554 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL
,PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);