introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
25 rows where transcript_id = 14424026
This data as json, CSV (advanced)
Suggested facets: score, length, phase
| id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 77101425 | GT-AG | 0 | 1.000000099473604e-05 | 120 | rna-XM_024149956.1 14424026 | 1 | 7324488 | 7324607 | Eutrema salsugineum 72664 | CAG|GTCTGTGTGG...TTTTCTTTCAAG/TTTTCTTTCAAG...TGCAG|GGT | 2 | 1 | 9.239 |
| 77101426 | GT-AG | 0 | 1.000000099473604e-05 | 95 | rna-XM_024149956.1 14424026 | 2 | 7324789 | 7324883 | Eutrema salsugineum 72664 | AAG|GTGAGTTTTG...TGTTCTTTTACA/TGTTCTTTTACA...TCTAG|TTT | 0 | 1 | 11.936 |
| 77101427 | GT-AG | 0 | 8.40872959669196e-05 | 194 | rna-XM_024149956.1 14424026 | 3 | 7324992 | 7325185 | Eutrema salsugineum 72664 | GTG|GTAAGCCTTG...AACGTCTTATAA/CAACGTCTTATA...TTCAG|GCT | 0 | 1 | 13.545 |
| 77101428 | GT-AG | 0 | 0.0029629353060687 | 93 | rna-XM_024149956.1 14424026 | 4 | 7325321 | 7325413 | Eutrema salsugineum 72664 | GAG|GTATTTTCTC...TAATCTATAAAG/TATGAGTTAATC...GGCAG|GCA | 0 | 1 | 15.557 |
| 77101429 | GT-AG | 0 | 1.000000099473604e-05 | 96 | rna-XM_024149956.1 14424026 | 5 | 7325878 | 7325973 | Eutrema salsugineum 72664 | GAG|GTAAGGTGAT...CATTTCTCACCG/GCATTTCTCACC...CTTAG|ATG | 2 | 1 | 22.471 |
| 77101430 | GT-AG | 0 | 1.000000099473604e-05 | 105 | rna-XM_024149956.1 14424026 | 6 | 7326080 | 7326184 | Eutrema salsugineum 72664 | GAG|GTTAGGTATT...AACATTTTATCA/ATAAATCTAACT...TTCAG|ATC | 0 | 1 | 24.05 |
| 77101431 | GT-AG | 0 | 1.000000099473604e-05 | 167 | rna-XM_024149956.1 14424026 | 7 | 7326311 | 7326477 | Eutrema salsugineum 72664 | AAG|GTGGAGTAAA...ATTTTCTTAGAT/TACTTTTTCATT...TGCAG|GCT | 0 | 1 | 25.928 |
| 77101432 | GT-AG | 0 | 8.027343134711674e-05 | 63 | rna-XM_024149956.1 14424026 | 8 | 7326772 | 7326834 | Eutrema salsugineum 72664 | AAG|GTAATTTTTG...TTTTTCATGAAC/CCCTTTTTCATG...AACAG|ATA | 0 | 1 | 30.308 |
| 77101433 | GT-AG | 0 | 1.000000099473604e-05 | 129 | rna-XM_024149956.1 14424026 | 9 | 7326952 | 7327080 | Eutrema salsugineum 72664 | CAG|GTTGCAAAAT...GTCTTTTTAATT/GTCTTTTTAATT...TGCAG|GAG | 0 | 1 | 32.052 |
| 77101434 | GT-AG | 0 | 0.1738559236586867 | 82 | rna-XM_024149956.1 14424026 | 10 | 7327303 | 7327384 | Eutrema salsugineum 72664 | TTG|GTATACCTCT...TATTTCGTGACC/CATGTTCTTATG...AGCAG|AGT | 0 | 1 | 35.36 |
| 77101435 | GT-AG | 0 | 1.000000099473604e-05 | 117 | rna-XM_024149956.1 14424026 | 11 | 7327496 | 7327612 | Eutrema salsugineum 72664 | AAG|GTGCAAATGC...TCTTTCTTAAAA/TTTTAATTTATC...CGCAG|AAA | 0 | 1 | 37.014 |
| 77101436 | GT-AG | 0 | 36.41628770115951 | 106 | rna-XM_024149956.1 14424026 | 12 | 7327748 | 7327853 | Eutrema salsugineum 72664 | AAG|GTATCTTTTT...TGGCCCTTGATA/GCATGCTTCATA...TCCAG|GGA | 0 | 1 | 39.025 |
| 77101437 | GT-AG | 0 | 1.000000099473604e-05 | 86 | rna-XM_024149956.1 14424026 | 13 | 7328220 | 7328305 | Eutrema salsugineum 72664 | AGT|GTAAGAAATT...CATATATTATCT/ATTTTGCTAATG...TTCAG|GCA | 0 | 1 | 44.479 |
| 77101438 | GT-AG | 0 | 1.000000099473604e-05 | 107 | rna-XM_024149956.1 14424026 | 14 | 7328396 | 7328502 | Eutrema salsugineum 72664 | GAG|GTGAAATCGT...ATTTTTTCAATC/CATTTTTTCAAT...TACAG|GCT | 0 | 1 | 45.82 |
| 77101439 | GT-AG | 0 | 1.000000099473604e-05 | 79 | rna-XM_024149956.1 14424026 | 15 | 7328767 | 7328845 | Eutrema salsugineum 72664 | CAG|GTTCTGTTTC...GGTTCCTTCACT/CTGTTTCTCAAT...TGCAG|GTT | 0 | 1 | 49.754 |
| 77101440 | GT-AG | 0 | 3.1960745406611245e-05 | 103 | rna-XM_024149956.1 14424026 | 16 | 7328970 | 7329072 | Eutrema salsugineum 72664 | TGG|GTATGAAACT...TTTTCGTTAATG/CTTGATTTTATT...TAAAG|GAA | 1 | 1 | 51.602 |
| 77101441 | GT-AG | 0 | 0.5417669228476164 | 391 | rna-XM_024149956.1 14424026 | 17 | 7329835 | 7330225 | Eutrema salsugineum 72664 | TCG|GTATGCTCTA...ATCTCCTTATAA/TATCTCCTTATA...CTCAG|GAA | 1 | 1 | 62.956 |
| 77101442 | GT-AG | 0 | 0.0013174256628421 | 146 | rna-XM_024149956.1 14424026 | 18 | 7330458 | 7330603 | Eutrema salsugineum 72664 | CAG|GTATGTGTGA...TCCTTCTTAGCC/TTGATTGTTACC...AACAG|GTC | 2 | 1 | 66.413 |
| 77101443 | GT-AG | 0 | 1.000000099473604e-05 | 96 | rna-XM_024149956.1 14424026 | 19 | 7330707 | 7330802 | Eutrema salsugineum 72664 | AAG|GTAAATGCTC...TCGATCTTAGTT/ATCTTAGTTATC...TACAG|GTT | 0 | 1 | 67.948 |
| 77101444 | GT-AG | 0 | 0.0022697677718364 | 84 | rna-XM_024149956.1 14424026 | 20 | 7331198 | 7331281 | Eutrema salsugineum 72664 | GAG|GTATATAATT...TTTTTCTTACTC/TTTTTTCTTACT...TGTAG|GCT | 2 | 1 | 73.834 |
| 77101445 | GT-AG | 0 | 3.315809858132191e-05 | 72 | rna-XM_024149956.1 14424026 | 21 | 7331512 | 7331583 | Eutrema salsugineum 72664 | AAG|GTAGATATGA...ATCTCATTGATT/CATGATCTCATT...ACCAG|ACA | 1 | 1 | 77.261 |
| 77101446 | GT-AG | 0 | 1.000000099473604e-05 | 198 | rna-XM_024149956.1 14424026 | 22 | 7331825 | 7332022 | Eutrema salsugineum 72664 | AAG|GTATGACGGC...TTTTTGTTAACA/TTTTTGTTAACA...TCTAG|GAG | 2 | 1 | 80.852 |
| 77101447 | GT-AG | 0 | 1.000000099473604e-05 | 129 | rna-XM_024149956.1 14424026 | 23 | 7332435 | 7332563 | Eutrema salsugineum 72664 | CAG|GTTCGTCAAT...AATTTCTTACCT/TGTGTTCTGATA...ATTAG|ATG | 0 | 1 | 86.992 |
| 77101448 | GT-AG | 0 | 0.0010881106280449 | 94 | rna-XM_024149956.1 14424026 | 24 | 7332804 | 7332897 | Eutrema salsugineum 72664 | GAG|GTAACATATA...CTTTTCTTGATG/CTTTTCTTGATG...TTCAG|GGG | 0 | 1 | 90.568 |
| 77101449 | GT-AG | 0 | 5.1362719324465634e-05 | 136 | rna-XM_024149956.1 14424026 | 25 | 7332986 | 7333121 | Eutrema salsugineum 72664 | GTG|GTACGATTCC...GATTCTTCAACT/TGGTTTCTGATA...CGCAG|GAG | 1 | 1 | 91.879 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL
,PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);