introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron (e.g., GT-AG)
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, probability (0-100%) that the intron is minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; NULL for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
25 rows where transcript_id = 21550622
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 116462296 | GT-AG | 0 | 1.000000099473604e-05 | 52341 | 21550622 (rna-XM_021643432.1) | 1 | 3341381 | 3393721 | 10047 (Meriones unguiculatus) | CAG\|GTGAGGCGGG...GCCTCCTCATCT/TGCCTCCTCATC...TTCAG\|ACT | 1 | 1 | 1.672 |
| 116462297 | GT-AG | 0 | 0.0009780673172947 | 6019 | 21550622 (rna-XM_021643432.1) | 2 | 3393859 | 3399877 | 10047 (Meriones unguiculatus) | AAA\|GTAAGCTGCA...CTCCCTTTAATC/TTTAATCTCATT...TTCAG\|AAC | 0 | 1 | 5.428 |
| 116462298 | GT-AG | 0 | 1.000000099473604e-05 | 3207 | 21550622 (rna-XM_021643432.1) | 3 | 3400057 | 3403263 | 10047 (Meriones unguiculatus) | CAG\|GTGAGTCCCC...ACAATCTTAAAC/ACAGACTTTATA...TGTAG\|CTC | 2 | 1 | 10.334 |
| 116462299 | GT-AG | 0 | 1.000000099473604e-05 | 9461 | 21550622 (rna-XM_021643432.1) | 4 | 3403421 | 3412881 | 10047 (Meriones unguiculatus) | AAG\|GTAATGAATG...AATCCCTTCCTC/AGAGGTGTAATC...CATAG\|GAG | 0 | 1 | 14.638 |
| 116462300 | GT-AG | 0 | 1.000000099473604e-05 | 644 | 21550622 (rna-XM_021643432.1) | 5 | 3412971 | 3413614 | 10047 (Meriones unguiculatus) | TGG\|GTAAGTCTGG...TGATCCCTAGTT/TGCACTCTGACT...TCCAG\|CCA | 2 | 1 | 17.078 |
| 116462301 | GT-AG | 0 | 1.000000099473604e-05 | 588 | 21550622 (rna-XM_021643432.1) | 6 | 3413768 | 3414355 | 10047 (Meriones unguiculatus) | CAG\|GTAGTTCTAA...ACACTCTTGTTT/TTGTTTGTGATG...TCTAG\|TCC | 2 | 1 | 21.272 |
| 116462302 | GT-AG | 0 | 1.000000099473604e-05 | 347 | 21550622 (rna-XM_021643432.1) | 7 | 3414477 | 3414823 | 10047 (Meriones unguiculatus) | CAG\|GTAATGCACT...AGTTCCTTTCCA/GCAAGGCTGACT...CCAAG\|GAC | 0 | 1 | 24.589 |
| 116462303 | GT-AG | 0 | 1.000000099473604e-05 | 1491 | 21550622 (rna-XM_021643432.1) | 8 | 3415008 | 3416498 | 10047 (Meriones unguiculatus) | TAG\|GTAAGCCAAC...TTGGCTTCAGAG/TTCTGTGTAATT...TTCAG\|ATC | 1 | 1 | 29.633 |
| 116462304 | GT-AG | 0 | 1.000000099473604e-05 | 4480 | 21550622 (rna-XM_021643432.1) | 9 | 3416979 | 3421458 | 10047 (Meriones unguiculatus) | ACA\|GTAAGTTCAC...CCTCCCTTCTGG/CTTCTGGTGAGA...CACAG\|TGC | 1 | 1 | 42.791 |
| 116462305 | GT-AG | 0 | 1.817516956426738e-05 | 2139 | 21550622 (rna-XM_021643432.1) | 10 | 3421662 | 3423800 | 10047 (Meriones unguiculatus) | CTG\|GTAAGCAGTT...TTCTCTTTCACC/TTCTCTTTCACC...TCCAG\|CAG | 0 | 1 | 48.355 |
| 116462306 | GT-AG | 0 | 1.000000099473604e-05 | 10606 | 21550622 (rna-XM_021643432.1) | 11 | 3423957 | 3434562 | 10047 (Meriones unguiculatus) | TCG\|GTGAGTCGCT...TTCTCCTTCCCA/CCTTCCCACATT...TGTAG\|TAC | 0 | 1 | 52.632 |
| 116462307 | GT-AG | 0 | 1.000000099473604e-05 | 818 | 21550622 (rna-XM_021643432.1) | 12 | 3434821 | 3435638 | 10047 (Meriones unguiculatus) | TCG\|GTGAGCACTC...TTCCTCTTACCC/TTTCCTCTTACC...CCTAG\|AGT | 0 | 1 | 59.704 |
| 116462308 | GT-AG | 0 | 1.2612331826458532e-05 | 1512 | 21550622 (rna-XM_021643432.1) | 13 | 3435735 | 3437246 | 10047 (Meriones unguiculatus) | GGA\|GTAAGTAACC...CGTTCCTTCTCT/GCCTGTCCAACA...TGCAG\|GAA | 0 | 1 | 62.336 |
| 116462309 | GT-AG | 0 | 1.000000099473604e-05 | 2377 | 21550622 (rna-XM_021643432.1) | 14 | 3437330 | 3439706 | 10047 (Meriones unguiculatus) | GAG\|GTAAGAAGTG...ACACTTTCAACC/GCATGGCTGACA...CCCAG\|GTC | 2 | 1 | 64.611 |
| 116462310 | GT-AG | 0 | 1.000000099473604e-05 | 1567 | 21550622 (rna-XM_021643432.1) | 15 | 3439819 | 3441385 | 10047 (Meriones unguiculatus) | GAG\|GTAAGGGTTC...TCAGCCTTGCTT/CCTTGCTTCATG...CCCAG\|GGT | 0 | 1 | 67.681 |
| 116462311 | GT-AG | 0 | 1.000000099473604e-05 | 3247 | 21550622 (rna-XM_021643432.1) | 16 | 3441462 | 3444708 | 10047 (Meriones unguiculatus) | AAG\|GTATGGCAGC...TCTACCTAAATG/CTCTACCTAAAT...TGCAG\|ACC | 1 | 1 | 69.764 |
| 116462312 | AT-AC | 1 | 99.99999997234671 | 2022 | 21550622 (rna-XM_021643432.1) | 17 | 3444794 | 3446815 | 10047 (Meriones unguiculatus) | CCG\|ATATCCTTCC...TGGACCTTAATC/ATGGACCTTAAT...TTCAC\|ATA | 2 | 1 | 72.094 |
| 116462313 | GT-AG | 0 | 1.000000099473604e-05 | 724 | 21550622 (rna-XM_021643432.1) | 18 | 3446909 | 3447632 | 10047 (Meriones unguiculatus) | CTG\|GTAAGTGCCT...CTGCCTTTCACC/CTGCCTTTCACC...CTTAG\|CCT | 2 | 1 | 74.644 |
| 116462314 | GT-AG | 0 | 1.000000099473604e-05 | 441 | 21550622 (rna-XM_021643432.1) | 19 | 3447819 | 3448259 | 10047 (Meriones unguiculatus) | CAG\|GTGAGGAGCC...TACTTCTCACCA/ATACTTCTCACC...TCCAG\|GCC | 2 | 1 | 79.742 |
| 116462315 | GT-AG | 0 | 1.000000099473604e-05 | 132 | 21550622 (rna-XM_021643432.1) | 20 | 3448345 | 3448476 | 10047 (Meriones unguiculatus) | GAG\|GTATGGACTG...TGTTCTGTTTCC/GGAGGCTGCAAT...CCCAG\|ATT | 0 | 1 | 82.072 |
| 116462316 | GT-AG | 0 | 1.000000099473604e-05 | 981 | 21550622 (rna-XM_021643432.1) | 21 | 3448572 | 3449552 | 10047 (Meriones unguiculatus) | CTG\|GTGAGAAGGA...TGTTGCTTCTCT/CATATGGTCAGC...TGCAG\|GGT | 2 | 1 | 84.677 |
| 116462317 | GT-AG | 0 | 0.0002104164847689 | 102 | 21550622 (rna-XM_021643432.1) | 22 | 3449632 | 3449733 | 10047 (Meriones unguiculatus) | AAG\|GTAACTGCTC...CCTTCCTCAGCA/CCCTTCCTCAGC...TGCAG\|CAC | 0 | 1 | 86.842 |
| 116462318 | GT-AG | 0 | 1.000000099473604e-05 | 465 | 21550622 (rna-XM_021643432.1) | 23 | 3449836 | 3450300 | 10047 (Meriones unguiculatus) | GAG\|GTGGGTGATG...AATCACTTATCA/TTATCACTAACC...TACAG\|GGC | 0 | 1 | 89.638 |
| 116462319 | GT-AG | 0 | 1.000000099473604e-05 | 639 | 21550622 (rna-XM_021643432.1) | 24 | 3450399 | 3451037 | 10047 (Meriones unguiculatus) | CCT\|GTGAGTTAGC...TCTGCCTTCATC/TCTGCCTTCATC...CTCAG\|AGG | 2 | 1 | 92.325 |
| 116462320 | GT-AG | 0 | 1.8932089516524413e-05 | 578 | 21550622 (rna-XM_021643432.1) | 25 | 3451167 | 3451744 | 10047 (Meriones unguiculatus) | GGT\|GTAAGTGCTG...TAGCCCTGGACT/CCTGGACTCACT...ACCAG\|GCA | 2 | 1 | 95.861 |
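The rows above can be explored with ordinary SQL. The following is a minimal sketch, assuming a throwaway in-memory SQLite database loaded with four rows copied from the table (introns 1, 16, 17, and 18) and only a subset of the columns; it is not the database's own tooling. It picks out the minor intron of transcript 21550622 and checks that, for the rows copied here, `length` equals `end - start + 1`.

```python
import sqlite3

# Four rows copied from the table above (subset of columns):
# id, dinucleotide_pair, is_minor, score, length,
# transcript_id, ordinal_index, start, end
ROWS = [
    (116462296, "GT-AG", 0, 1.000000099473604e-05, 52341, 21550622, 1, 3341381, 3393721),
    (116462311, "GT-AG", 0, 1.000000099473604e-05, 3247, 21550622, 16, 3441462, 3444708),
    (116462312, "AT-AC", 1, 99.99999997234671, 2022, 21550622, 17, 3444794, 3446815),
    (116462313, "GT-AG", 0, 1.000000099473604e-05, 724, 21550622, 18, 3446909, 3447632),
]

# Illustrative in-memory copy; the real table has more columns and rows.
conn = sqlite3.connect(":memory:")
conn.execute(
    """
    CREATE TABLE introns (
        id INTEGER PRIMARY KEY,
        dinucleotide_pair TEXT,
        is_minor INTEGER,
        score REAL,
        length INTEGER,
        transcript_id INTEGER,
        ordinal_index INTEGER,
        "start" INTEGER,
        "end" INTEGER
    )
    """
)
conn.executemany("INSERT INTO introns VALUES (?, ?, ?, ?, ?, ?, ?, ?, ?)", ROWS)

# Minor introns of the transcript, in transcript order.
minor = conn.execute(
    """
    SELECT ordinal_index, dinucleotide_pair, score
    FROM introns
    WHERE transcript_id = ? AND is_minor = 1
    ORDER BY ordinal_index
    """,
    (21550622,),
).fetchall()

# Rows whose stored length disagrees with the inclusive coordinate span.
mismatched = conn.execute(
    'SELECT id FROM introns WHERE length != "end" - "start" + 1'
).fetchall()

print(minor)
print(mismatched)
```

In this sample the only minor intron is the 17th (AT-AC), and no copied row has a length that disagrees with its coordinates. Whether the inclusive-coordinate relationship holds across the whole database is not stated by the source.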
CREATE TABLE "introns" (
    "id" INTEGER,
    "dinucleotide_pair" TEXT,
    "is_minor" INTEGER,
    "score" REAL,
    "length" INTEGER,
    "transcript_id" INTEGER,
    "ordinal_index" INTEGER,
    "start" INTEGER,
    "end" INTEGER,
    "taxonomy_id" INTEGER,
    "scored_motifs" TEXT,
    "phase" INTEGER,
    "in_cds" INTEGER,
    "relative_position" REAL,
    PRIMARY KEY ([id]),
    FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
    FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
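The indexes above are what make filters such as `transcript_id = 21550622` cheap. As a rough sketch, assuming an empty in-memory SQLite database built from the schema shown here (with only two of the indexes recreated, and without the referenced `transcripts` and `genomes` tables, which SQLite does not require at creation time), `EXPLAIN QUERY PLAN` can be used to see which index the planner chooses. The exact wording of the plan text varies between SQLite versions, so the example only looks for the index name.

```python
import sqlite3

# Schema copied from the listing above; only two of the indexes are recreated.
SCHEMA = """
CREATE TABLE "introns" (
    "id" INTEGER,
    "dinucleotide_pair" TEXT,
    "is_minor" INTEGER,
    "score" REAL,
    "length" INTEGER,
    "transcript_id" INTEGER,
    "ordinal_index" INTEGER,
    "start" INTEGER,
    "end" INTEGER,
    "taxonomy_id" INTEGER,
    "scored_motifs" TEXT,
    "phase" INTEGER,
    "in_cds" INTEGER,
    "relative_position" REAL,
    PRIMARY KEY ([id]),
    FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
    FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_is_minor] ON [introns] ([is_minor]);
"""

conn = sqlite3.connect(":memory:")
conn.executescript(SCHEMA)

# Ask SQLite how it would run the page's filter; the last column of each
# plan row holds the human-readable description.
plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT * FROM introns WHERE transcript_id = ?",
    (21550622,),
).fetchall()
plan_text = " ".join(row[-1] for row in plan)

uses_transcript_index = "idx_introns_transcript_id" in plan_text
print(plan_text)
print(uses_transcript_index)
```

If the index name does not appear in the plan, the query would fall back to a full table scan, which is worth checking before running filters against the complete table.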