introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, probability (0-100%) that the intron is minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
30 rows where transcript_id = 34880193
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 196362061 | GT-AG | 0 | 1.000000099473604e-05 | 123999 | rna-XM_026650398.2 34880193 | 2 | 4121951 | 4245949 | Terrapene carolina 158814 | AAG\|GTAAGAGCCG...TTATTTTTACCA/GTTATTTTTACC...TTCAG\|ACA | 0 | 1 | 9.143 |
| 196362062 | GT-AG | 0 | 1.000000099473604e-05 | 1585 | rna-XM_026650398.2 34880193 | 3 | 4120255 | 4121839 | Terrapene carolina 158814 | AAG\|GTAAGGATGT...GTGTGCTTAAAA/GTTTGTGTTATT...CACAG\|TCT | 0 | 1 | 12.026 |
| 196362063 | GT-AG | 0 | 0.0003256444978603 | 47948 | rna-XM_026650398.2 34880193 | 4 | 4072226 | 4120173 | Terrapene carolina 158814 | ACG\|GTATGGTGCC...GAATCATTAAAT/ATAATATTCAGA...TTCAG\|ATA | 0 | 1 | 14.13 |
| 196362064 | GT-AG | 0 | 1.000000099473604e-05 | 333 | rna-XM_026650398.2 34880193 | 5 | 4071788 | 4072120 | Terrapene carolina 158814 | GAG\|GTAAGATAAT...TTTGTCTTAACT/TTTGTCTTAACT...ACTAG\|ACC | 0 | 1 | 16.857 |
| 196362065 | GT-AG | 0 | 1.000000099473604e-05 | 12127 | rna-XM_026650398.2 34880193 | 6 | 4059528 | 4071654 | Terrapene carolina 158814 | AAG\|GTAAAGTCAA...ACATTTGTGTCT/GAGTAGCTAATA...TCTAG\|TGT | 1 | 1 | 20.312 |
| 196362066 | GT-AG | 0 | 0.0002672941370789 | 719 | rna-XM_026650398.2 34880193 | 7 | 4058702 | 4059420 | Terrapene carolina 158814 | GGT\|GTAAGTATTA...TTAACTTTATAA/TTTAACTTTATA...CACAG\|ATA | 0 | 1 | 23.091 |
| 196362067 | GT-AG | 0 | 1.000000099473604e-05 | 915 | rna-XM_026650398.2 34880193 | 8 | 4057661 | 4058575 | Terrapene carolina 158814 | ATG\|GTAGGTACCA...ATTTATTTAACT/ATTTATTTAACT...CTCAG\|GTT | 0 | 1 | 26.364 |
| 196362068 | GT-AG | 0 | 3.950637847748118e-05 | 1863 | rna-XM_026650398.2 34880193 | 9 | 4055695 | 4057557 | Terrapene carolina 158814 | ATG\|GTACGTAGTA...AAGATCTTATTT/ATTTATTTCATT...CACAG\|GAT | 1 | 1 | 29.039 |
| 196362069 | GT-AG | 0 | 1.000000099473604e-05 | 3072 | rna-XM_026650398.2 34880193 | 10 | 4052460 | 4055531 | Terrapene carolina 158814 | AGG\|GTAATGAACT...TATCCATTAATG/TGCATTCTAACT...TTTAG\|TTT | 2 | 1 | 33.273 |
| 196362070 | GT-AG | 0 | 0.022572847674702 | 956 | rna-XM_026650398.2 34880193 | 11 | 4051389 | 4052344 | Terrapene carolina 158814 | AGT\|GTATGTACTT...GTTCTTTTATTT/AGTTCTTTTATT...TTCAG\|TGT | 0 | 1 | 36.26 |
| 196362071 | GT-AG | 0 | 1.000000099473604e-05 | 2157 | rna-XM_026650398.2 34880193 | 12 | 4049092 | 4051248 | Terrapene carolina 158814 | CAG\|GTAAAAATAT...CTGCTTCTATTT/AAGAATCTGATA...TTCAG\|ATG | 2 | 1 | 39.896 |
| 196362072 | GT-AG | 0 | 1.000000099473604e-05 | 2733 | rna-XM_026650398.2 34880193 | 13 | 4046271 | 4049003 | Terrapene carolina 158814 | CAG\|GTGAGGCTTT...TCATTTTTGCTG/TGAAGACTCATT...TTCAG\|ATA | 0 | 1 | 42.182 |
| 196362073 | GT-AG | 0 | 1.000000099473604e-05 | 3242 | rna-XM_026650398.2 34880193 | 14 | 4042943 | 4046184 | Terrapene carolina 158814 | CAA\|GTGAGTTGAT...TTCTTTTTATTT/TTTCTTTTTATT...CAAAG\|GAT | 2 | 1 | 44.416 |
| 196362074 | GT-AG | 0 | 1.000000099473604e-05 | 1946 | rna-XM_026650398.2 34880193 | 15 | 4040897 | 4042842 | Terrapene carolina 158814 | AAA\|GTAAGAACTT...TTTTTCTTTTTG/GTATGTGTAACT...GTTAG\|TGT | 0 | 1 | 47.013 |
| 196362075 | GT-AG | 0 | 5.435222649581487e-05 | 1808 | rna-XM_026650398.2 34880193 | 16 | 4038812 | 4040619 | Terrapene carolina 158814 | AAG\|GTGCACATCA...GAAATTTTAACT/GAAATTTTAACT...TGCAG\|TTG | 1 | 1 | 54.208 |
| 196362076 | GT-AG | 0 | 1.000000099473604e-05 | 7064 | rna-XM_026650398.2 34880193 | 17 | 4031619 | 4038682 | Terrapene carolina 158814 | CAA\|GTAAGTAAAT...TTCTTCTTCTTT/TTAGTTCTAACT...CATAG\|TTA | 1 | 1 | 57.558 |
| 196362077 | GT-AG | 0 | 1.000000099473604e-05 | 1799 | rna-XM_026650398.2 34880193 | 18 | 4029685 | 4031483 | Terrapene carolina 158814 | CAG\|GTAATGATAT...GTTATATTAACA/GTTATATTAACA...TTCAG\|GTT | 1 | 1 | 61.065 |
| 196362078 | GT-AG | 0 | 1.000000099473604e-05 | 777 | rna-XM_026650398.2 34880193 | 19 | 4028790 | 4029566 | Terrapene carolina 158814 | ACT\|GTAAGAAAAT...GGATACTTACAT/TGGATACTTACA...CACAG\|GGA | 2 | 1 | 64.13 |
| 196362079 | GT-AG | 0 | 1.000000099473604e-05 | 161 | rna-XM_026650398.2 34880193 | 20 | 4028518 | 4028678 | Terrapene carolina 158814 | CCG\|GTAATGGAGT...GAATTTTTACTT/TTTTTACTTACT...TCCAG\|AAG | 2 | 1 | 67.013 |
| 196362080 | GT-AG | 0 | 1.1389659613727774e-05 | 3271 | rna-XM_026650398.2 34880193 | 21 | 4025150 | 4028420 | Terrapene carolina 158814 | GAG\|GTAATATTTA...TTTGTCTTACAT/ATTTGTCTTACA...TACAG\|TGT | 0 | 1 | 69.532 |
| 196362081 | GT-AG | 0 | 1.3683305773384576e-05 | 4281 | rna-XM_026650398.2 34880193 | 22 | 4020761 | 4025041 | Terrapene carolina 158814 | GAT\|GTAAGTAACA...ATTTTCTTTTCC/CTAATATTGATA...CTCAG\|ATA | 0 | 1 | 72.338 |
| 196362082 | GT-AG | 0 | 0.0004128260602777 | 286 | rna-XM_026650398.2 34880193 | 23 | 4020361 | 4020646 | Terrapene carolina 158814 | CAG\|GTATGTGTCT...ATTTCATTAATA/GTCAATTTCATT...TTCAG\|TGT | 0 | 1 | 75.299 |
| 196362083 | GT-AG | 0 | 3.927636936215316e-05 | 1703 | rna-XM_026650398.2 34880193 | 24 | 4018523 | 4020225 | Terrapene carolina 158814 | AGA\|GTAAGTTATA...TGTATCATACCA/TAATGTATCATA...ATCAG\|GCT | 0 | 1 | 78.805 |
| 196362084 | GT-AG | 0 | 1.000000099473604e-05 | 9631 | rna-XM_026650398.2 34880193 | 25 | 4008725 | 4018355 | Terrapene carolina 158814 | CAG\|GTACAGTAGA...TTTTTCATATTT/CTGTTTTTCATA...TGTAG\|AAT | 2 | 1 | 83.143 |
| 196362085 | GT-AG | 0 | 1.000000099473604e-05 | 1407 | rna-XM_026650398.2 34880193 | 26 | 4007212 | 4008618 | Terrapene carolina 158814 | GAG\|GTAAAGGAAC...ACTTATTTGACT/GTTTAACTTATT...CCCAG\|AGT | 0 | 1 | 85.896 |
| 196362086 | GT-AG | 0 | 1.000000099473604e-05 | 15612 | rna-XM_026650398.2 34880193 | 27 | 3991507 | 4007118 | Terrapene carolina 158814 | TTG\|GTAAGAAGAT...CATTCTTTTTCT/AGCCATTTAACA...AACAG\|CAA | 0 | 1 | 88.312 |
| 196362087 | GT-AG | 0 | 1.000000099473604e-05 | 123 | rna-XM_026650398.2 34880193 | 28 | 3991264 | 3991386 | Terrapene carolina 158814 | GAG\|GTAGGCCACT...GCAATTTTAAAG/GCAATTTTAAAG...TGCAG\|GAT | 0 | 1 | 91.429 |
| 196362088 | GT-AG | 0 | 0.0096618778167446 | 1478 | rna-XM_026650398.2 34880193 | 29 | 3989677 | 3991154 | Terrapene carolina 158814 | CTG\|GTATGTGTTT...TTGCTCTTAACT/TTGCTCTTAACT...GGTAG\|TTC | 1 | 1 | 94.26 |
| 196362089 | GT-AG | 0 | 0.0004900863283941 | 7520 | rna-XM_026650398.2 34880193 | 30 | 3982026 | 3989545 | Terrapene carolina 158814 | AAG\|GTATTTCCTT...CTCTCCTTTTCT/TCTTTTGTAAAT...TGTAG\|GAC | 0 | 1 | 97.662 |
| 196362409 | GT-AG | 0 | 1.000000099473604e-05 | 2407 | rna-XM_026650398.2 34880193 | 1 | 4246133 | 4248539 | Terrapene carolina 158814 | TTG\|GTGTGTGTTT...TGTGCTCTCTCT/CCCGGGCTGAGC...GGCAG\|GAG |  | 0 | 4.701 |
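In the rows above, `length` appears to be the inclusive span of the coordinates, i.e. `end - start + 1`. The following is a minimal sketch that checks this on three rows copied from the table; the helper name `inclusive_length` is hypothetical and not part of the database, and the inclusive convention is inferred from the sample rows rather than stated by the source.

```python
# Rows copied from the table above: (id, start, end, length).
rows = [
    (196362061, 4121951, 4245949, 123999),
    (196362062, 4120255, 4121839, 1585),
    (196362087, 3991264, 3991386, 123),
]


def inclusive_length(start: int, end: int) -> int:
    """Length of an interval whose start and end are both included
    (an assumption inferred from the sample rows)."""
    return end - start + 1


# Collect the ids of any rows where the stored length disagrees.
mismatches = [
    row_id
    for row_id, start, end, length in rows
    if inclusive_length(start, end) != length
]
print(mismatches)  # prints [] for these three rows
```

The `start` values also fall as `ordinal_index` rises, which would be consistent with a transcript annotated on the reverse strand, although this table does not record strand.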
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
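The listing for transcript 34880193 is sorted by `id`, so the first intron (id 196362409) appears last. Below is a minimal sketch of fetching a transcript's introns in transcript order with Python's standard `sqlite3` module. It assumes an in-memory database with a trimmed copy of the schema (only the columns used here, no foreign keys) and three rows copied from the table; the function name `introns_for_transcript` is hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Trimmed copy of the schema: only the columns this sketch uses, and no
# foreign keys, so the parent tables are not needed.
conn.executescript(
    """
    CREATE TABLE introns (
        id INTEGER PRIMARY KEY,
        length INTEGER,
        transcript_id INTEGER,
        ordinal_index INTEGER,
        phase INTEGER,
        in_cds INTEGER
    );
    CREATE INDEX idx_introns_transcript_id ON introns (transcript_id);
    """
)

# Three rows copied from the table, inserted out of transcript order.
conn.executemany(
    "INSERT INTO introns VALUES (?, ?, ?, ?, ?, ?)",
    [
        (196362062, 1585, 34880193, 3, 0, 1),
        (196362061, 123999, 34880193, 2, 0, 1),
        (196362065, 12127, 34880193, 6, 1, 1),
    ],
)


def introns_for_transcript(conn, transcript_id):
    """Return (ordinal_index, id, length) tuples in transcript order."""
    cur = conn.execute(
        "SELECT ordinal_index, id, length FROM introns "
        "WHERE transcript_id = ? ORDER BY ordinal_index",
        (transcript_id,),
    )
    return cur.fetchall()


result = introns_for_transcript(conn, 34880193)
print(result)
# [(2, 196362061, 123999), (3, 196362062, 1585), (6, 196362065, 12127)]
```

The query filters on `transcript_id`, which is one of the indexed columns in the schema, and passes the value as a bound parameter rather than formatting it into the SQL string.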