introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
34 rows where transcript_id = 5530409
This data as json, CSV (advanced)
Suggested facets: score, phase, in_cds
| id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 28484042 | GT-AG | 0 | 7.949173718784976e-05 | 64085 | rna-XM_044272365.1 5530409 | 2 | 448941013 | 449005097 | Bufo gargarizans 30331 | AAA|GTAAGTCTAG...TGTTTCTTATCT/ATGTTTCTTATC...TCCAG|CTC | 1 | 1 | 7.023 |
| 28484043 | GT-AG | 0 | 3.221081425506303e-05 | 4406 | rna-XM_044272365.1 5530409 | 3 | 449005244 | 449009649 | Bufo gargarizans 30331 | GAG|GTATTAGGTC...ACAGCTTTACCT/GACAGCTTTACC...CACAG|GTT | 0 | 1 | 9.407 |
| 28484044 | GT-AG | 0 | 1.000000099473604e-05 | 39459 | rna-XM_044272365.1 5530409 | 4 | 449009792 | 449049250 | Bufo gargarizans 30331 | GGG|GTAAGAAACC...TGGTCCTTGAGT/ACCGTTCTGATT...TTCAG|AGG | 1 | 1 | 11.726 |
| 28484045 | GT-AG | 0 | 1.000000099473604e-05 | 3522 | rna-XM_044272365.1 5530409 | 5 | 449049440 | 449052961 | Bufo gargarizans 30331 | CAG|GTAAGGGGTC...TGTATTTTATAT/CCCTATCTGACT...GACAG|GTG | 1 | 1 | 14.813 |
| 28484046 | GT-AG | 0 | 1.000000099473604e-05 | 1071 | rna-XM_044272365.1 5530409 | 6 | 449053073 | 449054143 | Bufo gargarizans 30331 | GAG|GTAGGAATGA...CTTTCCTCATTT/CCTTTCCTCATT...CTCAG|AGC | 1 | 1 | 16.626 |
| 28484047 | GT-AG | 0 | 1.000000099473604e-05 | 5671 | rna-XM_044272365.1 5530409 | 7 | 449054156 | 449059826 | Bufo gargarizans 30331 | AAG|GTTGGTGTGT...CCTTTCTTATTT/TCCTTTCTTATT...TTCAG|TTC | 1 | 1 | 16.822 |
| 28484048 | GT-AG | 0 | 1.000000099473604e-05 | 4071 | rna-XM_044272365.1 5530409 | 8 | 449060097 | 449064167 | Bufo gargarizans 30331 | AAG|GTAAGAGCTT...TTGTATTTAATC/TTGTATTTAATC...TATAG|CCT | 1 | 1 | 21.231 |
| 28484049 | GT-AG | 0 | 1.000000099473604e-05 | 1699 | rna-XM_044272365.1 5530409 | 9 | 449064750 | 449066448 | Bufo gargarizans 30331 | GAG|GTAGGTTAAG...AAGCATTTATTT/GAAGCATTTATT...GGCAG|TTC | 1 | 1 | 30.737 |
| 28484050 | GT-AG | 0 | 0.0002019502397912 | 932 | rna-XM_044272365.1 5530409 | 10 | 449066583 | 449067514 | Bufo gargarizans 30331 | GAG|GTAAATTGTG...ATTTTTTTAACT/ATTTTTTTAACT...TTCAG|CAG | 0 | 1 | 32.925 |
| 28484051 | GT-AG | 0 | 0.0001934682063609 | 2441 | rna-XM_044272365.1 5530409 | 11 | 449067660 | 449070100 | Bufo gargarizans 30331 | CAA|GTAGGTCTCT...CTACCCTTATAC/TTATACTTCAAC...ATCAG|AGC | 1 | 1 | 35.293 |
| 28484052 | GT-AG | 0 | 0.039590455993436 | 742 | rna-XM_044272365.1 5530409 | 12 | 449070407 | 449071148 | Bufo gargarizans 30331 | ATG|GTATGTTTCC...ATTCTTTTATCA/TATTCTTTTATC...TCCAG|TTC | 1 | 1 | 40.291 |
| 28484053 | GT-AG | 0 | 1.000000099473604e-05 | 4206 | rna-XM_044272365.1 5530409 | 13 | 449071343 | 449075548 | Bufo gargarizans 30331 | CAG|GTAAATAAAA...AATTTTTTACTC/CAATTTTTTACT...AACAG|TGG | 0 | 1 | 43.459 |
| 28484054 | GT-AG | 0 | 1.000000099473604e-05 | 4586 | rna-XM_044272365.1 5530409 | 14 | 449075576 | 449080161 | Bufo gargarizans 30331 | CAT|GTGAGTAATG...TCTGCCTTCCTC/TGGAGACTGACT...TTCAG|GAA | 0 | 1 | 43.9 |
| 28484055 | GT-AG | 0 | 1.000000099473604e-05 | 2485 | rna-XM_044272365.1 5530409 | 15 | 449080280 | 449082764 | Bufo gargarizans 30331 | CAG|GTGATCCTCT...ATTACATTAAAA/CTTTAATTCACA...TTCAG|TTC | 1 | 1 | 45.827 |
| 28484056 | GT-AG | 0 | 1.000000099473604e-05 | 362 | rna-XM_044272365.1 5530409 | 16 | 449083353 | 449083714 | Bufo gargarizans 30331 | AAG|GTAGGATTTG...ATGTTTTTACTT/AATGTTTTTACT...TCCAG|CAG | 1 | 1 | 55.43 |
| 28484057 | GT-AG | 0 | 2.626304178874567e-05 | 1608 | rna-XM_044272365.1 5530409 | 17 | 449083816 | 449085423 | Bufo gargarizans 30331 | AAA|GTAAGTTGTT...AATTTTTTTTCT/TTAATATTCAAA...TCAAG|ATT | 0 | 1 | 57.08 |
| 28484058 | GT-AG | 0 | 1.000000099473604e-05 | 1274 | rna-XM_044272365.1 5530409 | 18 | 449085684 | 449086957 | Bufo gargarizans 30331 | AAA|GTAAGTCTCC...TCTTCTGTAGCA/TAATTACTAATT...TACAG|AGG | 2 | 1 | 61.326 |
| 28484059 | GT-AG | 0 | 1.000000099473604e-05 | 20593 | rna-XM_044272365.1 5530409 | 19 | 449087049 | 449107641 | Bufo gargarizans 30331 | GAG|GTAATATAGG...GAAATTTTATTT/AGAAATTTTATT...TCCAG|CTT | 0 | 1 | 62.812 |
| 28484060 | GT-AG | 0 | 1.000000099473604e-05 | 4062 | rna-XM_044272365.1 5530409 | 20 | 449107852 | 449111913 | Bufo gargarizans 30331 | CTG|GTGAGTAGAT...TAAGCTTTAAAT/TCTCTCTTTATT...TTCAG|ACA | 0 | 1 | 66.242 |
| 28484061 | GT-AG | 0 | 2.709804601154505e-05 | 7995 | rna-XM_044272365.1 5530409 | 21 | 449112075 | 449120069 | Bufo gargarizans 30331 | AAG|GTAACATAAT...ATCTTTTCAATT/TCAATTTTCATG...TACAG|GAA | 2 | 1 | 68.871 |
| 28484062 | GT-AG | 0 | 1.000000099473604e-05 | 12525 | rna-XM_044272365.1 5530409 | 22 | 449120183 | 449132707 | Bufo gargarizans 30331 | CAG|GTGAGAAGTG...CATTTTTTATTT/TTTTTATTTATA...TTCAG|GTT | 1 | 1 | 70.717 |
| 28484063 | GT-AG | 0 | 1.000000099473604e-05 | 458 | rna-XM_044272365.1 5530409 | 23 | 449132750 | 449133207 | Bufo gargarizans 30331 | CAA|GTTAGTTTGC...ATGCCATTATCT/CATTATCTAAGC...TATAG|GCA | 1 | 1 | 71.403 |
| 28484064 | GT-AG | 0 | 0.2202126034481037 | 21242 | rna-XM_044272365.1 5530409 | 24 | 449133306 | 449154547 | Bufo gargarizans 30331 | GAG|GTATCTAATT...TTTTCTTTTACT/CATGTTCTCATT...TACAG|TCT | 0 | 1 | 73.003 |
| 28484065 | GT-AG | 0 | 1.000000099473604e-05 | 11201 | rna-XM_044272365.1 5530409 | 25 | 449154672 | 449165872 | Bufo gargarizans 30331 | AAG|GTAAGCTGAT...CCCTTCTTGTTT/AAGGATTTTATA...TCCAG|GCA | 1 | 1 | 75.029 |
| 28484066 | GT-AG | 0 | 1.4907944125641004e-05 | 17704 | rna-XM_044272365.1 5530409 | 26 | 449166049 | 449183752 | Bufo gargarizans 30331 | AGG|GTAAATATGT...TTTATTTTACCA/TTTTATTTTACC...TGTAG|ATA | 0 | 1 | 77.903 |
| 28484067 | GT-AG | 0 | 1.000000099473604e-05 | 1551 | rna-XM_044272365.1 5530409 | 27 | 449183873 | 449185423 | Bufo gargarizans 30331 | AAG|GTAAGCCTGC...TGTCTTTTGGTA/GTAAGATTTATA...GACAG|AAT | 0 | 1 | 79.863 |
| 28484068 | GT-AG | 0 | 4.270439468276307e-05 | 6271 | rna-XM_044272365.1 5530409 | 28 | 449185579 | 449191849 | Bufo gargarizans 30331 | CAG|GTACAATTCA...TTAGTGTTGACC/TTAGTGTTGACC...TCTAG|TGC | 2 | 1 | 82.394 |
| 28484069 | GT-AG | 0 | 1.000000099473604e-05 | 406 | rna-XM_044272365.1 5530409 | 29 | 449192136 | 449192541 | Bufo gargarizans 30331 | AAG|GTACAAATTG...GGTTTTTTGTTT/AGGTTGCTAAAA...TACAG|CGC | 0 | 1 | 87.065 |
| 28484070 | GT-AG | 0 | 1.2987700603135911e-05 | 4185 | rna-XM_044272365.1 5530409 | 30 | 449192721 | 449196905 | Bufo gargarizans 30331 | TAG|GTAAGCGTGG...TTTTTCTTTCTT/ATGTAATTTATA...AACAG|ACA | 2 | 1 | 89.989 |
| 28484071 | GT-AG | 0 | 7.924340670383999e-05 | 1923 | rna-XM_044272365.1 5530409 | 31 | 449197033 | 449198955 | Bufo gargarizans 30331 | AGA|GTAAGTATCT...CAGGTTTTAACT/CAGGTTTTAACT...AACAG|GAA | 0 | 1 | 92.063 |
| 28484072 | GT-AG | 0 | 1.000000099473604e-05 | 5073 | rna-XM_044272365.1 5530409 | 32 | 449199082 | 449204154 | Bufo gargarizans 30331 | CGG|GTAAGATACC...CATGCTTTACTA/ATTTAATTTACC...TGCAG|GAT | 0 | 1 | 94.121 |
| 28484073 | GT-AG | 0 | 1.000000099473604e-05 | 2585 | rna-XM_044272365.1 5530409 | 33 | 449204310 | 449206894 | Bufo gargarizans 30331 | CAG|GTAAATCTCC...TCATCCTTCTCT/CCTATTATCATC...AACAG|TGC | 2 | 1 | 96.652 |
| 28484074 | GT-AG | 0 | 1.000000099473604e-05 | 10905 | rna-XM_044272365.1 5530409 | 34 | 449207031 | 449217935 | Bufo gargarizans 30331 | GAG|GTAAGTAGAA...TTTTCCTTTTCT/CCTTTTCTGATT...TTTAG|GAC | 0 | 1 | 98.873 |
| 28505559 | GT-AG | 0 | 3.383136253284222e-05 | 87152 | rna-XM_044272365.1 5530409 | 1 | 448853692 | 448940843 | Bufo gargarizans 30331 | CAG|GTACTGCTTA...ATAGTCTTATCT/CATAGTCTTATC...TACAG|GCT | 0 | 5.634 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL
,PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);