introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, probability (0-100%) that the intron is a minor intron
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
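As a rough illustration of how these columns fit together, the sketch below loads three rows from this page into an in-memory SQLite table and lists a transcript's coding-sequence introns in transcript order. The table here carries only a subset of the columns, and the variable names (`sample`, `ordered`, `minor_count`) are chosen for this example only.

```python
import sqlite3

# In-memory table with a subset of the columns described above.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE introns (id INTEGER PRIMARY KEY, dinucleotide_pair TEXT, "
    "is_minor INTEGER, score REAL, length INTEGER, transcript_id INTEGER, "
    "ordinal_index INTEGER, phase INTEGER, in_cds INTEGER)"
)

# Three rows copied from this page, deliberately inserted out of order.
sample = [
    (133512802, "GT-AG", 0, 3.933793937079712e-05, 4625, 24436960, 3, 0, 1),
    (133512800, "GT-AG", 0, 1.000000099473604e-05, 30499, 24436960, 1, 2, 1),
    (133512801, "GT-AG", 0, 0.0001542345485482, 17629, 24436960, 2, 2, 1),
]
conn.executemany("INSERT INTO introns VALUES (?, ?, ?, ?, ?, ?, ?, ?, ?)", sample)

# Coding-sequence introns of one transcript, in transcript order.
ordered = conn.execute(
    "SELECT ordinal_index, dinucleotide_pair, length, phase FROM introns "
    "WHERE transcript_id = ? AND in_cds = 1 ORDER BY ordinal_index",
    (24436960,),
).fetchall()

# Number of introns in this sample that are flagged as minor.
minor_count = conn.execute(
    "SELECT COUNT(*) FROM introns WHERE transcript_id = ? AND is_minor = 1",
    (24436960,),
).fetchone()[0]

print(ordered, minor_count)
```

Filtering on `is_minor` relies on the stored classification; a threshold on `score` could be used instead, but the appropriate cutoff is not stated on this page.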
25 rows where transcript_id = 24436960
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 133512800 | GT-AG | 0 | 1.000000099473604e-05 | 30499 | rna-XM_029783764.2 24436960 | 1 | 155258373 | 155288871 | Octopus sinensis 2607531 | CTG\|GTGAGTAGTA...TTTGTCTTAATT/TTTGTCTTAATT...TGTAG\|GTG | 2 | 1 | 1.845 |
| 133512801 | GT-AG | 0 | 0.0001542345485482 | 17629 | rna-XM_029783764.2 24436960 | 2 | 155240633 | 155258261 | Octopus sinensis 2607531 | AAG\|GTACGTATTG...TTTGTTTTATTT/TATTTATTCATT...AACAG\|TGA | 2 | 1 | 4.612 |
| 133512802 | GT-AG | 0 | 3.933793937079712e-05 | 4625 | rna-XM_029783764.2 24436960 | 3 | 155235890 | 155240514 | Octopus sinensis 2607531 | GAT\|GTAAGTATAT...CATTTTTTATCT/TCATTTTTTATC...TTCAG\|GAA | 0 | 1 | 7.554 |
| 133512803 | GT-AG | 0 | 1.000000099473604e-05 | 1202 | rna-XM_029783764.2 24436960 | 4 | 155234472 | 155235673 | Octopus sinensis 2607531 | AAG\|GTAATTTCAA...ATTCACTTATAT/TCAATATTCACT...TTCAG\|ATG | 0 | 1 | 12.939 |
| 133512804 | GT-AG | 0 | 4.082354934211815e-05 | 6301 | rna-XM_029783764.2 24436960 | 5 | 155228081 | 155234381 | Octopus sinensis 2607531 | CAA\|GTAAGTCTTT...CTCTTTTTATTT/TATTTGTTAATT...TTCAG\|GTT | 0 | 1 | 15.183 |
| 133512805 | GT-AG | 0 | 1.000000099473604e-05 | 3774 | rna-XM_029783764.2 24436960 | 6 | 155224143 | 155227916 | Octopus sinensis 2607531 | CAG\|GTAATTCTCA...TCATTGTTGATT/TCATTGTTGATT...TGCAG\|GTG | 2 | 1 | 19.272 |
| 133512806 | GT-AG | 0 | 2.5127876576412624e-05 | 17901 | rna-XM_029783764.2 24436960 | 7 | 155206116 | 155224016 | Octopus sinensis 2607531 | AAG\|GTTTGTATAC...GGTACTTTATTT/TTTTGTTTTACT...TTCAG\|ACT | 2 | 1 | 22.413 |
| 133512807 | GT-AG | 0 | 1.000000099473604e-05 | 32150 | rna-XM_029783764.2 24436960 | 8 | 155173719 | 155205868 | Octopus sinensis 2607531 | CAG\|GTAAAATGGA...GCAGTCTTAATT/TTAATTTTCATT...TGCAG\|GAT | 0 | 1 | 28.571 |
| 133512808 | GT-AG | 0 | 1.000000099473604e-05 | 2383 | rna-XM_029783764.2 24436960 | 9 | 155171225 | 155173607 | Octopus sinensis 2607531 | AAA\|GTAAGAATCT...TACATTTTGATT/TACATTTTGATT...CTTAG\|GAT | 0 | 1 | 31.339 |
| 133512809 | GT-AG | 0 | 1.000000099473604e-05 | 1777 | rna-XM_029783764.2 24436960 | 10 | 155169352 | 155171128 | Octopus sinensis 2607531 | AAG\|GTAATATTGT...TGTATTTTACCA/TGTTTATTTACT...TCCAG\|TCT | 0 | 1 | 33.732 |
| 133512810 | GT-AG | 0 | 1.000000099473604e-05 | 3744 | rna-XM_029783764.2 24436960 | 11 | 155165416 | 155169159 | Octopus sinensis 2607531 | AAG\|GTAAATATAT...TGCTTCATATCA/CATTGCTTCATA...TTAAG\|GAT | 0 | 1 | 38.519 |
| 133512811 | GT-AG | 0 | 0.0001564046659028 | 1902 | rna-XM_029783764.2 24436960 | 12 | 155163332 | 155165233 | Octopus sinensis 2607531 | TAA\|GTAAGTTTTC...TCTTTCTTTCCT/GTTCATCTAATC...TATAG\|TTG | 2 | 1 | 43.057 |
| 133512812 | GT-AG | 0 | 0.0144440005663587 | 2118 | rna-XM_029783764.2 24436960 | 13 | 155161117 | 155163234 | Octopus sinensis 2607531 | CAG\|GTATACTTAT...ATAATTTTAAGA/AAGATTTTCATG...TTCAG\|GGA | 0 | 1 | 45.475 |
| 133512813 | GT-AG | 0 | 1.000000099473604e-05 | 4190 | rna-XM_029783764.2 24436960 | 14 | 155156714 | 155160903 | Octopus sinensis 2607531 | ATT\|GTAAGAATTT...TATATTTTACCT/TTATATTTTACC...ATTAG\|GAA | 0 | 1 | 50.785 |
| 133512814 | GT-AG | 0 | 1.000000099473604e-05 | 3086 | rna-XM_029783764.2 24436960 | 15 | 155153452 | 155156537 | Octopus sinensis 2607531 | GTG\|GTAAGATATT...TCTATATTAACT/TCTATATTAACT...TCCAG\|GTC | 2 | 1 | 55.173 |
| 133512815 | GT-AG | 0 | 1.000000099473604e-05 | 1905 | rna-XM_029783764.2 24436960 | 16 | 155151272 | 155153176 | Octopus sinensis 2607531 | TTG\|GTAAGATACT...TTTTCCTTTTTC/CAAAAACTAAAC...TCTAG\|GTC | 1 | 1 | 62.029 |
| 133512816 | GT-AG | 0 | 0.0001605186900505 | 1161 | rna-XM_029783764.2 24436960 | 17 | 155150031 | 155151191 | Octopus sinensis 2607531 | CAG\|GTATTTGCTC...GTTTTTTTCTCT/TATTGGTTAATC...TACAG\|TGT | 0 | 1 | 64.024 |
| 133512817 | GT-AG | 0 | 1.000000099473604e-05 | 5616 | rna-XM_029783764.2 24436960 | 18 | 155144264 | 155149879 | Octopus sinensis 2607531 | TAG\|GTAAGGTTAA...ATATCTTTTACA/ATATCTTTTACA...TGCAG\|CTC | 1 | 1 | 67.789 |
| 133512818 | GT-AG | 0 | 2.499997093727613e-05 | 2639 | rna-XM_029783764.2 24436960 | 19 | 155141394 | 155144032 | Octopus sinensis 2607531 | ACA\|GTAAGTATTA...GTATTGTTAAAA/TAATATTTCATT...TCCAG\|GAT | 1 | 1 | 73.548 |
| 133512819 | GT-AG | 0 | 7.816866771292704e-05 | 1630 | rna-XM_029783764.2 24436960 | 20 | 155139663 | 155141292 | Octopus sinensis 2607531 | AAC\|GTAAGTATAT...ATTTTCTTATTT/TATTTTCTTATT...TGTAG\|ATT | 0 | 1 | 76.066 |
| 133512820 | GT-AG | 0 | 1.000000099473604e-05 | 532 | rna-XM_029783764.2 24436960 | 21 | 155138933 | 155139464 | Octopus sinensis 2607531 | AAG\|GTAAGAAACT...AATGTTTTATTG/CAATGTTTTATT...CACAG\|ATT | 0 | 1 | 81.002 |
| 133512821 | GT-AG | 0 | 1.000000099473604e-05 | 1780 | rna-XM_029783764.2 24436960 | 22 | 155136997 | 155138776 | Octopus sinensis 2607531 | CAG\|GTAATGAGAA...GCATTTTTATAA/TATAAATTAACT...TATAG\|GAT | 0 | 1 | 84.892 |
| 133512822 | GT-AG | 0 | 0.0003277572761504 | 364 | rna-XM_029783764.2 24436960 | 23 | 155136543 | 155136906 | Octopus sinensis 2607531 | GAT\|GTAAGTTTTA...TCTTCTTTCATA/AATTTTCTCAAA...TATAG\|GTT | 0 | 1 | 87.135 |
| 133512823 | GT-AG | 0 | 1.000000099473604e-05 | 3168 | rna-XM_029783764.2 24436960 | 24 | 155133202 | 155136369 | Octopus sinensis 2607531 | AAG\|GTTAGTATTT...TTTTCTTTCTTT/TGTTGACTAATA...AATAG\|AAC | 2 | 1 | 91.449 |
| 133512824 | GT-AG | 0 | 1.000000099473604e-05 | 1721 | rna-XM_029783764.2 24436960 | 25 | 155131375 | 155133095 | Octopus sinensis 2607531 | ATG\|GTAATAATTA...GATTTCTAATCT/AGATTTCTAATC...TTCAG\|GTC | 0 | 1 | 94.091 |
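The coordinate, length, and phase columns can be cross-checked against each other. The sketch below uses the first four rows above and assumes two things this page does not state outright: that `start` and `end` are inclusive coordinates, and that the transcript lies on the reverse strand (suggested by `start` decreasing as `ordinal_index` increases). Under those assumptions, the exon between two consecutive coding introns should shift the phase by its length modulo 3. The helper names are made up for this example.

```python
# (ordinal_index, start, end, length, phase) copied from the first four rows.
rows = [
    (1, 155258373, 155288871, 30499, 2),
    (2, 155240633, 155258261, 17629, 2),
    (3, 155235890, 155240514, 4625, 0),
    (4, 155234472, 155235673, 1202, 0),
]


def length_matches(start, end, length):
    """True if length equals the inclusive span from start to end."""
    return end - start + 1 == length


def exon_length_between(upstream, downstream):
    """Length of the exon separating two consecutive introns.

    Assumes a reverse-strand transcript, so the downstream intron
    sits at lower genomic coordinates than the upstream one.
    """
    return upstream[1] - downstream[2] - 1


def phase_propagates(upstream, downstream):
    """True if the downstream phase equals upstream phase plus exon length, mod 3."""
    exon_len = exon_length_between(upstream, downstream)
    return (upstream[4] + exon_len) % 3 == downstream[4]


lengths_ok = all(length_matches(s, e, n) for _, s, e, n, _ in rows)
exon_lengths = [exon_length_between(a, b) for a, b in zip(rows, rows[1:])]
phases_ok = all(phase_propagates(a, b) for a, b in zip(rows, rows[1:]))

print(lengths_ok, exon_lengths, phases_ok)
```

The phase check only applies when both flanking introns have `in_cds` = 1, as is the case for every row on this page.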
CREATE TABLE "introns" (
    "id" INTEGER,
    "dinucleotide_pair" TEXT,
    "is_minor" INTEGER,
    "score" REAL,
    "length" INTEGER,
    "transcript_id" INTEGER,
    "ordinal_index" INTEGER,
    "start" INTEGER,
    "end" INTEGER,
    "taxonomy_id" INTEGER,
    "scored_motifs" TEXT,
    "phase" INTEGER,
    "in_cds" INTEGER,
    "relative_position" REAL,
    PRIMARY KEY ([id]),
    FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
    FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
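The index on `transcript_id` is what lets a per-transcript filter like the one on this page avoid scanning the whole table. A minimal way to see this, assuming a local SQLite copy with the same index names, is to ask SQLite for its query plan. The sketch below rebuilds a cut-down, empty version of the table in memory, so the plan is only indicative of what the real database would do.

```python
import sqlite3

# Cut-down, empty copy of the schema: enough columns to exercise one index.
conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE introns (
        id INTEGER PRIMARY KEY,
        transcript_id INTEGER,
        ordinal_index INTEGER,
        phase INTEGER
    );
    CREATE INDEX idx_introns_transcript_id ON introns (transcript_id);
    CREATE INDEX idx_introns_phase ON introns (phase);
    """
)

# Ask SQLite how it would run the per-transcript lookup.
plan = conn.execute(
    "EXPLAIN QUERY PLAN "
    "SELECT id FROM introns WHERE transcript_id = ? ORDER BY ordinal_index",
    (24436960,),
).fetchall()

# The human-readable detail is the last column of each plan row.
plan_text = " ".join(row[-1] for row in plan)
print(plan_text)
```

The exact wording of the plan varies between SQLite versions, so the useful signal is whether the index name appears in it.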