introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
44 rows where transcript_id = 34991565
This data as json, CSV (advanced)
Suggested facets: score, phase
| id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 196985748 | GT-AG | 0 | 1.000000099473604e-05 | 126951 | rna-XM_034167047.1 34991565 | 2 | 22370551 | 22497501 | Thalassophryne amazonica 390379 | AAG|GTGAGCAAAG...TACTTTTTGACC/TACTTTTTGACC...CACAG|CCT | 0 | 1 | 2.813 |
| 196985749 | GT-AG | 0 | 0.0001157154596988 | 2609 | rna-XM_034167047.1 34991565 | 3 | 22367824 | 22370432 | Thalassophryne amazonica 390379 | ACG|GTACTCAGCT...GTGCCAGTGATG/GTGATGCTGACT...TGCAG|ACA | 1 | 1 | 4.901 |
| 196985750 | GT-AG | 0 | 1.000000099473604e-05 | 548 | rna-XM_034167047.1 34991565 | 4 | 22367197 | 22367744 | Thalassophryne amazonica 390379 | CAG|GTAAGTGAGT...TCTTTCTTTTCT/TTTTTTTTCTTT...CTCAG|TTT | 2 | 1 | 6.299 |
| 196985751 | GT-AG | 0 | 0.0006349250327076 | 565 | rna-XM_034167047.1 34991565 | 5 | 22366542 | 22367106 | Thalassophryne amazonica 390379 | AAA|GTATGTACAC...CTTTCCTTCTTT/TTCTGTTTGATT...TGTAG|TCT | 2 | 1 | 7.891 |
| 196985752 | GT-AG | 0 | 2.2012312644159312e-05 | 3613 | rna-XM_034167047.1 34991565 | 6 | 22362872 | 22366484 | Thalassophryne amazonica 390379 | TTC|GTAAGTGTTT...TTCTTTGTGATT/TTGTGATTTACT...TTCAG|GTC | 2 | 1 | 8.9 |
| 196985753 | GT-AG | 0 | 1.000000099473604e-05 | 2382 | rna-XM_034167047.1 34991565 | 7 | 22360426 | 22362807 | Thalassophryne amazonica 390379 | TTG|GTGAGTGGAA...CGTGGCTTAACA/TTCTTTGTAACC...CTTAG|CCG | 0 | 1 | 10.032 |
| 196985754 | GT-AG | 0 | 1.2358076934667678e-05 | 171 | rna-XM_034167047.1 34991565 | 8 | 22360180 | 22360350 | Thalassophryne amazonica 390379 | CCC|GTAAGTGAGA...GCTTCTTTATCG/TGCTTCTTTATC...TGTAG|GAG | 0 | 1 | 11.359 |
| 196985755 | GT-AG | 0 | 1.000000099473604e-05 | 1355 | rna-XM_034167047.1 34991565 | 9 | 22358680 | 22360034 | Thalassophryne amazonica 390379 | TAG|GTCAGAAATA...TGGCTCTTGAGG/AGAGTGTTCATG...TCCAG|GCT | 1 | 1 | 13.924 |
| 196985756 | GT-AG | 0 | 1.000000099473604e-05 | 118 | rna-XM_034167047.1 34991565 | 10 | 22358443 | 22358560 | Thalassophryne amazonica 390379 | CAG|GTAGGAAATG...TATTCATTATTT/CATTATTTCACT...AAAAG|TCA | 0 | 1 | 16.03 |
| 196985757 | GT-AG | 0 | 7.813491500394519e-05 | 18854 | rna-XM_034167047.1 34991565 | 11 | 22339389 | 22358242 | Thalassophryne amazonica 390379 | CAG|GTAATCTGAA...TTTTCCTCATCC/TTTTTCCTCATC...TCTAG|ACT | 2 | 1 | 19.568 |
| 196985758 | GT-AG | 0 | 1.000000099473604e-05 | 247 | rna-XM_034167047.1 34991565 | 12 | 22338975 | 22339221 | Thalassophryne amazonica 390379 | TCG|GTGAGTAGAA...GACATTTTGATT/TTTTGATTGACA...TCTAG|AGA | 1 | 1 | 22.523 |
| 196985759 | GT-AG | 0 | 1.000000099473604e-05 | 203 | rna-XM_034167047.1 34991565 | 13 | 22338644 | 22338846 | Thalassophryne amazonica 390379 | AGA|GTGAGTAAAC...GTGGTCTTCATT/GTGGTCTTCATT...TCCAG|GGC | 0 | 1 | 24.788 |
| 196985760 | GT-AG | 0 | 1.000000099473604e-05 | 4128 | rna-XM_034167047.1 34991565 | 14 | 22334390 | 22338517 | Thalassophryne amazonica 390379 | CAG|GTTCATATCA...TTTTCCATCATT/CATTTACTAATA...TTTAG|GAG | 0 | 1 | 27.017 |
| 196985761 | GT-AG | 0 | 1.000000099473604e-05 | 107 | rna-XM_034167047.1 34991565 | 15 | 22334103 | 22334209 | Thalassophryne amazonica 390379 | GAG|GTGAATATTT...ATTGCTTTAAAA/ATTGCTTTAAAA...GCCAG|CCC | 0 | 1 | 30.202 |
| 196985762 | GT-AG | 0 | 0.0004941715251475 | 2875 | rna-XM_034167047.1 34991565 | 16 | 22330880 | 22333754 | Thalassophryne amazonica 390379 | AAG|GTATCAGAGA...TGCTTCTTGCTG/GGGGGCCTCACT...AACAG|GGG | 0 | 1 | 36.359 |
| 196985763 | GT-AG | 0 | 1.000000099473604e-05 | 6300 | rna-XM_034167047.1 34991565 | 17 | 22324517 | 22330816 | Thalassophryne amazonica 390379 | GAG|GTCTGTACGT...CTTCCATTACTC/TTTATACTGACT...TGCAG|CAC | 0 | 1 | 37.473 |
| 196985764 | GT-AG | 0 | 1.000000099473604e-05 | 88 | rna-XM_034167047.1 34991565 | 18 | 22324339 | 22324426 | Thalassophryne amazonica 390379 | GAG|GTAAATAGTG...TGTTTCTTTTTG/TAAGAGATTACA...TTCAG|GGA | 0 | 1 | 39.066 |
| 196985765 | GT-AG | 0 | 1.000000099473604e-05 | 95 | rna-XM_034167047.1 34991565 | 19 | 22324166 | 22324260 | Thalassophryne amazonica 390379 | GAG|GTAAAATACT...ATCTCCTTAGTC/CTCATTTTCATC...TTCAG|GTG | 0 | 1 | 40.446 |
| 196985766 | GT-AG | 0 | 1.000000099473604e-05 | 4250 | rna-XM_034167047.1 34991565 | 20 | 22319595 | 22323844 | Thalassophryne amazonica 390379 | ACG|GTAAGATGCA...AATTTTTTAAAC/AATTTTTTAAAC...TACAG|GGA | 0 | 1 | 46.125 |
| 196985767 | GT-AG | 0 | 1.000000099473604e-05 | 1466 | rna-XM_034167047.1 34991565 | 21 | 22318048 | 22319513 | Thalassophryne amazonica 390379 | TCC|GTGAGTGCCG...TTGGTTTTATTG/GTTGGTTTTATT...TCCAG|ATC | 0 | 1 | 47.558 |
| 196985768 | GT-AG | 0 | 1.000000099473604e-05 | 460 | rna-XM_034167047.1 34991565 | 22 | 22317345 | 22317804 | Thalassophryne amazonica 390379 | GAG|GTGAGACCTG...CATTTTTTAACC/CATTTTTTAACC...AACAG|GAA | 0 | 1 | 51.858 |
| 196985769 | GT-AG | 0 | 1.000000099473604e-05 | 105 | rna-XM_034167047.1 34991565 | 23 | 22317129 | 22317233 | Thalassophryne amazonica 390379 | AAG|GTACAACCCT...CCTCTCTCAATT/TATCTTCTCAAA...ATCAG|GTG | 0 | 1 | 53.822 |
| 196985770 | GT-AG | 0 | 1.000000099473604e-05 | 141 | rna-XM_034167047.1 34991565 | 24 | 22316811 | 22316951 | Thalassophryne amazonica 390379 | CAG|GTAAAGCAGC...TATACCATAACA/CATTGCCTAATG...TGCAG|GAA | 0 | 1 | 56.953 |
| 196985771 | GT-AG | 0 | 1.000000099473604e-05 | 392 | rna-XM_034167047.1 34991565 | 25 | 22316302 | 22316693 | Thalassophryne amazonica 390379 | AAG|GTGCTGGAAC...GGGCTCTTATCA/TGGGCTCTTATC...AATAG|GTA | 0 | 1 | 59.023 |
| 196985772 | GT-AG | 0 | 1.000000099473604e-05 | 818 | rna-XM_034167047.1 34991565 | 26 | 22315295 | 22316112 | Thalassophryne amazonica 390379 | GAG|GTGAGACACC...CTTTTCTTGTGT/GCATATCTCATG...TCCAG|GCA | 0 | 1 | 62.367 |
| 196985773 | GT-AG | 0 | 1.000000099473604e-05 | 301 | rna-XM_034167047.1 34991565 | 27 | 22314874 | 22315174 | Thalassophryne amazonica 390379 | AAG|GTACGATATG...CAGATCTTAAAT/AATAATCTTACC...TCCAG|GAG | 0 | 1 | 64.49 |
| 196985774 | GT-AG | 0 | 1.000000099473604e-05 | 550 | rna-XM_034167047.1 34991565 | 28 | 22314267 | 22314816 | Thalassophryne amazonica 390379 | GTG|GTAAGTGAAA...GGTGTCTTTCCT/TCTTTCCTCCTG...TGAAG|GAG | 0 | 1 | 65.499 |
| 196985775 | GT-AG | 0 | 1.000000099473604e-05 | 1279 | rna-XM_034167047.1 34991565 | 29 | 22312931 | 22314209 | Thalassophryne amazonica 390379 | GAG|GTGAGCCACT...AGAACCCTAACT/TGGTATCTAATT...TTTAG|AAC | 0 | 1 | 66.507 |
| 196985776 | GT-AG | 0 | 1.6995797993181458e-05 | 5188 | rna-XM_034167047.1 34991565 | 30 | 22307662 | 22312849 | Thalassophryne amazonica 390379 | AGG|GTAAGTCTCC...TTTTTTTTATCT/TTTTTTTTTATC...CGCAG|GCT | 0 | 1 | 67.941 |
| 196985777 | GT-AG | 0 | 3.9669160560946736e-05 | 4630 | rna-XM_034167047.1 34991565 | 31 | 22302960 | 22307589 | Thalassophryne amazonica 390379 | CCG|GTAAGCAGCC...ATTACTTTAATT/AATTTATTTATT...TGCAG|GCA | 0 | 1 | 69.214 |
| 196985778 | GT-AG | 0 | 1.000000099473604e-05 | 255 | rna-XM_034167047.1 34991565 | 32 | 22302651 | 22302905 | Thalassophryne amazonica 390379 | TCA|GTGAGTGTGT...TTTTTTTTTCCT/TAGATATTTATG...TCCAG|ACA | 0 | 1 | 70.17 |
| 196985779 | GT-AG | 0 | 1.000000099473604e-05 | 575 | rna-XM_034167047.1 34991565 | 33 | 22301953 | 22302527 | Thalassophryne amazonica 390379 | TCT|GTAAGACCCC...TCTTCTTTCTTC/CCCCGCGTAACC...CACAG|CCT | 0 | 1 | 72.346 |
| 196985780 | GT-AG | 0 | 1.000000099473604e-05 | 385 | rna-XM_034167047.1 34991565 | 34 | 22301457 | 22301841 | Thalassophryne amazonica 390379 | CAG|GTAGGACGAG...TCTCCTTTATTT/CTTTATTTCACT...AACAG|GAT | 0 | 1 | 74.31 |
| 196985781 | GT-AG | 0 | 1.000000099473604e-05 | 4350 | rna-XM_034167047.1 34991565 | 35 | 22297065 | 22301414 | Thalassophryne amazonica 390379 | GAG|GTGAGTCACC...TGGTCTTTATTC/TTTATTCTGACG...TTCAG|CCT | 0 | 1 | 75.053 |
| 196985782 | GT-AG | 0 | 1.509737046786561 | 335 | rna-XM_034167047.1 34991565 | 36 | 22296559 | 22296893 | Thalassophryne amazonica 390379 | CAG|GTACCCACTG...CATTTCTTGACT/CATTTCTTGACT...CTCAG|AGT | 0 | 1 | 78.079 |
| 196985783 | GT-AG | 0 | 1.000000099473604e-05 | 623 | rna-XM_034167047.1 34991565 | 37 | 22295776 | 22296398 | Thalassophryne amazonica 390379 | AAG|GTTAGTTCCC...TCTGCCTTGTTT/CCTTGTTTAATT...CCCAG|CAA | 1 | 1 | 80.909 |
| 196985784 | GT-AG | 0 | 1.000000099473604e-05 | 98 | rna-XM_034167047.1 34991565 | 38 | 22295458 | 22295555 | Thalassophryne amazonica 390379 | GGG|GTAGGAATGC...TTGTTTTTTGTG/CAGTTACTCACT...CACAG|GCC | 2 | 1 | 84.802 |
| 196985785 | GT-AG | 0 | 1.000000099473604e-05 | 87 | rna-XM_034167047.1 34991565 | 39 | 22295281 | 22295367 | Thalassophryne amazonica 390379 | AGG|GTAATAACGG...CTGTGTTTAACT/CTGTGTTTAACT...GTCAG|TCT | 2 | 1 | 86.394 |
| 196985786 | GT-AG | 0 | 1.000000099473604e-05 | 250 | rna-XM_034167047.1 34991565 | 40 | 22294848 | 22295097 | Thalassophryne amazonica 390379 | ACG|GTAAATAGAA...TTCTCCTCACCT/CCTCACCTCATT...CACAG|GCC | 2 | 1 | 89.632 |
| 196985787 | GT-AG | 0 | 0.0006469236269327 | 2744 | rna-XM_034167047.1 34991565 | 41 | 22291963 | 22294706 | Thalassophryne amazonica 390379 | CAG|GTCTGCTTCT...TCTGTCTTCGTT/TCTTCGTTTATC...TGTAG|AAT | 2 | 1 | 92.127 |
| 196985788 | GT-AG | 0 | 1.000000099473604e-05 | 1861 | rna-XM_034167047.1 34991565 | 42 | 22289962 | 22291822 | Thalassophryne amazonica 390379 | AAG|GTGAGAAACA...CATTTCATATTT/TTTTTTGTCATT...GACAG|AGC | 1 | 1 | 94.604 |
| 196985789 | GT-AG | 0 | 1.000000099473604e-05 | 413 | rna-XM_034167047.1 34991565 | 43 | 22289511 | 22289923 | Thalassophryne amazonica 390379 | CTG|GTAAGAGCTA...TCTGCCTCATTT/TTCTGCCTCATT...TCTAG|TCC | 0 | 1 | 95.276 |
| 196985790 | GT-AG | 0 | 1.000000099473604e-05 | 2436 | rna-XM_034167047.1 34991565 | 44 | 22287000 | 22289435 | Thalassophryne amazonica 390379 | CAG|GTACTGCTGC...GTGCCCCTGACG/CTGACGCTGAAT...TCCAG|TCT | 0 | 1 | 96.603 |
| 196985791 | GT-AG | 0 | 1.000000099473604e-05 | 3752 | rna-XM_034167047.1 34991565 | 45 | 22283193 | 22286944 | Thalassophryne amazonica 390379 | AAG|GTGAGTGCTG...TGCGCTGTGACT/CTGTGACTGACG...TCCAG|GGC | 1 | 1 | 97.576 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL
,PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);