introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
35 rows where transcript_id = 34991564
This data as json, CSV (advanced)
Suggested facets: dinucleotide_pair, score, phase
| id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 196985713 | GT-AG | 0 | 0.0001834158827499 | 81 | rna-XM_034178363.1 34991564 | 1 | 155442270 | 155442350 | Thalassophryne amazonica 390379 | AAA|GTCACGTTCT...ATGATCTTGTCC/TGAGAACTGAAT...TGCAG|TCT | 0 | 1 | 5.424 |
| 196985714 | GT-AG | 0 | 1.000000099473604e-05 | 258 | rna-XM_034178363.1 34991564 | 2 | 155442449 | 155442706 | Thalassophryne amazonica 390379 | CAG|GTGAGACACC...TTTGTCTTTGTC/TCTGTACCTATT...TTTAG|GGA | 2 | 1 | 7.144 |
| 196985715 | GT-AG | 0 | 3.122771833127475 | 15771 | rna-XM_034178363.1 34991564 | 3 | 155443680 | 155459450 | Thalassophryne amazonica 390379 | CTT|GTACCTCCCA...TTATCTTTGATT/ATTTTGTTCATT...TGAAG|TTG | 0 | 1 | 24.223 |
| 196985716 | GT-AG | 0 | 1.000000099473604e-05 | 104 | rna-XM_034178363.1 34991564 | 4 | 155459609 | 155459712 | Thalassophryne amazonica 390379 | AGC|GTAAGTAGCC...TATTGCTTATTC/CTTATTCTCACT...ACCAG|GTT | 2 | 1 | 26.997 |
| 196985717 | GT-AG | 0 | 1.000000099473604e-05 | 1179 | rna-XM_034178363.1 34991564 | 5 | 155459829 | 155461007 | Thalassophryne amazonica 390379 | CAG|GTAAACCAAG...GTTACTTTCATG/GTTACTTTCATG...TGGAG|GGA | 1 | 1 | 29.033 |
| 196985718 | GT-AG | 0 | 1.000000099473604e-05 | 189 | rna-XM_034178363.1 34991564 | 6 | 155461147 | 155461335 | Thalassophryne amazonica 390379 | TTT|GTAAGAACTA...ATGGTTTTGATT/ATGGTTTTGATT...TCCAG|GGT | 2 | 1 | 31.473 |
| 196985719 | GT-AG | 0 | 0.0002001573612418 | 83 | rna-XM_034178363.1 34991564 | 7 | 155461421 | 155461503 | Thalassophryne amazonica 390379 | CAG|GTAACTCCCT...TGTCTCTCAGCT/TTGTCTCTCAGC...GACAG|GGA | 0 | 1 | 32.965 |
| 196985720 | AT-AG | 0 | 1.000000099473604e-05 | 3417 | rna-XM_034178363.1 34991564 | 8 | 155461665 | 155465081 | Thalassophryne amazonica 390379 | ATC|ATGAGGAGGA...ACAGCAGTAACA/CTAATAGTGATG...TGAAG|GTC | 2 | 1 | 35.791 |
| 196985721 | GT-AG | 0 | 1.702710748106691e-05 | 680 | rna-XM_034178363.1 34991564 | 9 | 155465201 | 155465880 | Thalassophryne amazonica 390379 | GAG|GTATTAACAA...CCTTCTTTGTTT/TAAGATCTGATT...CTCAG|GTA | 1 | 1 | 37.88 |
| 196985722 | GT-AG | 0 | 0.0001201662926525 | 146 | rna-XM_034178363.1 34991564 | 10 | 155466111 | 155466256 | Thalassophryne amazonica 390379 | CAG|GTAACTATGG...ATTTCCTCTGTG/TCTGTGTACATG...ACCAG|GAC | 0 | 1 | 41.917 |
| 196985723 | GT-AG | 0 | 1.000000099473604e-05 | 392 | rna-XM_034178363.1 34991564 | 11 | 155466465 | 155466856 | Thalassophryne amazonica 390379 | CAG|GTGAACATTA...TCTGCCTGGGTG/TGGGAACGGACT...CCCAG|GAC | 1 | 1 | 45.568 |
| 196985724 | GT-AG | 0 | 1.000000099473604e-05 | 736 | rna-XM_034178363.1 34991564 | 12 | 155466915 | 155467650 | Thalassophryne amazonica 390379 | ACT|GTCAGACTTG...CTTCTCTTATTG/TTTTATTTGACT...TTTAG|ACA | 2 | 1 | 46.586 |
| 196985725 | GT-AG | 0 | 1.000000099473604e-05 | 146 | rna-XM_034178363.1 34991564 | 14 | 155467843 | 155467988 | Thalassophryne amazonica 390379 | CAG|GTCAACAGTC...TTTTTTGTATTT/CTGAAATTGAAT...CAGAG|GTT | 0 | 1 | 49.921 |
| 196985726 | GT-AC | 0 | 1.000000099473604e-05 | 553 | rna-XM_034178363.1 34991564 | 15 | 155468077 | 155468629 | Thalassophryne amazonica 390379 | TAA|GTTGGTGTTC...AGTAACTTGTCA/TATAAAGTAACT...ACAAC|ACT | 1 | 1 | 51.466 |
| 196985727 | GT-AG | 0 | 1.000000099473604e-05 | 112 | rna-XM_034178363.1 34991564 | 16 | 155468847 | 155468958 | Thalassophryne amazonica 390379 | CAG|GTTGGTCAAT...ATGTCTATAATT/ATGTCTATAATT...CCCAG|ACC | 2 | 1 | 55.275 |
| 196985728 | GT-AG | 0 | 1.000000099473604e-05 | 631 | rna-XM_034178363.1 34991564 | 17 | 155469204 | 155469834 | Thalassophryne amazonica 390379 | CAG|GTGTTACATG...TGACTCATGACA/TTTTGACTCATG...CTCAG|GTA | 1 | 1 | 59.575 |
| 196985729 | GT-AG | 0 | 0.0007474645217836 | 2163 | rna-XM_034178363.1 34991564 | 18 | 155469966 | 155472128 | Thalassophryne amazonica 390379 | AAG|GTCTGTTTCT...ATGTTTTTAACA/ATGTTTTTAACA...GACAG|GTT | 0 | 1 | 61.875 |
| 196985730 | GT-AG | 0 | 1.000000099473604e-05 | 68 | rna-XM_034178363.1 34991564 | 19 | 155472252 | 155472319 | Thalassophryne amazonica 390379 | CAG|GTACTTGTAT...TAACTAGTGACT/GATCATTTCACA...GGAAG|GTT | 0 | 1 | 64.034 |
| 196985731 | GC-AG | 0 | 1.000000099473604e-05 | 101 | rna-XM_034178363.1 34991564 | 20 | 155472452 | 155472552 | Thalassophryne amazonica 390379 | GAG|GCGAGAGAGA...GTTTGTTTAGAG/TATGTGTTTAAT...GGCAG|ATT | 0 | 1 | 66.351 |
| 196985732 | GT-AG | 0 | 1.000000099473604e-05 | 901 | rna-XM_034178363.1 34991564 | 21 | 155472631 | 155473531 | Thalassophryne amazonica 390379 | GCT|GTGAGTTTCT...CTGTTCTTGTCA/TTTAATCTCACC...TATAG|TCC | 0 | 1 | 67.72 |
| 196985733 | GT-AG | 0 | 3.450067338618992e-05 | 256 | rna-XM_034178363.1 34991564 | 22 | 155473680 | 155473935 | Thalassophryne amazonica 390379 | CAG|GTATGTACAC...TATGCTTTTGTT/TTGTTGCTGACT...TTAAG|AAA | 1 | 1 | 70.318 |
| 196985734 | GT-AC | 0 | 1.000000099473604e-05 | 2717 | rna-XM_034178363.1 34991564 | 23 | 155473996 | 155476712 | Thalassophryne amazonica 390379 | CAG|GTGAGAGTTT...CGCTCCTCAACC/CCGCTCCTCAAC...AACAC|CTT | 1 | 1 | 71.371 |
| 196985735 | GT-AG | 0 | 0.0005911179806396 | 358 | rna-XM_034178363.1 34991564 | 24 | 155476745 | 155477102 | Thalassophryne amazonica 390379 | GAG|GTATCAGAGA...GTATTTTTCTCC/TTTTTGTTCAGT...TCCAG|ATG | 0 | 1 | 71.933 |
| 196985736 | GT-AG | 0 | 1.000000099473604e-05 | 1732 | rna-XM_034178363.1 34991564 | 25 | 155477278 | 155479009 | Thalassophryne amazonica 390379 | CAG|GTGAGAGGTG...TTTTTTTTTTTT/TCTGCTTTCAAT...CTTAG|TGT | 1 | 1 | 75.004 |
| 196985737 | GT-AG | 0 | 0.001250878655938 | 1669 | rna-XM_034178363.1 34991564 | 26 | 155479236 | 155480904 | Thalassophryne amazonica 390379 | TGG|GTTTGTTTGC...CACCTTTTAACT/CACCTTTTAACT...TGCAG|TGG | 2 | 1 | 78.971 |
| 196985738 | GT-AG | 0 | 3.780839076875836e-05 | 2169 | rna-XM_034178363.1 34991564 | 27 | 155481002 | 155483170 | Thalassophryne amazonica 390379 | CAG|GTACAGTATT...GCCCTTTTATCT/TGCCCTTTTATC...TTTAG|AAA | 0 | 1 | 80.674 |
| 196985739 | GT-AG | 0 | 0.0151516385528506 | 260 | rna-XM_034178363.1 34991564 | 28 | 155483299 | 155483558 | Thalassophryne amazonica 390379 | CAG|GTAACCTGTT...TTTTTTCTAATT/TTTTTTCTAATT...TCCAG|GTT | 2 | 1 | 82.921 |
| 196985740 | GT-AG | 0 | 2.3092098925300963e-05 | 3246 | rna-XM_034178363.1 34991564 | 29 | 155483698 | 155486943 | Thalassophryne amazonica 390379 | CTG|GTACTCAAAT...AGATTCAAATCA/ATCAGATTCAAA...CATAG|CTT | 0 | 1 | 85.361 |
| 196985741 | GT-AG | 0 | 1.000000099473604e-05 | 98 | rna-XM_034178363.1 34991564 | 30 | 155487003 | 155487100 | Thalassophryne amazonica 390379 | TTG|GTAATGTGTG...CCGATTTGAACC/ACCGATTTGAAC...TCTAG|GTC | 2 | 1 | 86.396 |
| 196985742 | GT-AG | 0 | 1.000000099473604e-05 | 106 | rna-XM_034178363.1 34991564 | 31 | 155487239 | 155487344 | Thalassophryne amazonica 390379 | GAG|GTTAGTCCAC...TTCTCCATAATA/ATGTTCTCCATA...TTTAG|CAT | 2 | 1 | 88.819 |
| 196985743 | GT-AG | 0 | 1.000000099473604e-05 | 4582 | rna-XM_034178363.1 34991564 | 32 | 155487427 | 155492008 | Thalassophryne amazonica 390379 | CAG|GTAATATATA...TGGAACTTGACA/TCCATTGTCACT...AGCAG|CCA | 0 | 1 | 90.258 |
| 196985744 | GT-AG | 0 | 7.340917458214472e-05 | 3723 | rna-XM_034178363.1 34991564 | 33 | 155492123 | 155495845 | Thalassophryne amazonica 390379 | AAG|GTAACATGCA...ATTTTCTAGAAC/TGTGTGTGCACA...GGCAG|GTG | 0 | 1 | 92.259 |
| 196985745 | GT-AG | 0 | 0.0003317825841174 | 372 | rna-XM_034178363.1 34991564 | 34 | 155495959 | 155496330 | Thalassophryne amazonica 390379 | AAA|GTATGTGTAC...TTGCTGTTACTA/CTGTTACTAAAT...CCCAG|TAC | 2 | 1 | 94.243 |
| 196985746 | GT-AG | 0 | 1.000000099473604e-05 | 2007 | rna-XM_034178363.1 34991564 | 35 | 155496482 | 155498488 | Thalassophryne amazonica 390379 | ACA|GTAAGTACAT...AGAGCCGTGATT/TTTGTAATGATT...TTCAG|AAA | 0 | 1 | 96.893 |
| 196985747 | GT-AG | 0 | 1.000000099473604e-05 | 690 | rna-XM_034178363.1 34991564 | 36 | 155498578 | 155499267 | Thalassophryne amazonica 390379 | CAG|GTAAGTGGTC...CAAACCTTGTTC/AATGATGTAACT...CTTAG|GGT | 2 | 1 | 98.455 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL
,PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);