introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
32 rows where transcript_id = 34991586
This data as json, CSV (advanced)
Suggested facets: score, phase
| id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 196986365 | GT-AG | 0 | 1.000000099473604e-05 | 1551 | rna-XM_034172666.1 34991586 | 1 | 91764533 | 91766083 | Thalassophryne amazonica 390379 | AAG|GTAGGATCAA...CTGGCCTCACTT/ACTGGCCTCACT...ATTAG|GCT | 1 | 1 | 0.872 |
| 196986366 | GT-AG | 0 | 0.0007856437306566 | 43549 | rna-XM_034172666.1 34991586 | 2 | 91766295 | 91809843 | Thalassophryne amazonica 390379 | TGG|GTATGTTACA...TTTGTTTGAGTA/ATTTGTTTGAGT...GACAG|GGA | 2 | 1 | 5.472 |
| 196986367 | GT-AG | 0 | 1.000000099473604e-05 | 8349 | rna-XM_034172666.1 34991586 | 3 | 91810020 | 91818368 | Thalassophryne amazonica 390379 | CTG|GTGAGTGAAA...TGATATTTAATA/TTCATGTTCACT...TCCAG|GAT | 1 | 1 | 9.309 |
| 196986368 | GT-AG | 0 | 1.000000099473604e-05 | 25444 | rna-XM_034172666.1 34991586 | 4 | 91818573 | 91844016 | Thalassophryne amazonica 390379 | CAG|GTAGGCCAGC...TCTACTTTCTCT/AGTTCTCTCATA...TCCAG|CTG | 1 | 1 | 13.756 |
| 196986369 | GT-AG | 0 | 1.000000099473604e-05 | 6163 | rna-XM_034172666.1 34991586 | 5 | 91844199 | 91850361 | Thalassophryne amazonica 390379 | AAT|GTGAGTATTT...TGGTCTCTGTCT/AAAATGCACACA...ATCAG|GCT | 0 | 1 | 17.724 |
| 196986370 | GT-AG | 0 | 0.0001886123309246 | 471 | rna-XM_034172666.1 34991586 | 6 | 91850471 | 91850941 | Thalassophryne amazonica 390379 | TTG|GTACACAGAC...TCAATCTCAACA/ATCAATCTCAAC...TTTAG|GCA | 1 | 1 | 20.1 |
| 196986371 | GT-AG | 0 | 1.000000099473604e-05 | 2365 | rna-XM_034172666.1 34991586 | 7 | 91851069 | 91853433 | Thalassophryne amazonica 390379 | CAG|GTAACGCCTC...TCTCTCTTTCTG/TATTACGTAAGT...TCTAG|TCG | 2 | 1 | 22.869 |
| 196986372 | GT-AG | 0 | 4.019930856575645e-05 | 153 | rna-XM_034172666.1 34991586 | 8 | 91853537 | 91853689 | Thalassophryne amazonica 390379 | GAA|GTGTGTGTGT...TCATCTTTATTT/TTAATTCTCATC...TTTAG|GTG | 0 | 1 | 25.114 |
| 196986373 | GT-AG | 0 | 1.000000099473604e-05 | 2930 | rna-XM_034172666.1 34991586 | 9 | 91853788 | 91856717 | Thalassophryne amazonica 390379 | CAG|GTCAGTCTCT...GTTCTGTTGCCC/TGAGTGCTCATA...TGCAG|GTA | 2 | 1 | 27.251 |
| 196986374 | GT-AG | 0 | 1.4842779343779064e-05 | 8827 | rna-XM_034172666.1 34991586 | 10 | 91856858 | 91865684 | Thalassophryne amazonica 390379 | CAG|GTTTGTGTTA...TAGACCTTATAC/CTTATACTAAGA...CTCAG|GAT | 1 | 1 | 30.303 |
| 196986375 | GT-AG | 0 | 0.0012354950044938 | 2953 | rna-XM_034172666.1 34991586 | 11 | 91865873 | 91868825 | Thalassophryne amazonica 390379 | GAG|GTATTGTGCA...CTTATCTTAACT/CTTATCTTAACT...TGTAG|GCG | 0 | 1 | 34.402 |
| 196986376 | GT-AG | 0 | 1.000000099473604e-05 | 25135 | rna-XM_034172666.1 34991586 | 12 | 91868950 | 91894084 | Thalassophryne amazonica 390379 | AAG|GTACTGATAC...TGTTTCTTGTTT/CAGTAGCTCATG...ATCAG|TTA | 1 | 1 | 37.105 |
| 196986377 | GT-AG | 0 | 1.000000099473604e-05 | 2239 | rna-XM_034172666.1 34991586 | 13 | 91894237 | 91896475 | Thalassophryne amazonica 390379 | AAG|GTACATAATC...ATAATTTGAATT/TTTGAATTAATT...TTCAG|AAT | 0 | 1 | 40.419 |
| 196986378 | GT-AG | 0 | 1.000000099473604e-05 | 1436 | rna-XM_034172666.1 34991586 | 14 | 91896578 | 91898013 | Thalassophryne amazonica 390379 | CAG|GTAAAAGAAA...TTACCTTCAGTG/CTGGGATTTACC...CGCAG|ATA | 0 | 1 | 42.642 |
| 196986379 | GT-AG | 0 | 5.461728968960986e-05 | 10374 | rna-XM_034172666.1 34991586 | 15 | 91898170 | 91908543 | Thalassophryne amazonica 390379 | CAG|GTTTGTTCAC...TCCTCTTTGAAT/TCTATACTTAAA...GGTAG|GTG | 0 | 1 | 46.043 |
| 196986380 | GT-AG | 0 | 1.000000099473604e-05 | 128 | rna-XM_034172666.1 34991586 | 16 | 91908695 | 91908822 | Thalassophryne amazonica 390379 | CAG|GTGCTCATCA...AAGATCTTACTT/TTGCTTTTCATT...CACAG|GGC | 1 | 1 | 49.335 |
| 196986381 | GT-AG | 0 | 1.000000099473604e-05 | 7212 | rna-XM_034172666.1 34991586 | 17 | 91908948 | 91916159 | Thalassophryne amazonica 390379 | CAG|GTATGGACCG...AAAAACTTGACC/AGATATTTCAAA...TGCAG|GTA | 0 | 1 | 52.06 |
| 196986382 | GT-AG | 0 | 1.000000099473604e-05 | 11180 | rna-XM_034172666.1 34991586 | 18 | 91916273 | 91927452 | Thalassophryne amazonica 390379 | CAG|GTGGGCAAAT...ATTGTCTGAATT/TGAATTTTCATG...TGTAG|GGT | 2 | 1 | 54.524 |
| 196986383 | GT-AG | 0 | 0.0042433855211981 | 1060 | rna-XM_034172666.1 34991586 | 19 | 91927594 | 91928653 | Thalassophryne amazonica 390379 | GAC|GTATGTATGC...AATGTCTTCCTG/GACACATTTATG...TTCAG|TAT | 2 | 1 | 57.598 |
| 196986384 | GT-AG | 0 | 9.477122109131632e-05 | 6663 | rna-XM_034172666.1 34991586 | 20 | 91928785 | 91935447 | Thalassophryne amazonica 390379 | AGG|GTACACACAC...GACTTCATGAAA/AAAGACTTCATG...CTTAG|GGT | 1 | 1 | 60.453 |
| 196986385 | GT-AG | 0 | 1.000000099473604e-05 | 5830 | rna-XM_034172666.1 34991586 | 21 | 91935585 | 91941414 | Thalassophryne amazonica 390379 | GAG|GTAAGTACTG...TGAGTGTTATTA/ATTATGCTGATG...TCCAG|GTG | 0 | 1 | 63.44 |
| 196986386 | GT-AG | 0 | 0.0001037721767792 | 175 | rna-XM_034172666.1 34991586 | 22 | 91941549 | 91941723 | Thalassophryne amazonica 390379 | CAG|GTTTATCTGT...CATGTTTTGAAG/CATGTTTTGAAG...CTTAG|CTT | 2 | 1 | 66.361 |
| 196986387 | GT-AG | 0 | 1.2950864759198492e-05 | 20440 | rna-XM_034172666.1 34991586 | 23 | 91941900 | 91962339 | Thalassophryne amazonica 390379 | CAG|GTCTGCACAC...AATGATTTGACT/AATGATTTGACT...TTCAG|GTC | 1 | 1 | 70.198 |
| 196986388 | GT-AG | 0 | 1.000000099473604e-05 | 3276 | rna-XM_034172666.1 34991586 | 24 | 91962443 | 91965718 | Thalassophryne amazonica 390379 | TCG|GTGAGCTCGC...TAGTTCTTGAAA/ATGTATTTTATT...CACAG|GAA | 2 | 1 | 72.444 |
| 196986389 | GT-AG | 0 | 1.000000099473604e-05 | 30565 | rna-XM_034172666.1 34991586 | 25 | 91965945 | 91996509 | Thalassophryne amazonica 390379 | TGG|GTAAGACCAA...TGAATTTCAATT/CTGAATTTCAAT...TCCAG|GAT | 0 | 1 | 77.371 |
| 196986390 | GT-AG | 0 | 0.004931310402439 | 10535 | rna-XM_034172666.1 34991586 | 26 | 91996622 | 92007156 | Thalassophryne amazonica 390379 | AAG|GTATTTTCAT...AATTTTTTGAAG/GTGTGTTTTATT...CTCAG|GGT | 1 | 1 | 79.813 |
| 196986391 | GT-AG | 0 | 0.0007684739455628 | 19099 | rna-XM_034172666.1 34991586 | 27 | 92007293 | 92026391 | Thalassophryne amazonica 390379 | AAG|GTACATTTTC...TTAGTCTAATTT/TTTAGTCTAATT...TCCAG|GGA | 2 | 1 | 82.777 |
| 196986392 | GT-AG | 0 | 1.7427740334995326e-05 | 3038 | rna-XM_034172666.1 34991586 | 28 | 92026505 | 92029542 | Thalassophryne amazonica 390379 | AAG|GTACAGTCAG...AATTTTTTAAAT/AATTTTTTAAAT...TTCAG|GGA | 1 | 1 | 85.241 |
| 196986393 | GT-AG | 0 | 1.4648930871512293e-05 | 2995 | rna-XM_034172666.1 34991586 | 29 | 92029755 | 92032749 | Thalassophryne amazonica 390379 | GTG|GTACGTATCT...ACAATATTGAAA/ACAATATTGAAA...TGTAG|GCT | 0 | 1 | 89.863 |
| 196986394 | GT-AG | 0 | 0.0028849397855875 | 6681 | rna-XM_034172666.1 34991586 | 30 | 92032848 | 92039528 | Thalassophryne amazonica 390379 | TGA|GTAAGTTTTT...ATTGTTTTAATA/ATTGTTTTAATA...TGCAG|CGA | 2 | 1 | 91.999 |
| 196986395 | GT-AG | 0 | 1.000000099473604e-05 | 86 | rna-XM_034172666.1 34991586 | 31 | 92039617 | 92039702 | Thalassophryne amazonica 390379 | CCA|GTTAGCAAAC...TCTGCTTTGTTT/CATGTGGTAATT...TTTAG|GAA | 0 | 1 | 93.918 |
| 196986396 | GT-AG | 0 | 1.000000099473604e-05 | 3554 | rna-XM_034172666.1 34991586 | 32 | 92039837 | 92043390 | Thalassophryne amazonica 390379 | GAG|GTAAGAAGTG...TTATTCTCAATG/ATTATTCTCAAT...TTCAG|GTC | 2 | 1 | 96.839 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL
,PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);