introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron (e.g., GT-AG)
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, probability that the intron is minor, expressed as a percentage (0-100)
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript, as a percentage of coding length
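The flag and phase columns described above can be combined in a query. The following is a minimal sketch using Python's built-in sqlite3 module, assuming a reduced in-memory copy of the table that holds only the columns needed here. Three rows are taken from this page; the fourth row (id -1) is a hypothetical UTR intron added only to show how a null phase behaves.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Reduced copy of the introns schema: only the columns this sketch needs.
conn.execute(
    'CREATE TABLE introns ("id" INTEGER PRIMARY KEY, "is_minor" INTEGER, '
    '"score" REAL, "phase" INTEGER, "in_cds" INTEGER)'
)
rows = [
    (196985846, 0, 1.000000099473604e-05, 1, 1),  # from this page
    (196985847, 0, 1.000000099473604e-05, 1, 1),  # from this page
    (196985848, 0, 1.000000099473604e-05, 0, 1),  # from this page
    (-1, 0, 0.0, None, 0),  # HYPOTHETICAL UTR intron: phase is NULL, in_cds = 0
]
conn.executemany("INSERT INTO introns VALUES (?, ?, ?, ?, ?)", rows)

# Tally coding-sequence introns by phase.
phase_counts = conn.execute(
    "SELECT phase, COUNT(*) FROM introns WHERE in_cds = 1 "
    "GROUP BY phase ORDER BY phase"
).fetchall()

# Introns outside the coding sequence carry a NULL phase, so they must be
# matched with IS NULL rather than with an equality comparison.
noncoding = conn.execute(
    "SELECT COUNT(*) FROM introns WHERE phase IS NULL"
).fetchone()[0]

print(phase_counts, noncoding)
```

Filtering on in_cds = 1 before grouping keeps the null-phase rows out of the tally.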
32 rows where transcript_id = 34991568
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 196985846 | GT-AG | 0 | 1.000000099473604e-05 | 6384 | rna-XM_034170522.1 34991568 | 1 | 58251896 | 58258279 | Thalassophryne amazonica 390379 | ACT\|GTGAGTAAAT...TTCATTTTAAAC/CTTGTTTTCATA...TGCAG\|GTT | 1 | 1 | 3.26 |
| 196985847 | GT-AG | 0 | 1.000000099473604e-05 | 6730 | rna-XM_034170522.1 34991568 | 2 | 58258412 | 58265141 | Thalassophryne amazonica 390379 | CAG\|GTTGGAGCAT...GTTTTCTAAATG/TGTTTTCTAAAT...CTCAG\|CTC | 1 | 1 | 5.678 |
| 196985848 | GT-AG | 0 | 1.000000099473604e-05 | 96 | rna-XM_034170522.1 34991568 | 3 | 58265231 | 58265326 | Thalassophryne amazonica 390379 | CAG\|GTTGGTCAGA...TGATTTTCAGTG/GTGATTTTCAGT...AACAG\|ATG | 0 | 1 | 7.308 |
| 196985849 | GT-AG | 0 | 1.818236234658767e-05 | 7378 | rna-XM_034170522.1 34991568 | 4 | 58265486 | 58272863 | Thalassophryne amazonica 390379 | CAG\|GTCTGTAACA...ACCTCCTTAGTA/TTCTGTATCATC...CCTAG\|GTG | 0 | 1 | 10.22 |
| 196985850 | GT-AG | 0 | 1.000000099473604e-05 | 14257 | rna-XM_034170522.1 34991568 | 5 | 58272961 | 58287217 | Thalassophryne amazonica 390379 | CAA\|GTACTGCTCT...AACACAGTAACC/TGAAAACTGATT...TTTAG\|AGA | 1 | 1 | 11.996 |
| 196985851 | GT-AG | 0 | 1.000000099473604e-05 | 11919 | rna-XM_034170522.1 34991568 | 6 | 58287350 | 58299268 | Thalassophryne amazonica 390379 | GCC\|GTAAGAAACC...GCTATCTTATCA/TGCTATCTTATC...TTCAG\|CCT | 1 | 1 | 14.414 |
| 196985852 | GT-AG | 0 | 1.000000099473604e-05 | 9619 | rna-XM_034170522.1 34991568 | 7 | 58299422 | 58309040 | Thalassophryne amazonica 390379 | AAG\|GTAGCAAAAC...TGAATATTAATT/TGAATATTAATT...TGTAG\|GTC | 1 | 1 | 17.216 |
| 196985853 | GT-AG | 0 | 1.000000099473604e-05 | 2362 | rna-XM_034170522.1 34991568 | 8 | 58309147 | 58311508 | Thalassophryne amazonica 390379 | CTG\|GTGAGAGTGA...TGGATTTCAACT/CTGGATTTCAAC...CTTAG\|GCA | 2 | 1 | 19.158 |
| 196985854 | GT-AG | 0 | 1.000000099473604e-05 | 113 | rna-XM_034170522.1 34991568 | 9 | 58311877 | 58311989 | Thalassophryne amazonica 390379 | GCT\|GTGAGTCTCT...TTCTCCTGTACT/AAGATGCTCATG...AATAG\|CTC | 1 | 1 | 25.897 |
| 196985855 | GT-AG | 0 | 0.0008028980566053 | 28765 | rna-XM_034170522.1 34991568 | 10 | 58312436 | 58341200 | Thalassophryne amazonica 390379 | CAG\|GTATGTTACA...TGAACTGTAACT/TTGCACATCACT...TCCAG\|ATT | 0 | 1 | 34.066 |
| 196985856 | GT-AG | 0 | 1.000000099473604e-05 | 5952 | rna-XM_034170522.1 34991568 | 11 | 58341289 | 58347240 | Thalassophryne amazonica 390379 | AAG\|GTAGATAAAT...TTTTTCTTGTTT/CGAATTCTAAAT...TGTAG\|GCA | 1 | 1 | 35.678 |
| 196985857 | GT-AG | 0 | 1.000000099473604e-05 | 103 | rna-XM_034170522.1 34991568 | 12 | 58347421 | 58347523 | Thalassophryne amazonica 390379 | GCG\|GTGATGCAAT...TGTTTCTTCTCT/GTTGTTTTCATG...TTTAG\|TGA | 1 | 1 | 38.974 |
| 196985858 | GT-AG | 0 | 1.000000099473604e-05 | 8171 | rna-XM_034170522.1 34991568 | 13 | 58347681 | 58355851 | Thalassophryne amazonica 390379 | CCT\|GTGAGTGGCC...TGTTCTGTAACT/CTGTAACTTATT...GACAG\|GGG | 2 | 1 | 41.85 |
| 196985859 | GT-AG | 0 | 1.000000099473604e-05 | 3925 | rna-XM_034170522.1 34991568 | 14 | 58356009 | 58359933 | Thalassophryne amazonica 390379 | CAA\|GTGAGGACAG...ATTCATTTATTT/TATTTATTCATT...TGCAG\|GTG | 0 | 1 | 44.725 |
| 196985860 | GT-AG | 0 | 1.000000099473604e-05 | 19271 | rna-XM_034170522.1 34991568 | 15 | 58360045 | 58379315 | Thalassophryne amazonica 390379 | CAG\|GTCTTTAAAT...AAGAACTTAACT/TATTTTCTGATC...ATCAG\|GTG | 0 | 1 | 46.758 |
| 196985861 | GT-AG | 0 | 1.000000099473604e-05 | 95 | rna-XM_034170522.1 34991568 | 16 | 58379530 | 58379624 | Thalassophryne amazonica 390379 | GCA\|GTGAGGCCTC...TTTTTTTTATTT/TTTTTTTTTATT...TTCAG\|AGG | 1 | 1 | 50.678 |
| 196985862 | GT-AG | 0 | 1.000000099473604e-05 | 7281 | rna-XM_034170522.1 34991568 | 17 | 58379873 | 58387153 | Thalassophryne amazonica 390379 | AAG\|GTCAGATCAA...CAGCTTTTGACA/CAGCTTTTGACA...TCTAG\|ATA | 0 | 1 | 55.22 |
| 196985863 | GT-AG | 0 | 1.000000099473604e-05 | 4296 | rna-XM_034170522.1 34991568 | 18 | 58387443 | 58391738 | Thalassophryne amazonica 390379 | AAG\|GTACAAATCC...TATTGTTTGAAT/CTCTGCTTTATT...TCCAG\|GAT | 1 | 1 | 60.513 |
| 196985864 | GT-AG | 0 | 1.000000099473604e-05 | 2441 | rna-XM_034170522.1 34991568 | 19 | 58391985 | 58394425 | Thalassophryne amazonica 390379 | ATG\|GTAAGTACAT...TTGTCCTCAATC/TTTGTCCTCAAT...TTTAG\|GTC | 1 | 1 | 65.018 |
| 196985865 | GT-AG | 0 | 1.6542647235273984e-05 | 27088 | rna-XM_034170522.1 34991568 | 20 | 58394584 | 58421671 | Thalassophryne amazonica 390379 | AAG\|GTAAACCACA...TTCCCCTTTGTT/TGTGTACTAAAT...CACAG\|ACA | 0 | 1 | 67.912 |
| 196985866 | GT-AG | 0 | 1.000000099473604e-05 | 5858 | rna-XM_034170522.1 34991568 | 21 | 58421853 | 58427710 | Thalassophryne amazonica 390379 | CAA\|GTAGGTGATA...ATTAATTTAGTT/ATTTAGTTGATA...CTCAG\|GTG | 1 | 1 | 71.227 |
| 196985867 | GT-AG | 0 | 1.000000099473604e-05 | 249 | rna-XM_034170522.1 34991568 | 22 | 58427840 | 58428088 | Thalassophryne amazonica 390379 | GTG\|GTGAGTACAA...TTCCTTTTATTT/ATTCCTTTTATT...TGTAG\|GAC | 1 | 1 | 73.59 |
| 196985868 | GT-AG | 0 | 1.000000099473604e-05 | 12050 | rna-XM_034170522.1 34991568 | 23 | 58428251 | 58440300 | Thalassophryne amazonica 390379 | CTG\|GTTAGTTTCA...TATGCTATAATA/ACTAAATTAAAA...CACAG\|GTG | 1 | 1 | 76.557 |
| 196985869 | GT-AG | 0 | 1.000000099473604e-05 | 9324 | rna-XM_034170522.1 34991568 | 24 | 58440459 | 58449782 | Thalassophryne amazonica 390379 | AAG\|GTGATGCATA...TTCTTCCTACTC/GGGCTTCTCAAA...ATTAG\|GTG | 0 | 1 | 79.451 |
| 196985870 | GT-AG | 0 | 1.000000099473604e-05 | 15016 | rna-XM_034170522.1 34991568 | 25 | 58449985 | 58465000 | Thalassophryne amazonica 390379 | GAG\|GTAAAGCTCA...AAGATGTTAATG/GTTATATTTATA...CTCAG\|GAC | 1 | 1 | 83.15 |
| 196985871 | GT-AG | 0 | 2.2076375993718423e-05 | 4666 | rna-XM_034170522.1 34991568 | 26 | 58465280 | 58469945 | Thalassophryne amazonica 390379 | GTC\|GTAAGTATAA...CATTTTTCAGCT/CTGTTTCTCACA...ACCAG\|CAC | 1 | 1 | 88.26 |
| 196985872 | GT-AG | 0 | 0.0001469516172267 | 8514 | rna-XM_034170522.1 34991568 | 27 | 58470072 | 58478585 | Thalassophryne amazonica 390379 | ATG\|GTAACGCATT...AGGCTTTTAATT/TTTGTTGTGACT...TGCAG\|ACT | 1 | 1 | 90.568 |
| 196985873 | GT-AG | 0 | 1.000000099473604e-05 | 3800 | rna-XM_034170522.1 34991568 | 28 | 58478653 | 58482452 | Thalassophryne amazonica 390379 | TAT\|GTGAGTTTTA...ATATCCTCCGTT/CTCCGTTTCACA...CATAG\|GTG | 2 | 1 | 91.795 |
| 196985874 | GT-AG | 0 | 2.021018969844634e-05 | 147 | rna-XM_034170522.1 34991568 | 29 | 58482545 | 58482691 | Thalassophryne amazonica 390379 | TTG\|GTAATTTCAA...TGATCTTTCTCT/ACTAAACTGATC...TTTAG\|ATT | 1 | 1 | 93.48 |
| 196985875 | GT-AG | 0 | 1.000000099473604e-05 | 83 | rna-XM_034170522.1 34991568 | 30 | 58482789 | 58482871 | Thalassophryne amazonica 390379 | GAG\|GTGCGCTGCG...GCTCCATTAGGT/ATGGTGATGATT...CACAG\|AAA | 2 | 1 | 95.256 |
| 196985876 | GT-AG | 0 | 1.000000099473604e-05 | 13015 | rna-XM_034170522.1 34991568 | 31 | 58482933 | 58495947 | Thalassophryne amazonica 390379 | GAG\|GTCAGTGCAC...TAATTTTAAGCC/ATGAATTTCATC...CTCAG\|CTT | 0 | 1 | 96.374 |
| 196985877 | GT-AG | 0 | 1.000000099473604e-05 | 92 | rna-XM_034170522.1 34991568 | 32 | 58496044 | 58496135 | Thalassophryne amazonica 390379 | GAG\|GTAAGTGTTT...CTCTCTGTATAC/ATCTGTCTGATG...CACAG\|CAG | 0 | 1 | 98.132 |
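In the rows listed here, length matches end - start + 1, which suggests that start and end are inclusive coordinates. The sketch below, using Python's built-in sqlite3 module, reproduces the transcript filter for this page and checks that relationship. It assumes a reduced in-memory copy of the table loaded with the first three rows only; the inclusive-coordinate reading is an inference from the data shown, not a documented guarantee.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Reduced copy of the introns schema: only the columns this check needs.
conn.execute(
    'CREATE TABLE introns ("id" INTEGER PRIMARY KEY, "length" INTEGER, '
    '"transcript_id" INTEGER, "ordinal_index" INTEGER, '
    '"start" INTEGER, "end" INTEGER)'
)
# First three rows of the table on this page.
sample = [
    (196985846, 6384, 34991568, 1, 58251896, 58258279),
    (196985847, 6730, 34991568, 2, 58258412, 58265141),
    (196985848, 96, 34991568, 3, 58265231, 58265326),
]
conn.executemany("INSERT INTO introns VALUES (?, ?, ?, ?, ?, ?)", sample)

# Same filter as the page (transcript_id = 34991568), keeping only rows
# whose stored length disagrees with the inclusive coordinate span.
mismatches = conn.execute(
    'SELECT id FROM introns WHERE transcript_id = ? '
    'AND "length" != "end" - "start" + 1 ORDER BY ordinal_index',
    (34991568,),
).fetchall()

print(mismatches)
```

The column names are quoted in the query because END is an SQL keyword.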
CREATE TABLE "introns" (
    "id" INTEGER,
    "dinucleotide_pair" TEXT,
    "is_minor" INTEGER,
    "score" REAL,
    "length" INTEGER,
    "transcript_id" INTEGER,
    "ordinal_index" INTEGER,
    "start" INTEGER,
    "end" INTEGER,
    "taxonomy_id" INTEGER,
    "scored_motifs" TEXT,
    "phase" INTEGER,
    "in_cds" INTEGER,
    "relative_position" REAL,
    PRIMARY KEY ([id]),
    FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
    FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
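To see whether a filter can use one of these indexes, SQLite's EXPLAIN QUERY PLAN can be run against the query. This is a minimal sketch using Python's built-in sqlite3 module, assuming an empty in-memory table with a reduced set of columns and only the transcript_id index. The exact wording of the plan varies between SQLite versions, so the sketch only looks for the index name.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Reduced schema plus one of the indexes defined above.
conn.executescript(
    """
    CREATE TABLE introns (
        "id" INTEGER PRIMARY KEY,
        "transcript_id" INTEGER,
        "is_minor" INTEGER
    );
    CREATE INDEX [idx_introns_transcript_id]
        ON [introns] ([transcript_id]);
    """
)

# Ask SQLite how it would run the per-transcript filter used on this page.
plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT id FROM introns WHERE transcript_id = ?",
    (34991568,),
).fetchall()

# The human-readable step description is the fourth column of each plan row.
plan_text = " ".join(row[3] for row in plan)
uses_index = "idx_introns_transcript_id" in plan_text

print(plan_text)
print(uses_index)
```

The same check works for the other indexed columns (taxonomy_id, phase, is_minor, dinucleotide_pair, score, in_cds) by creating the matching index and changing the WHERE clause.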