introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron (e.g., GT-AG)
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
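As an illustration of the column definitions above, the following minimal Python sketch models one row as a typed record and checks the value ranges the descriptions state. The class name `Intron` and the method `is_plausible` are hypothetical names introduced here, not part of the database; the example values are copied from the first row of the table on this page.

```python
from dataclasses import dataclass, replace
from typing import Optional


@dataclass(frozen=True)
class Intron:
    """One row of the introns table (hypothetical helper, not part of the database)."""
    id: int
    dinucleotide_pair: str
    is_minor: int            # 1 = minor intron, 0 = not
    score: float             # probability of being minor, 0-100 (%)
    length: int              # base pairs
    transcript_id: int
    ordinal_index: int       # 1 for the first intron, 2 for the second, ...
    start: int
    end: int
    taxonomy_id: int
    scored_motifs: str
    phase: Optional[int]     # 0, 1, 2, or None outside the coding sequence
    in_cds: int              # 1 = within coding sequence, 0 = not
    relative_position: float

    def is_plausible(self) -> bool:
        """Check only the value ranges given in the column descriptions."""
        return (
            self.is_minor in (0, 1)
            and self.in_cds in (0, 1)
            and self.phase in (0, 1, 2, None)
            and 0.0 <= self.score <= 100.0
            and self.ordinal_index >= 1
            and self.length >= 0
        )


# Values taken from the first row listed for transcript_id = 15198714.
example = Intron(
    id=82237933, dinucleotide_pair="GT-AG", is_minor=0,
    score=1.000000099473604e-05, length=406, transcript_id=15198714,
    ordinal_index=1, start=30419678, end=30420083, taxonomy_id=48883,
    scored_motifs="CCC|GTGAGCCAGG...CCCGCCTGATCC/CCCCGCCTGATC...CGCAG|CGG",
    phase=1, in_cds=1, relative_position=1.472,
)

# A deliberately invalid copy: phase 3 is outside the documented 0/1/2/null set.
bad = replace(example, phase=3)
```

Here `example.is_plausible()` holds for the row as listed, while `bad.is_plausible()` does not, because of the out-of-range phase.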
30 rows where transcript_id = 15198714
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 82237933 | GT-AG | 0 | 1.000000099473604e-05 | 406 | rna-XM_031056068.1 15198714 | 1 | 30419678 | 30420083 | Geospiza fortis 48883 | CCC\|GTGAGCCAGG...CCCGCCTGATCC/CCCCGCCTGATC...CGCAG\|CGG | 1 | 1 | 1.472 |
| 82237934 | GT-AG | 0 | 1.000000099473604e-05 | 313 | rna-XM_031056068.1 15198714 | 2 | 30419245 | 30419557 | Geospiza fortis 48883 | GTG\|GTGAGTGGTA...TCTGCCATAGCA/GTCCTGGTGACT...TGCAG\|ATG | 1 | 1 | 3.414 |
| 82237935 | GT-AG | 0 | 8.629253227629019e-05 | 79 | rna-XM_031056068.1 15198714 | 3 | 30419046 | 30419124 | Geospiza fortis 48883 | GTG\|GTATGTACTC...GGCTGCTGAGTG/AGGCTGCTGAGT...CTCAG\|TGT | 1 | 1 | 5.356 |
| 82237936 | GT-AG | 0 | 1.000000099473604e-05 | 226 | rna-XM_031056068.1 15198714 | 4 | 30418592 | 30418817 | Geospiza fortis 48883 | GCG\|GTAAGCAAGA...TCTGCCCTGCTG/GCCCTGCTGACA...CATAG\|CTC | 1 | 1 | 9.045 |
| 82237937 | GT-AG | 0 | 1.6942275177711565e-05 | 92 | rna-XM_031056068.1 15198714 | 5 | 30418386 | 30418477 | Geospiza fortis 48883 | GTG\|GTAAGTTACA...TTACCCTTGGCT/TCCTGCTTCATG...CACAG\|GTC | 1 | 1 | 10.89 |
| 82237938 | GT-AG | 0 | 1.000000099473604e-05 | 299 | rna-XM_031056068.1 15198714 | 6 | 30417967 | 30418265 | Geospiza fortis 48883 | CTG\|GTAAGGGGAG...CTGACCTCATCT/ACTGACCTCATC...TGCAG\|ATG | 1 | 1 | 12.832 |
| 82237939 | GT-AG | 0 | 1.000000099473604e-05 | 117 | rna-XM_031056068.1 15198714 | 7 | 30417724 | 30417840 | Geospiza fortis 48883 | CAG\|GTAGGGAGAG...TTCCTCTCACCC/TTTCCTCTCACC...GGCAG\|ACA | 1 | 1 | 14.871 |
| 82237940 | GT-AG | 0 | 1.000000099473604e-05 | 154 | rna-XM_031056068.1 15198714 | 8 | 30417363 | 30417516 | Geospiza fortis 48883 | CAG\|GTGAGCAGGA...TGTGCCTTAAAA/GTGTGCCTTAAA...TACAG\|ATG | 1 | 1 | 18.22 |
| 82237941 | GT-AG | 0 | 1.000000099473604e-05 | 135 | rna-XM_031056068.1 15198714 | 9 | 30417059 | 30417193 | Geospiza fortis 48883 | AAG\|GTGAGCAGTG...CTGGTGTTATCC/TGGGTGCTGAGT...CCCAG\|GTC | 2 | 1 | 20.955 |
| 82237942 | GT-AG | 0 | 1.000000099473604e-05 | 210 | rna-XM_031056068.1 15198714 | 10 | 30416680 | 30416889 | Geospiza fortis 48883 | GAG\|GTAGAGATGA...CGTGCCTTGCCT/GGATGCCACATG...CCCAG\|TCC | 0 | 1 | 23.689 |
| 82237943 | GT-AG | 0 | 1.000000099473604e-05 | 319 | rna-XM_031056068.1 15198714 | 11 | 30416135 | 30416453 | Geospiza fortis 48883 | CCA\|GTGAGTACCT...GAGCTTTTAACA/GAGCTTTTAACA...CCCAG\|GGG | 1 | 1 | 27.346 |
| 82237944 | GT-AG | 0 | 1.000000099473604e-05 | 194 | rna-XM_031056068.1 15198714 | 12 | 30415815 | 30416008 | Geospiza fortis 48883 | CAG\|GTGAGCAGGG...CTGTCCTTATGC/GCTGTGCTCACA...TGCAG\|AGC | 1 | 1 | 29.385 |
| 82237945 | GT-AG | 0 | 1.000000099473604e-05 | 233 | rna-XM_031056068.1 15198714 | 13 | 30415366 | 30415598 | Geospiza fortis 48883 | CAG\|GTGAGACCCT...AACTCCTTGTCT/GACAGCCTCATA...CCAAG\|TCT | 1 | 1 | 32.88 |
| 82237946 | GT-AG | 0 | 1.000000099473604e-05 | 234 | rna-XM_031056068.1 15198714 | 14 | 30414823 | 30415056 | Geospiza fortis 48883 | TGG\|GTGAGCCCTG...CAGGTCCTACCA/CCAGGTCCTACC...CACAG\|GCC | 1 | 1 | 37.88 |
| 82237947 | GT-AG | 0 | 1.000000099473604e-05 | 178 | rna-XM_031056068.1 15198714 | 15 | 30414432 | 30414609 | Geospiza fortis 48883 | CTG\|GTAAGAGTGG...GGTGCTTGGAAT/ATGGTCCCAACA...GCCAG\|TGC | 1 | 1 | 41.327 |
| 82237948 | GT-AG | 0 | 1.000000099473604e-05 | 354 | rna-XM_031056068.1 15198714 | 16 | 30413499 | 30413852 | Geospiza fortis 48883 | CAG\|GTAGGAGCCA...AGCCCCTTGTCT/CCAGTGCAAAGC...CCCAG\|GGG | 1 | 1 | 50.696 |
| 82237949 | GT-AG | 0 | 1.000000099473604e-05 | 158 | rna-XM_031056068.1 15198714 | 17 | 30413125 | 30413282 | Geospiza fortis 48883 | GTG\|GTAAGACCCT...CTGGCCTTTGCC/CCCCTGCTCAGC...CACAG\|TGC | 1 | 1 | 54.191 |
| 82237950 | GT-AG | 0 | 1.000000099473604e-05 | 82 | rna-XM_031056068.1 15198714 | 18 | 30412804 | 30412885 | Geospiza fortis 48883 | CAG\|GTGAGTGGAG...GCCTTTTTCACC/GCCTTTTTCACC...TGCAG\|GGC | 0 | 1 | 58.058 |
| 82237951 | GT-AG | 0 | 1.000000099473604e-05 | 334 | rna-XM_031056068.1 15198714 | 19 | 30412340 | 30412673 | Geospiza fortis 48883 | GCT\|GTGAGCAACA...GCAGCCTTCCCC/ATGAGGCTCAGG...TGCAG\|GCC | 1 | 1 | 60.162 |
| 82237952 | GT-AG | 0 | 1.000000099473604e-05 | 107 | rna-XM_031056068.1 15198714 | 20 | 30412054 | 30412160 | Geospiza fortis 48883 | CCT\|GTGAGTGCCA...GGTTCCTTTCCC/GGGCAGCTGATG...CCCAG\|CGG | 0 | 1 | 63.058 |
| 82237953 | GT-AG | 0 | 1.000000099473604e-05 | 239 | rna-XM_031056068.1 15198714 | 21 | 30411628 | 30411866 | Geospiza fortis 48883 | CAG\|GTAAGTCCTG...GGACCATTAGTG/TTAGTGCTGAGC...CCCAG\|GAC | 1 | 1 | 66.084 |
| 82237954 | GT-AG | 0 | 1.000000099473604e-05 | 230 | rna-XM_031056068.1 15198714 | 22 | 30411179 | 30411408 | Geospiza fortis 48883 | AAG\|GTGGGTTGTG...GCCTCTTGGATA/GCAGCTGTCACC...TTCAG\|GTG | 1 | 1 | 69.628 |
| 82237955 | GT-AG | 0 | 1.000000099473604e-05 | 290 | rna-XM_031056068.1 15198714 | 23 | 30410705 | 30410994 | Geospiza fortis 48883 | AGG\|GTAAGTCTGG...GATGCTTTGTCA/CATAGGCTCATA...ACTAG\|GCT | 2 | 1 | 72.605 |
| 82237956 | GT-AG | 0 | 1.000000099473604e-05 | 68 | rna-XM_031056068.1 15198714 | 24 | 30410435 | 30410502 | Geospiza fortis 48883 | AAG\|GTGTGGTGGG...GGGGCTCTGATG/TGGGCACTGACA...GACAG\|GTC | 0 | 1 | 75.874 |
| 82237957 | GT-AG | 0 | 0.0027543534809972 | 287 | rna-XM_031056068.1 15198714 | 25 | 30409983 | 30410269 | Geospiza fortis 48883 | CAG\|GTACCTTCTT...TCAGCTCTGCCG/CGGGTGCTGACA...CACAG\|GTG | 0 | 1 | 78.544 |
| 82237958 | GT-AG | 0 | 1.000000099473604e-05 | 92 | rna-XM_031056068.1 15198714 | 26 | 30409690 | 30409781 | Geospiza fortis 48883 | CAG\|GTACTGTCCT...CACATCCTAGTG/GGGGTCCACATC...TCCAG\|GTC | 0 | 1 | 81.796 |
| 82237959 | GT-AG | 0 | 1.000000099473604e-05 | 148 | rna-XM_031056068.1 15198714 | 27 | 30408998 | 30409145 | Geospiza fortis 48883 | ACG\|GTGAGTGAGG...AGCCTCTCGACC/TGTTGCCGCACT...TGCAG\|AGA | 1 | 1 | 90.599 |
| 82237960 | GT-AG | 0 | 1.000000099473604e-05 | 188 | rna-XM_031056068.1 15198714 | 28 | 30408681 | 30408868 | Geospiza fortis 48883 | ATG\|GTAGGTCTGT...GCACCCTTGTGG/GCAGTGCTGACC...TCCAG\|GCT | 1 | 1 | 92.686 |
| 82237961 | GT-AG | 0 | 1.000000099473604e-05 | 87 | rna-XM_031056068.1 15198714 | 29 | 30408486 | 30408572 | Geospiza fortis 48883 | AAG\|GTGGGTCTGG...TGTGCCTTGCGA/GCCATGCTAACC...GGCAG\|GCA | 1 | 1 | 94.434 |
| 82237962 | GT-AG | 0 | 1.000000099473604e-05 | 123 | rna-XM_031056068.1 15198714 | 30 | 30408126 | 30408248 | Geospiza fortis 48883 | ACT\|GTGAGTGCCT...CTCACCTGAGTC/TCTCTGCTGATC...TCCAG\|GTG | 1 | 1 | 98.269 |
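Two relationships can be read off the rows above, and the Python sketch below checks them on a few rows. In these rows, `length` equals `end - start + 1` (inclusive coordinates), and `dinucleotide_pair` matches the first and last two bases of the text between the first and last `|` in `scored_motifs`. The second point is an assumption about the `scored_motifs` layout inferred from the listed rows, not a documented format. The function names `span_length` and `terminal_dinucleotides` are hypothetical.

```python
def span_length(start: int, end: int) -> int:
    """Length in base pairs, treating start and end as inclusive coordinates."""
    return end - start + 1


def terminal_dinucleotides(scored_motifs: str) -> str:
    """Return e.g. 'GT-AG' from a scored_motifs string.

    Assumption (inferred from the rows on this page, not documented):
    the text between the first and the last '|' is intronic sequence,
    flanked by three exonic bases on each side.
    """
    first = scored_motifs.index("|")
    last = scored_motifs.rindex("|")
    intronic = scored_motifs[first + 1:last]
    return f"{intronic[:2]}-{intronic[-2:]}"


# (start, end, length, dinucleotide_pair, scored_motifs) copied from rows 1, 3 and 30.
rows = [
    (30419678, 30420083, 406, "GT-AG",
     "CCC|GTGAGCCAGG...CCCGCCTGATCC/CCCCGCCTGATC...CGCAG|CGG"),
    (30419046, 30419124, 79, "GT-AG",
     "GTG|GTATGTACTC...GGCTGCTGAGTG/AGGCTGCTGAGT...CTCAG|TGT"),
    (30408126, 30408248, 123, "GT-AG",
     "ACT|GTGAGTGCCT...CTCACCTGAGTC/TCTCTGCTGATC...TCCAG|GTG"),
]

# True for each row if both relationships hold.
checks = [
    span_length(start, end) == length
    and terminal_dinucleotides(motifs) == pair
    for start, end, length, pair, motifs in rows
]
```

When working from a CSV export rather than this rendered table, the `|` characters in `scored_motifs` appear unescaped, which is the form the sketch expects.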
CREATE TABLE "introns" (
    "id" INTEGER,
    "dinucleotide_pair" TEXT,
    "is_minor" INTEGER,
    "score" REAL,
    "length" INTEGER,
    "transcript_id" INTEGER,
    "ordinal_index" INTEGER,
    "start" INTEGER,
    "end" INTEGER,
    "taxonomy_id" INTEGER,
    "scored_motifs" TEXT,
    "phase" INTEGER,
    "in_cds" INTEGER,
    "relative_position" REAL,
    PRIMARY KEY ([id]),
    FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
    FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
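The schema and indexes above can be tried locally with Python's standard `sqlite3` module. The sketch below is a minimal illustration, not the database's own tooling: it builds an in-memory table with only a subset of the columns (an assumption made to keep the example short), loads three rows copied from this page, and asks SQLite how it would run the `transcript_id = 15198714` filter.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Reduced copy of the introns schema: only the columns this example needs.
conn.executescript(
    """
    CREATE TABLE introns (
        id INTEGER PRIMARY KEY,
        transcript_id INTEGER,
        ordinal_index INTEGER,
        length INTEGER,
        phase INTEGER
    );
    CREATE INDEX idx_introns_transcript_id ON introns (transcript_id);
    """
)

# First three introns of transcript 15198714, as listed on this page.
conn.executemany(
    "INSERT INTO introns VALUES (?, ?, ?, ?, ?)",
    [
        (82237933, 15198714, 1, 406, 1),
        (82237934, 15198714, 2, 313, 1),
        (82237935, 15198714, 3, 79, 1),
    ],
)

# Ask the query planner how it would execute the transcript filter.
plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT id, length FROM introns WHERE transcript_id = ?",
    (15198714,),
).fetchall()
plan_text = " ".join(str(row[-1]) for row in plan)

# The same filter, returning intron lengths in transcript order.
lengths = [
    row[0]
    for row in conn.execute(
        "SELECT length FROM introns WHERE transcript_id = ? ORDER BY ordinal_index",
        (15198714,),
    )
]
```

The plan text should name `idx_introns_transcript_id`, which indicates the lookup goes through the index rather than scanning the whole table. The exact wording of the plan varies between SQLite versions.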