introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron (e.g., GT-AG)
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, probability (0-100%) that the intron is a minor intron
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; NULL for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
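The column descriptions imply some relationships between fields that can be checked on the sample rows listed on this page. The sketch below is illustrative only: it assumes that `start` and `end` are inclusive coordinates (so `length = end - start + 1`) and that `scored_motifs` places the intron between its first and last `|` characters. Both assumptions hold for the three rows copied here, but they are not stated by the source. The helper names `terminal_dinucleotides` and `inclusive_length` are hypothetical.

```python
# Values copied from three of the rows shown for transcript_id = 34437905.
# Tuple layout: (id, dinucleotide_pair, length, start, end, scored_motifs)
SAMPLE_ROWS = [
    (193125175, "GT-AG", 373, 8604, 8976,
     "ACG|GTGCGGTGAG...CATGTGTTGACG/CATGTGTTGACG...CGCAG|GTG"),
    (193125177, "GC-AG", 269, 7906, 8174,
     "GAG|GCCCTGCCCA...ATTTGCTTCTCC/TAGTCTGTCAAC...AGCAG|GTC"),
    (193125183, "GA-AG", 312, 5834, 6145,
     "ATG|GAGCGAAAGC...TACTCTGAGACT/CCTACTCTGAGA...GCTAG|GAG"),
]


def terminal_dinucleotides(scored_motifs: str) -> str:
    """Return 'XX-YY' from the two bases after the first '|' and before the last '|'.

    Assumption: the intron sequence lies between the first and last '|'.
    """
    first_bar = scored_motifs.index("|")
    last_bar = scored_motifs.rindex("|")
    five_prime = scored_motifs[first_bar + 1:first_bar + 3]
    three_prime = scored_motifs[last_bar - 2:last_bar]
    return f"{five_prime}-{three_prime}"


def inclusive_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumed coordinate convention)."""
    return end - start + 1


# One (pair_matches, length_matches) tuple per sample row.
checks = [
    (terminal_dinucleotides(motifs) == pair, inclusive_length(start, end) == length)
    for _id, pair, length, start, end, motifs in SAMPLE_ROWS
]
print(checks)
```

A check like this is a cheap sanity test to run after exporting the table, before relying on either column in downstream analysis.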
16 rows where transcript_id = 34437905
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 193125175 | GT-AG | 0 | 1.000000099473604e-05 | 373 | rna-SMICCKB8_LOCUS77 34437905 | 1 | 8604 | 8976 | Symbiodinium sp. kb8 230985 | ACG\|GTGCGGTGAG...CATGTGTTGACG/CATGTGTTGACG...CGCAG\|GTG | 0 | 1 | 3.135 |
| 193125176 | GT-AG | 0 | 1.000000099473604e-05 | 252 | rna-SMICCKB8_LOCUS77 34437905 | 2 | 8249 | 8500 | Symbiodinium sp. kb8 230985 | TGG\|GTGCAGGAGA...AAGACCATTGCG/GATATGCAAACA...TCCAG\|GAG | 1 | 1 | 13.898 |
| 193125177 | GC-AG | 0 | 1.000000099473604e-05 | 269 | rna-SMICCKB8_LOCUS77 34437905 | 3 | 7906 | 8174 | Symbiodinium sp. kb8 230985 | GAG\|GCCCTGCCCA...ATTTGCTTCTCC/TAGTCTGTCAAC...AGCAG\|GTC | 0 | 1 | 21.63 |
| 193125178 | GC-AG | 0 | 1.000000099473604e-05 | 754 | rna-SMICCKB8_LOCUS77 34437905 | 4 | 7107 | 7860 | Symbiodinium sp. kb8 230985 | GAG\|GCTACGCCGT...ATGCTCCTGACT/ATGCTCCTGACT...TCCAG\|GCA | 0 | 1 | 26.332 |
| 193125179 | GC-AG | 0 | 1.000000099473604e-05 | 340 | rna-SMICCKB8_LOCUS77 34437905 | 5 | 6697 | 7036 | Symbiodinium sp. kb8 230985 | GAG\|GCGGCTTTCT...AGCTTCGTATAG/TATAGCCGCATA...GTGAG\|GCT | 1 | 1 | 33.647 |
| 193125180 | GT-AG | 0 | 1.000000099473604e-05 | 112 | rna-SMICCKB8_LOCUS77 34437905 | 6 | 6520 | 6631 | Symbiodinium sp. kb8 230985 | GAG\|GTAAGACTTT...TCAGTCTTAGCA/TTCAGTCTTAGC...GGCAG\|GCA | 0 | 1 | 40.439 |
| 193125181 | GT-AG | 0 | 0.0058715991072646 | 81 | rna-SMICCKB8_LOCUS77 34437905 | 7 | 6369 | 6449 | Symbiodinium sp. kb8 230985 | TGA\|GTGCCTTGTC...ATCTACTTGTTT/CTGGTAGTAAAG...GCAAG\|GTG | 1 | 1 | 47.753 |
| 193125182 | GC-AG | 0 | 1.000000099473604e-05 | 150 | rna-SMICCKB8_LOCUS77 34437905 | 8 | 6190 | 6339 | Symbiodinium sp. kb8 230985 | TCG\|GCCGCTTCGT...GCGGCGCTGACA/GCGGCGCTGACA...GCAAG\|GAT | 0 | 1 | 50.784 |
| 193125183 | GA-AG | 0 | 1.000000099473604e-05 | 312 | rna-SMICCKB8_LOCUS77 34437905 | 9 | 5834 | 6145 | Symbiodinium sp. kb8 230985 | ATG\|GAGCGAAAGC...TACTCTGAGACT/CCTACTCTGAGA...GCTAG\|GAG | 2 | 1 | 55.381 |
| 193125184 | GC-AG | 0 | 1.000000099473604e-05 | 271 | rna-SMICCKB8_LOCUS77 34437905 | 10 | 5530 | 5800 | Symbiodinium sp. kb8 230985 | AGG\|GCGCCTGGAC...GAACCCTAAACC/AAAATCCTAACA...CTCAG\|GCT | 2 | 1 | 58.83 |
| 193125185 | GC-AG | 0 | 1.000000099473604e-05 | 365 | rna-SMICCKB8_LOCUS77 34437905 | 11 | 5091 | 5455 | Symbiodinium sp. kb8 230985 | CAG\|GCAGGACGGC...TGCACCTTGAGT/TGCACCTTGAGT...CGCAG\|GTG | 1 | 1 | 66.562 |
| 193125186 | GC-AG | 0 | 2.231487920574637e-05 | 1178 | rna-SMICCKB8_LOCUS77 34437905 | 12 | 3891 | 5068 | Symbiodinium sp. kb8 230985 | GAA\|GCCCCTTGAT...CCACATTTAAAT/CCACATTTAAAT...CCGAG\|GAT | 2 | 1 | 68.861 |
| 193125187 | GT-AG | 0 | 1.000000099473604e-05 | 492 | rna-SMICCKB8_LOCUS77 34437905 | 13 | 3342 | 3833 | Symbiodinium sp. kb8 230985 | GAT\|GTCGGGGCCT...ACACTACTAACG/ACACTACTAACG...CTAAG\|GTC | 2 | 1 | 74.817 |
| 193125188 | GC-AG | 0 | 1.000000099473604e-05 | 332 | rna-SMICCKB8_LOCUS77 34437905 | 14 | 2954 | 3285 | Symbiodinium sp. kb8 230985 | CAG\|GCTGGTGGTG...ACCACCTTTGCT/CAAATCCTAAGA...GCGAG\|GCA | 1 | 1 | 80.669 |
| 193125189 | GC-AG | 0 | 7.22133836953343e-05 | 1000 | rna-SMICCKB8_LOCUS77 34437905 | 15 | 1883 | 2882 | Symbiodinium sp. kb8 230985 | TGC\|GCTCCACGGC...CTACCCGTGACA/CTACCCGTGACA...GCGAG\|GCT | 0 | 1 | 88.088 |
| 193125190 | GT-AG | 0 | 4.016836332782789e-05 | 271 | rna-SMICCKB8_LOCUS77 34437905 | 16 | 1531 | 1801 | Symbiodinium sp. kb8 230985 | GAA\|GTGACTGGGC...TCCTTCTTAGTT/CTCCTTCTTAGT...TAGAG\|GTC | 0 | 1 | 96.552 |
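The 16 rows for this transcript can be summarized with ordinary SQL aggregates. The following is a minimal sketch, not the database's own tooling: it loads four columns from the rows above into a throwaway in-memory SQLite table (a reduced, hypothetical version of `introns`) and counts introns per `dinucleotide_pair` and per `phase`.

```python
import sqlite3

TRANSCRIPT_ID = 34437905

# (id, dinucleotide_pair, ordinal_index, phase), copied from the 16 rows above.
ROWS = [
    (193125175, "GT-AG", 1, 0), (193125176, "GT-AG", 2, 1),
    (193125177, "GC-AG", 3, 0), (193125178, "GC-AG", 4, 0),
    (193125179, "GC-AG", 5, 1), (193125180, "GT-AG", 6, 0),
    (193125181, "GT-AG", 7, 1), (193125182, "GC-AG", 8, 0),
    (193125183, "GA-AG", 9, 2), (193125184, "GC-AG", 10, 2),
    (193125185, "GC-AG", 11, 1), (193125186, "GC-AG", 12, 2),
    (193125187, "GT-AG", 13, 2), (193125188, "GC-AG", 14, 1),
    (193125189, "GC-AG", 15, 0), (193125190, "GT-AG", 16, 0),
]

conn = sqlite3.connect(":memory:")
# Reduced table for illustration; the real table has more columns.
conn.execute(
    "CREATE TABLE introns (id INTEGER PRIMARY KEY, dinucleotide_pair TEXT, "
    "ordinal_index INTEGER, phase INTEGER, transcript_id INTEGER)"
)
conn.executemany(
    "INSERT INTO introns (id, dinucleotide_pair, ordinal_index, phase, transcript_id) "
    f"VALUES (?, ?, ?, ?, {TRANSCRIPT_ID})",
    ROWS,
)

# Introns per terminal dinucleotide pair, most frequent first.
pair_counts = conn.execute(
    "SELECT dinucleotide_pair, COUNT(*) FROM introns "
    "WHERE transcript_id = ? GROUP BY dinucleotide_pair ORDER BY COUNT(*) DESC",
    (TRANSCRIPT_ID,),
).fetchall()

# Introns per phase, in phase order.
phase_counts = conn.execute(
    "SELECT phase, COUNT(*) FROM introns "
    "WHERE transcript_id = ? GROUP BY phase ORDER BY phase",
    (TRANSCRIPT_ID,),
).fetchall()

print(pair_counts)   # [('GC-AG', 9), ('GT-AG', 6), ('GA-AG', 1)]
print(phase_counts)  # [(0, 7), (1, 5), (2, 4)]
```

The same two `SELECT` statements should work unchanged against the full `introns` table, since they only use columns defined in its schema.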
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
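To illustrate why the `transcript_id` index matters for a filter such as `transcript_id = 34437905`, the sketch below recreates the table and that one index in an empty in-memory SQLite database and asks SQLite for its query plan. It is an illustration under stated assumptions: the `transcripts` and `genomes` tables are not created (SQLite accepts the foreign key declarations without them while foreign key enforcement is off, which is its default), and the exact wording of the plan text varies between SQLite versions.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE "introns" (
      "id" INTEGER,
      "dinucleotide_pair" TEXT,
      "is_minor" INTEGER,
      "score" REAL,
      "length" INTEGER,
      "transcript_id" INTEGER,
      "ordinal_index" INTEGER,
      "start" INTEGER,
      "end" INTEGER,
      "taxonomy_id" INTEGER,
      "scored_motifs" TEXT,
      "phase" INTEGER,
      "in_cds" INTEGER,
      "relative_position" REAL,
      PRIMARY KEY ([id]),
      FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
      FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
    );
    CREATE INDEX [idx_introns_transcript_id]
      ON [introns] ([transcript_id]);
    """
)

# Ask SQLite how it would execute the per-transcript lookup.
plan_rows = conn.execute(
    "EXPLAIN QUERY PLAN "
    "SELECT * FROM introns WHERE transcript_id = ? ORDER BY ordinal_index",
    (34437905,),
).fetchall()

# The last column of each plan row is the human-readable detail string.
plan_text = " | ".join(row[-1] for row in plan_rows)
print(plan_text)
```

If the printed plan mentions `idx_introns_transcript_id`, the lookup is an index search rather than a full table scan. The `ORDER BY ordinal_index` is not covered by that index, so SQLite may additionally report a temporary sort step.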