introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in coding sequence (0, 1, or 2; null for introns outside of coding sequence)
- in_cds: INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
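The column descriptions imply two relationships that can be checked against rows shown on this page: `length` appears to equal `end - start + 1` (inclusive coordinates), and `dinucleotide_pair` appears to match the first and last two intron bases in `scored_motifs`. The sketch below tests both on three rows copied from the table. The helper names `terminal_dinucleotides` and `inclusive_length` are hypothetical, and the reading of the `|` characters as exon/intron boundaries is an assumption inferred from the sampled rows, not something the page documents.

```python
# Three rows copied from the table on this page:
# (dinucleotide_pair, length, start, end, scored_motifs)
rows = [
    ("GT-AG", 329, 23708, 24036,
     "TGT|GTTCGTGATC...GTGTTTTTGGTC/GGTCTCGTCAAC...CCAAG|GCC"),
    ("GC-AG", 233, 24618, 24850,
     "CAG|GCCAGTTGGG...GTGACCTTGAAG/TGAAGCTTTACA...GACAG|ATC"),
    ("GA-AG", 230, 26461, 26690,
     "GAG|GACGGGGACG...TGCTCGGTCGCA/GGAAGATGCATT...ATCAG|GAG"),
]


def terminal_dinucleotides(scored_motifs: str) -> str:
    """Hypothetical helper: first and last two bases of the intron.

    Assumes the first '|' is the 5' exon/intron boundary and the last '|'
    is the 3' intron/exon boundary (inferred from sampled rows only).
    """
    _, rest = scored_motifs.split("|", 1)
    intron, _ = rest.rsplit("|", 1)
    return f"{intron[:2]}-{intron[-2:]}"


def inclusive_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] in base pairs."""
    return end - start + 1


length_ok = [inclusive_length(s, e) == n for _, n, s, e, _ in rows]
pair_ok = [terminal_dinucleotides(m) == pair for pair, _, _, _, m in rows]

print("length matches end - start + 1:", all(length_ok))
print("dinucleotide_pair matches scored_motifs:", all(pair_ok))
```

Both checks hold for these three rows; whether they hold across the whole table is not something this page establishes.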
15 rows where transcript_id = 34437901
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 193125097 | GT-AG | 0 | 1.000000099473604e-05 | 329 | rna-SMICCKB8_LOCUS71 34437901 | 1 | 23708 | 24036 | Symbiodinium sp. kb8 230985 | TGT\|GTTCGTGATC...GTGTTTTTGGTC/GGTCTCGTCAAC...CCAAG\|GCC | 0 | 1 | 17.625 |
| 193125098 | GT-AG | 0 | 1.000000099473604e-05 | 485 | rna-SMICCKB8_LOCUS71 34437901 | 2 | 24076 | 24560 | Symbiodinium sp. kb8 230985 | GCG\|GTCAGGGTCC...AGACCCTGAACC/ACTACATTCACG...CCAAG\|GTT | 0 | 1 | 22.605 |
| 193125099 | GC-AG | 0 | 1.000000099473604e-05 | 233 | rna-SMICCKB8_LOCUS71 34437901 | 3 | 24618 | 24850 | Symbiodinium sp. kb8 230985 | CAG\|GCCAGTTGGG...GTGACCTTGAAG/TGAAGCTTTACA...GACAG\|ATC | 0 | 1 | 29.885 |
| 193125100 | GC-AG | 0 | 0.0016368918520904 | 210 | rna-SMICCKB8_LOCUS71 34437901 | 4 | 24883 | 25092 | Symbiodinium sp. kb8 230985 | CCT\|GCCGCCGACT...TGGACCCTAACC/TGGACCCTAACC...CTGAG\|GCC | 2 | 1 | 33.972 |
| 193125101 | GC-AG | 0 | 2.7276413329524257e-05 | 564 | rna-SMICCKB8_LOCUS71 34437901 | 5 | 25139 | 25702 | Symbiodinium sp. kb8 230985 | GAG\|GCGCCCCGCC...AAATCCCTAATG/AAATCCCTAATG...CCAAG\|ATC | 0 | 1 | 39.847 |
| 193125102 | GT-AG | 0 | 1.9123948029784087e-05 | 201 | rna-SMICCKB8_LOCUS71 34437901 | 6 | 25742 | 25942 | Symbiodinium sp. kb8 230985 | GCA\|GTCCGGTGCT...GTGGCTTTGATC/GTGGCTTTGATC...TGCAG\|GCG | 0 | 1 | 44.828 |
| 193125103 | GC-AG | 0 | 1.000000099473604e-05 | 70 | rna-SMICCKB8_LOCUS71 34437901 | 7 | 25999 | 26068 | Symbiodinium sp. kb8 230985 | CAG\|GCTCGGCCAG...ACCGCTCTGACA/ACCGCTCTGACA...TACAG\|GCT | 2 | 1 | 51.98 |
| 193125104 | GC-AG | 0 | 1.000000099473604e-05 | 70 | rna-SMICCKB8_LOCUS71 34437901 | 8 | 26121 | 26190 | Symbiodinium sp. kb8 230985 | GAG\|GCCGGTTTGA...CGCCTCTGGATG/CATGCTGCTATC...CGCAG\|GTT | 0 | 1 | 58.621 |
| 193125105 | GC-AG | 0 | 1.000000099473604e-05 | 91 | rna-SMICCKB8_LOCUS71 34437901 | 9 | 26215 | 26305 | Symbiodinium sp. kb8 230985 | GCC\|GCTGCGGCCT...GTTTCCGTTGTA/CCATCCCACACC...AACAG\|ATA | 0 | 1 | 61.686 |
| 193125106 | GC-AG | 0 | 1.000000099473604e-05 | 59 | rna-SMICCKB8_LOCUS71 34437901 | 10 | 26360 | 26418 | Symbiodinium sp. kb8 230985 | CAG\|GCGCGACTGA...ATTTCTTTGTCT/GCCCTGTTGATT...GCGAG\|GAC | 0 | 1 | 68.582 |
| 193125107 | GA-AG | 0 | 1.000000099473604e-05 | 230 | rna-SMICCKB8_LOCUS71 34437901 | 11 | 26461 | 26690 | Symbiodinium sp. kb8 230985 | GAG\|GACGGGGACG...TGCTCGGTCGCA/GGAAGATGCATT...ATCAG\|GAG | 0 | 1 | 73.946 |
| 193125108 | GC-AG | 0 | 1.000000099473604e-05 | 119 | rna-SMICCKB8_LOCUS71 34437901 | 12 | 26730 | 26848 | Symbiodinium sp. kb8 230985 | GAG\|GCGATCGGAA...AATCTCTCATCC/CAATCTCTCATC...GGAAG\|GCT | 0 | 1 | 78.927 |
| 193125109 | GC-AG | 0 | 1.000000099473604e-05 | 850 | rna-SMICCKB8_LOCUS71 34437901 | 13 | 26884 | 27733 | Symbiodinium sp. kb8 230985 | CGA\|GCCTGGATCC...CAAGCCTCAACG/TCAAGCCTCAAC...CTAAG\|GGG | 2 | 1 | 83.397 |
| 193125110 | GT-AG | 0 | 1.000000099473604e-05 | 312 | rna-SMICCKB8_LOCUS71 34437901 | 14 | 27805 | 28116 | Symbiodinium sp. kb8 230985 | TCG\|GTGAGCCTGG...TTCTCCTTAAGT/TTTCTCCTTAAG...AGCAG\|GCG | 1 | 1 | 92.465 |
| 193125111 | GT-AG | 0 | 4.961827117557088e-05 | 203 | rna-SMICCKB8_LOCUS71 34437901 | 15 | 28149 | 28351 | Symbiodinium sp. kb8 230985 | CAC\|GTGCCGGGCC...CATACCTTGAAC/TAAAGCCTCAAA...TCGAG\|GAG | 0 | 1 | 96.552 |
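The filtered view above corresponds to a `WHERE transcript_id = 34437901` query, and the same rows can be summarized with a `GROUP BY`. The sketch below is a minimal illustration: it loads only a subset of columns (`id`, `dinucleotide_pair`, `length`, `phase`, `transcript_id`) for the 15 rows into an in-memory SQLite table, which is an assumption made for brevity and not the full schema, then tallies introns by terminal dinucleotide pair.

```python
import sqlite3

TRANSCRIPT_ID = 34437901

# (id, dinucleotide_pair, length, phase) copied from the 15 rows above.
ROWS = [
    (193125097, "GT-AG", 329, 0), (193125098, "GT-AG", 485, 0),
    (193125099, "GC-AG", 233, 0), (193125100, "GC-AG", 210, 2),
    (193125101, "GC-AG", 564, 0), (193125102, "GT-AG", 201, 0),
    (193125103, "GC-AG", 70, 2), (193125104, "GC-AG", 70, 0),
    (193125105, "GC-AG", 91, 0), (193125106, "GC-AG", 59, 0),
    (193125107, "GA-AG", 230, 0), (193125108, "GC-AG", 119, 0),
    (193125109, "GC-AG", 850, 2), (193125110, "GT-AG", 312, 1),
    (193125111, "GT-AG", 203, 0),
]

conn = sqlite3.connect(":memory:")
# Reduced table for illustration only; the real table has more columns.
conn.execute(
    "CREATE TABLE introns ("
    "id INTEGER PRIMARY KEY, dinucleotide_pair TEXT, "
    "length INTEGER, phase INTEGER, transcript_id INTEGER)"
)
conn.executemany(
    "INSERT INTO introns VALUES (?, ?, ?, ?, ?)",
    [(i, pair, length, phase, TRANSCRIPT_ID) for i, pair, length, phase in ROWS],
)

# Count introns per dinucleotide pair, with the shortest and longest length.
summary = conn.execute(
    "SELECT dinucleotide_pair, COUNT(*) AS n, MIN(length), MAX(length) "
    "FROM introns WHERE transcript_id = ? "
    "GROUP BY dinucleotide_pair ORDER BY n DESC",
    (TRANSCRIPT_ID,),
).fetchall()

for pair, n, shortest, longest in summary:
    print(f"{pair}: {n} introns, {shortest}-{longest} bp")
```

For this transcript, the tally shows that GC-AG introns (9 of 15) outnumber the canonical GT-AG introns (5 of 15), with a single GA-AG intron.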
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL,
PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
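The index on `transcript_id` is what should make a per-transcript lookup like the one on this page cheap. One way to confirm that is to ask SQLite for its query plan. The sketch below uses a reduced copy of the schema (a few columns and two of the indexes, with the foreign keys omitted) in an in-memory database; this is an assumption for illustration, and the exact wording of the plan text varies between SQLite versions.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Reduced schema for illustration; column and index names follow the DDL above.
conn.executescript(
    """
    CREATE TABLE introns (
        id INTEGER PRIMARY KEY,
        transcript_id INTEGER,
        is_minor INTEGER,
        score REAL
    );
    CREATE INDEX idx_introns_transcript_id ON introns (transcript_id);
    CREATE INDEX idx_introns_is_minor ON introns (is_minor);
    """
)

# EXPLAIN QUERY PLAN returns rows whose last column is a human-readable detail.
plan = conn.execute(
    "EXPLAIN QUERY PLAN "
    "SELECT id, score FROM introns WHERE transcript_id = ?",
    (34437901,),
).fetchall()
plan_text = " ".join(row[-1] for row in plan)

print(plan_text)
uses_index = "idx_introns_transcript_id" in plan_text
print("uses transcript_id index:", uses_index)
```

If the plan reports a scan of `introns` and does not name the index, the filter is probably not using `transcript_id` in an indexable form.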