introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
23 rows where transcript_id = 31341666
This data as json, CSV (advanced)
Suggested facets: score, length, phase
| id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 174593902 | GT-AG | 0 | 0.0051199267453436 | 58 | rna-NZN594_LOCUS184 31341666 | 1 | 1037790 | 1037847 | Rotaria sp. silwood1 2762511 | CAA|GTATAAATGA...TTTTCTTTATTT/ATTTTCTTTATT...TTTAG|TCT | 0 | 1 | 0.87 |
| 174593903 | GT-AG | 0 | 0.022354772738421 | 47 | rna-NZN594_LOCUS184 31341666 | 2 | 1037651 | 1037697 | Rotaria sp. silwood1 2762511 | GGA|GTATATATTG...ATTTCAATAATG/GAACATTTCAAT...TTTAG|TAA | 2 | 1 | 3.294 |
| 174593904 | GT-AG | 0 | 0.0001332474415255 | 54 | rna-NZN594_LOCUS184 31341666 | 3 | 1037449 | 1037502 | Rotaria sp. silwood1 2762511 | CAG|GTATAATAAG...TCATTTTTATTG/TATTTATTTATT...TTTAG|TTT | 0 | 1 | 7.194 |
| 174593905 | GT-AG | 0 | 1.010609571575978e-05 | 56 | rna-NZN594_LOCUS184 31341666 | 4 | 1037212 | 1037267 | Rotaria sp. silwood1 2762511 | CAG|GTACGTAATG...CTTTCATTATCA/AATTCTTTCATT...TTTAG|ATG | 1 | 1 | 11.963 |
| 174593906 | GT-AG | 0 | 1.000000099473604e-05 | 54 | rna-NZN594_LOCUS184 31341666 | 5 | 1036976 | 1037029 | Rotaria sp. silwood1 2762511 | CAG|GTTTTATTTA...TTAAGATTAATT/TTAAGATTAATT...TACAG|TTA | 0 | 1 | 16.759 |
| 174593907 | GT-AG | 0 | 0.0012274968633817 | 2599 | rna-NZN594_LOCUS184 31341666 | 6 | 1034181 | 1036779 | Rotaria sp. silwood1 2762511 | TCG|GTATGTATAT...AAATCATTGATG/GTTTTTTTTATA...TTTAG|TAC | 1 | 1 | 21.924 |
| 174593908 | GT-AG | 0 | 1.000000099473604e-05 | 58 | rna-NZN594_LOCUS184 31341666 | 7 | 1034028 | 1034085 | Rotaria sp. silwood1 2762511 | GAT|GTAAGAAAAT...AGTGTCTTATTG/TAGTGTCTTATT...AATAG|GGT | 0 | 1 | 24.427 |
| 174593909 | GT-AG | 0 | 1.000000099473604e-05 | 336 | rna-NZN594_LOCUS184 31341666 | 8 | 1033562 | 1033897 | Rotaria sp. silwood1 2762511 | AAT|GTCAGGACAT...CTTTTTTCAGAA/CCTTTTTTCAGA...AATAG|TTA | 1 | 1 | 27.852 |
| 174593910 | GT-AG | 0 | 0.000147444987858 | 60 | rna-NZN594_LOCUS184 31341666 | 9 | 1033391 | 1033450 | Rotaria sp. silwood1 2762511 | GCT|GTAAGTAAAT...TTTTCTTTGATT/TTTTCTTTGATT...TTAAG|CAC | 1 | 1 | 30.777 |
| 174593911 | GT-AG | 0 | 0.0922326505469322 | 59 | rna-NZN594_LOCUS184 31341666 | 10 | 1033090 | 1033148 | Rotaria sp. silwood1 2762511 | TTG|GTATTCATAT...TATTTTTTATCT/GTATTTTTTATC...TTTAG|GGA | 0 | 1 | 37.154 |
| 174593912 | GT-AG | 0 | 1.000000099473604e-05 | 67 | rna-NZN594_LOCUS184 31341666 | 11 | 1032849 | 1032915 | Rotaria sp. silwood1 2762511 | ATG|GTATGAAATT...TCATTTTTACAG/TAATTATTCATT...TCTAG|GAT | 0 | 1 | 41.739 |
| 174593913 | GT-AG | 0 | 0.556719043437114 | 63 | rna-NZN594_LOCUS184 31341666 | 12 | 1032210 | 1032272 | Rotaria sp. silwood1 2762511 | CAA|GTATGTTTGA...AATTTCTTAACT/ATTTTTCTAATT...TAAAG|GTA | 0 | 1 | 56.917 |
| 174593914 | GT-AG | 0 | 1.000000099473604e-05 | 823 | rna-NZN594_LOCUS184 31341666 | 13 | 1031198 | 1032020 | Rotaria sp. silwood1 2762511 | GAG|GTCAGAATTA...TATTTTTTATAA/TTATTTTTTATA...GTTAG|GCA | 0 | 1 | 61.897 |
| 174593915 | GT-AG | 0 | 1.000000099473604e-05 | 59 | rna-NZN594_LOCUS184 31341666 | 14 | 1031107 | 1031165 | Rotaria sp. silwood1 2762511 | AAA|GTAAGTCCAT...TTAACATTGATA/AAATGATTAACA...TTTAG|GTT | 2 | 1 | 62.74 |
| 174593916 | GT-AG | 0 | 3.990270225513384e-05 | 62 | rna-NZN594_LOCUS184 31341666 | 15 | 1030894 | 1030955 | Rotaria sp. silwood1 2762511 | GTG|GTATGTACAC...TCAGCTATATAT/CACATAATGATC...TTTAG|ATA | 0 | 1 | 66.719 |
| 174593917 | GT-AG | 0 | 0.0022354472166562 | 64 | rna-NZN594_LOCUS184 31341666 | 16 | 1030683 | 1030746 | Rotaria sp. silwood1 2762511 | TAT|GTATGTGATA...ATTTCTTTCATT/ATTTCTTTCATT...TTCAG|TTT | 0 | 1 | 70.593 |
| 174593918 | GT-AG | 0 | 0.0011148254175587 | 403 | rna-NZN594_LOCUS184 31341666 | 17 | 1030122 | 1030524 | Rotaria sp. silwood1 2762511 | CAG|GTTTGTTTAT...GTTTTTTTAACT/GTTTTTTTAACT...ATTAG|AAT | 2 | 1 | 74.756 |
| 174593919 | GT-AG | 0 | 0.037431690755034 | 70 | rna-NZN594_LOCUS184 31341666 | 18 | 1029949 | 1030018 | Rotaria sp. silwood1 2762511 | ATG|GTATCAAATA...TTTGTTTTATTT/CTTTGTTTTATT...TATAG|ATT | 0 | 1 | 77.47 |
| 174593920 | GT-AG | 0 | 0.0066999320884284 | 56 | rna-NZN594_LOCUS184 31341666 | 19 | 1029667 | 1029722 | Rotaria sp. silwood1 2762511 | ATA|GTATGTATAT...TTGTTTTTCACT/TTGTTTTTCACT...AAAAG|GTA | 1 | 1 | 83.426 |
| 174593921 | GT-AG | 0 | 0.0001056927767289 | 61 | rna-NZN594_LOCUS184 31341666 | 20 | 1029503 | 1029563 | Rotaria sp. silwood1 2762511 | TAC|GTAAGATTCT...TGTTTTTTATCT/TTGTTTTTTATC...AAAAG|TAA | 2 | 1 | 86.14 |
| 174593922 | GT-AG | 0 | 2.506129321525483e-05 | 55 | rna-NZN594_LOCUS184 31341666 | 21 | 1029242 | 1029296 | Rotaria sp. silwood1 2762511 | TCA|GTAAGAATAA...TATGTCTTAATA/TATGTCTTAATA...TGTAG|AAA | 1 | 1 | 91.568 |
| 174593923 | GT-AG | 0 | 1.000000099473604e-05 | 54 | rna-NZN594_LOCUS184 31341666 | 22 | 1029076 | 1029129 | Rotaria sp. silwood1 2762511 | AAA|GTAAAACGAA...TTTTCATTATTA/GTTTTTTTCATT...TTTAG|ATC | 2 | 1 | 94.519 |
| 174593924 | GT-AG | 0 | 1.000000099473604e-05 | 59 | rna-NZN594_LOCUS184 31341666 | 23 | 1028848 | 1028906 | Rotaria sp. silwood1 2762511 | GAA|GTAAAATATT...AAGACATTAAAA/TAAAAACTAAAT...TTTAG|AAA | 0 | 1 | 98.972 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL
,PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);