introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
22 rows where transcript_id = 24003294
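The filter above can be reproduced against a local copy of the table. The following is a minimal sketch using Python's standard `sqlite3` module; the in-memory database, the reduced column set, and the helper name `introns_for_transcript` are assumptions made for illustration, and only three of the 22 rows are loaded.

```python
import sqlite3

# Hypothetical in-memory stand-in for the real database: a subset of the
# columns and three of the rows listed on this page.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE introns ("
    "id INTEGER PRIMARY KEY, transcript_id INTEGER, "
    "ordinal_index INTEGER, length INTEGER, phase INTEGER)"
)
conn.executemany(
    "INSERT INTO introns VALUES (?, ?, ?, ?, ?)",
    [
        (130621143, 24003294, 3, 1236, 0),
        (130621141, 24003294, 1, 3559, 0),
        (130621142, 24003294, 2, 560, 1),
    ],
)


def introns_for_transcript(conn, transcript_id):
    """Return (ordinal_index, length, phase) tuples in transcript order."""
    cur = conn.execute(
        "SELECT ordinal_index, length, phase FROM introns "
        "WHERE transcript_id = ? ORDER BY ordinal_index",
        (transcript_id,),  # bound parameter, never string-formatted
    )
    return cur.fetchall()


rows = introns_for_transcript(conn, 24003294)
```

Ordering by `ordinal_index` rather than `id` keeps the result in transcript order even if ids were not assigned sequentially.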
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 130621141 | GT-AG | 0 | 3.721860310719691e-05 | 3559 | rna-XM_026053543.1 24003294 | 1 | 5423131 | 5426689 | Nothoprocta perdicaria 30464 | CAG\|GTACGCCAAA...TGGTCCTTTTCA/ATTTTGTTTATA...TTCAG\|CTA | 0 | 1 | 5.258 |
| 130621142 | GT-AG | 0 | 1.000000099473604e-05 | 560 | rna-XM_026053543.1 24003294 | 2 | 5426862 | 5427421 | Nothoprocta perdicaria 30464 | CTG\|GTAAGTACAA...TGTTGTGTGATT/TGTTGTGTGATT...GTCAG\|CTG | 1 | 1 | 11.055 |
| 130621143 | GT-AG | 0 | 1.5147735795401203e-05 | 1236 | rna-XM_026053543.1 24003294 | 3 | 5427523 | 5428758 | Nothoprocta perdicaria 30464 | CAG\|GTTTGTTAAA...TCCCCCTTGGTT/GTATAGCTCATG...TGCAG\|ACT | 0 | 1 | 14.459 |
| 130621144 | GT-AG | 0 | 1.000000099473604e-05 | 355 | rna-XM_026053543.1 24003294 | 4 | 5428941 | 5429295 | Nothoprocta perdicaria 30464 | TCG\|GTAAGAACTC...TTTTTTTTCATT/TTTTTTTTCATT...ATTAG\|TGA | 2 | 1 | 20.593 |
| 130621145 | GT-AG | 0 | 1.4405021241430391e-05 | 516 | rna-XM_026053543.1 24003294 | 5 | 5429371 | 5429886 | Nothoprocta perdicaria 30464 | CAC\|GTAAGTAACT...CCACTCTTGATT/TTTGTGTTTATC...TGCAG\|GAA | 2 | 1 | 23.121 |
| 130621146 | GT-AG | 0 | 1.000000099473604e-05 | 1489 | rna-XM_026053543.1 24003294 | 6 | 5429985 | 5431473 | Nothoprocta perdicaria 30464 | CAG\|GTAAAATGAA...CCTGCCTTTCTA/ACTGGGTTAACT...CCAAG\|GTG | 1 | 1 | 26.424 |
| 130621147 | GT-AG | 0 | 1.000000099473604e-05 | 1169 | rna-XM_026053543.1 24003294 | 7 | 5431559 | 5432727 | Nothoprocta perdicaria 30464 | CAG\|GTAGGACTGA...TTATGCTTATTA/CTTATGCTTATT...TTTAG\|GTC | 2 | 1 | 29.289 |
| 130621148 | GT-AG | 0 | 1.000000099473604e-05 | 405 | rna-XM_026053543.1 24003294 | 8 | 5432890 | 5433294 | Nothoprocta perdicaria 30464 | AGG\|GTAGGTTACA...CCATGTTTAATA/CCATGTTTAATA...TCCAG\|GTT | 2 | 1 | 34.749 |
| 130621149 | GT-AG | 0 | 1.000000099473604e-05 | 302 | rna-XM_026053543.1 24003294 | 9 | 5433427 | 5433728 | Nothoprocta perdicaria 30464 | CAG\|GTAAAATCCG...ATTGTGTTAACT/ATTGTGTTAACT...TGCAG\|ACG | 2 | 1 | 39.198 |
| 130621150 | GT-AG | 0 | 1.000000099473604e-05 | 105 | rna-XM_026053543.1 24003294 | 10 | 5433853 | 5433957 | Nothoprocta perdicaria 30464 | AAT\|GTGAGTACCA...ATGAGTTTACCC/CATGAGTTTACC...TCTAG\|GAC | 0 | 1 | 43.377 |
| 130621151 | GT-AG | 0 | 1.000000099473604e-05 | 744 | rna-XM_026053543.1 24003294 | 11 | 5434164 | 5434907 | Nothoprocta perdicaria 30464 | AAT\|GTAAGTACGT...GTATATTTATCA/AGTATATTTATC...TTTAG\|GTG | 2 | 1 | 50.32 |
| 130621152 | GT-AG | 0 | 5.329277610482388e-05 | 1263 | rna-XM_026053543.1 24003294 | 12 | 5435026 | 5436288 | Nothoprocta perdicaria 30464 | CGG\|GTACGTTAGA...ATTTTTTTGTTT/GTTTGTTTCACT...AATAG\|GCA | 0 | 1 | 54.297 |
| 130621153 | GT-AG | 0 | 1.000000099473604e-05 | 1031 | rna-XM_026053543.1 24003294 | 13 | 5436369 | 5437399 | Nothoprocta perdicaria 30464 | CAG\|GTAAACACTG...CCTCTCATATCT/CTGTCATTTATA...TTCAG\|GGT | 2 | 1 | 56.994 |
| 130621154 | GT-AG | 0 | 1.000000099473604e-05 | 1152 | rna-XM_026053543.1 24003294 | 14 | 5437565 | 5438716 | Nothoprocta perdicaria 30464 | AAT\|GTAAGTGTAC...ATCTTTTTGAAG/CTGCAATTAATA...CATAG\|TTC | 2 | 1 | 62.555 |
| 130621155 | GT-AG | 0 | 1.000000099473604e-05 | 1156 | rna-XM_026053543.1 24003294 | 15 | 5438783 | 5439938 | Nothoprocta perdicaria 30464 | TAG\|GTGAGTTGTA...TGTTTTTTAATT/TGTTTTTTAATT...CACAG\|AAA | 2 | 1 | 64.779 |
| 130621156 | GT-AG | 0 | 1.000000099473604e-05 | 3648 | rna-XM_026053543.1 24003294 | 16 | 5440090 | 5443737 | Nothoprocta perdicaria 30464 | AAG\|GTAAGGGGAA...TTAATCGTAACT/AAGTATTTAATC...TGTAG\|GGT | 0 | 1 | 69.869 |
| 130621157 | GT-AG | 0 | 1.000000099473604e-05 | 2330 | rna-XM_026053543.1 24003294 | 17 | 5443991 | 5446320 | Nothoprocta perdicaria 30464 | CGG\|GTAAGGTGAA...ATAACTGTACTT/AAAAGCATAACT...CTCAG\|GTC | 1 | 1 | 78.396 |
| 130621158 | GT-AG | 0 | 1.000000099473604e-05 | 1713 | rna-XM_026053543.1 24003294 | 18 | 5446488 | 5448200 | Nothoprocta perdicaria 30464 | CAA\|GTAAGAACTC...AGTGTCTCAACG/AAGTGTCTCAAC...ATCAG\|GAG | 0 | 1 | 84.024 |
| 130621159 | GT-AG | 0 | 1.000000099473604e-05 | 2104 | rna-XM_026053543.1 24003294 | 19 | 5448262 | 5450365 | Nothoprocta perdicaria 30464 | AAG\|GTGAAACTCA...GTATTCTTACTC/TGTATTCTTACT...TTTAG\|AGT | 1 | 1 | 86.08 |
| 130621160 | GT-AG | 0 | 1.000000099473604e-05 | 1196 | rna-XM_026053543.1 24003294 | 20 | 5450482 | 5451677 | Nothoprocta perdicaria 30464 | GAG\|GTGGTGATCT...AAAGCTTTGACT/CTGTACTTTATT...TTTAG\|ACA | 0 | 1 | 89.99 |
| 130621161 | GT-AG | 0 | 1.000000099473604e-05 | 407 | rna-XM_026053543.1 24003294 | 21 | 5451748 | 5452154 | Nothoprocta perdicaria 30464 | CAG\|GTAAAGGATG...GCAGCTTTACTA/TGCAGCTTTACT...TTAAG\|GTC | 1 | 1 | 92.349 |
| 130621162 | GT-AG | 0 | 1.000000099473604e-05 | 529 | rna-XM_026053543.1 24003294 | 22 | 5452294 | 5452822 | Nothoprocta perdicaria 30464 | CAA\|GTAGGTGTCT...TGTGTGTTAAAT/ATATGTTTCACC...TACAG\|GGA | 2 | 1 | 97.034 |
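Two relationships between columns can be checked against the rows above. The sketch below assumes that `start` and `end` are inclusive coordinates (so `length = end - start + 1`) and that the text between the first and last `|` in `scored_motifs` is intronic sequence. Both assumptions are consistent with the rows shown here but are not stated in the column descriptions. The function names are hypothetical.

```python
def span_length(start, end):
    """Length in base pairs of a closed interval [start, end]."""
    return end - start + 1


def terminal_dinucleotides(scored_motifs):
    """Derive a 'GT-AG'-style pair from a scored_motifs string.

    Assumption: '|' separates exonic flank from intronic sequence, so the
    middle field starts with the 5' dinucleotide and ends with the 3' one.
    """
    parts = scored_motifs.split("|")
    if len(parts) != 3:
        raise ValueError("expected exactly two '|' separators")
    intron = parts[1]
    return f"{intron[:2]}-{intron[-2:]}"


# (start, end, length, dinucleotide_pair, scored_motifs) copied from rows
# with ordinal_index 1, 6, and 10 above.
samples = [
    (5423131, 5426689, 3559, "GT-AG",
     "CAG|GTACGCCAAA...TGGTCCTTTTCA/ATTTTGTTTATA...TTCAG|CTA"),
    (5429985, 5431473, 1489, "GT-AG",
     "CAG|GTAAAATGAA...CCTGCCTTTCTA/ACTGGGTTAACT...CCAAG|GTG"),
    (5433853, 5433957, 105, "GT-AG",
     "AAT|GTGAGTACCA...ATGAGTTTACCC/CATGAGTTTACC...TCTAG|GAC"),
]

# True for each sample row if both derived values match the stored columns.
checks = [
    span_length(start, end) == length
    and terminal_dinucleotides(motifs) == pair
    for start, end, length, pair, motifs in samples
]
```

A check like this is mainly useful as a sanity test after export or import, since a shifted coordinate convention would show up as an off-by-one in every row.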
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
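To see how these indexes affect a lookup, SQLite's `EXPLAIN QUERY PLAN` can be run from Python. This is a sketch under stated assumptions: the table is an empty in-memory copy with a reduced column set and no foreign keys, only two of the indexes are created, and `query_plan` is a hypothetical helper. The exact wording of the plan text varies between SQLite versions, so the code only looks for the index name.

```python
import sqlite3

# Reduced, hypothetical copy of the schema: enough columns to compare an
# indexed filter (transcript_id) with an unindexed one (length).
conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE introns (
        id INTEGER PRIMARY KEY,
        transcript_id INTEGER,
        phase INTEGER,
        length INTEGER
    );
    CREATE INDEX idx_introns_transcript_id ON introns (transcript_id);
    CREATE INDEX idx_introns_phase ON introns (phase);
    """
)


def query_plan(conn, sql, params=()):
    """Return the human-readable detail strings of EXPLAIN QUERY PLAN."""
    rows = conn.execute("EXPLAIN QUERY PLAN " + sql, params).fetchall()
    return [row[-1] for row in rows]  # the detail text is the last column


indexed_plan = query_plan(
    conn, "SELECT id FROM introns WHERE transcript_id = ?", (24003294,)
)
unindexed_plan = query_plan(
    conn, "SELECT id FROM introns WHERE length > ?", (1000,)
)

uses_transcript_index = any(
    "idx_introns_transcript_id" in step for step in indexed_plan
)
uses_any_index = any("idx_introns" in step for step in unindexed_plan)
```

Filtering on `transcript_id` should be answered from `idx_introns_transcript_id`, while filtering on `length`, which has no index in the schema above, falls back to a full table scan.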