introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron (e.g., GT-AG)
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, estimated probability (0-100%) that the intron is minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
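The `start`, `end`, and `length` columns appear to be related by inclusive coordinates: for the rows listed on this page, `length` equals `end - start + 1`. The following is a minimal Python sketch of that check, using three (start, end, length) triples copied from the table. The helper name `intron_length` is hypothetical, and the inclusive convention is inferred from these rows rather than stated by the database.

```python
def intron_length(start: int, end: int) -> int:
    """Length in base pairs, assuming start and end are both inclusive."""
    return end - start + 1


# (start, end, length) triples copied from three rows of the introns table.
rows = [
    (29538455, 29538534, 80),
    (29538634, 29538752, 119),
    (29545693, 29551338, 5646),
]

# Collect any row whose stored length disagrees with the coordinates.
mismatches = [r for r in rows if intron_length(r[0], r[1]) != r[2]]
print(mismatches)  # an empty list means every sampled row is consistent
```

A check like this can be useful before joining intron coordinates against annotation files that use a different (for example, half-open) convention.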
17 rows where transcript_id = 27151049
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 151209552 | GT-AG | 0 | 1.000000099473604e-05 | 80 | rna-XM_020743729.1 27151049 | 2 | 29538455 | 29538534 | Phalaenopsis equestris 78828 | AAG\|GTAAGTTGGT...GTTTCTTTCACT/GTTTCTTTCACT...TTTAG\|ATG | 1 | 1 | 8.244 |
| 151209553 | GC-AG | 0 | 1.000000099473604e-05 | 119 | rna-XM_020743729.1 27151049 | 3 | 29538634 | 29538752 | Phalaenopsis equestris 78828 | AAG\|GCAATTCTTT...GGCTTCTAAACA/GGGCTTCTAAAC...AGCAG\|AGG | 1 | 1 | 10.178 |
| 151209554 | GT-AG | 0 | 0.0001264224099379 | 111 | rna-XM_020743729.1 27151049 | 4 | 29538874 | 29538984 | Phalaenopsis equestris 78828 | AAG\|GTATGATACT...TTTTCTTTGGCC/TTTATGTTTATT...CGCAG\|AGA | 2 | 1 | 12.542 |
| 151209555 | GT-AG | 0 | 0.0001483663484068 | 101 | rna-XM_020743729.1 27151049 | 5 | 29539070 | 29539170 | Phalaenopsis equestris 78828 | CCT\|GTATAACACT...ATGCACTGAATT/ACTGAATTGATT...ATTAG\|GGT | 0 | 1 | 14.202 |
| 151209556 | GT-AG | 0 | 1.000000099473604e-05 | 83 | rna-XM_020743729.1 27151049 | 6 | 29539325 | 29539407 | Phalaenopsis equestris 78828 | AGG\|GTGTGTGTTT...CAAGCTTTATAT/CTTTATATCATA...CTCAG\|GGG | 1 | 1 | 17.21 |
| 151209557 | GT-AG | 0 | 0.0063904679842823 | 130 | rna-XM_020743729.1 27151049 | 7 | 29539544 | 29539673 | Phalaenopsis equestris 78828 | GAG\|GTATCTAAAT...CTGTCACTAACT/CTGTCACTAACT...TATAG\|GCG | 2 | 1 | 19.867 |
| 151209558 | GT-AG | 0 | 2.507635826091396e-05 | 94 | rna-XM_020743729.1 27151049 | 8 | 29539927 | 29540020 | Phalaenopsis equestris 78828 | CAA\|GTAATCCAAT...TTTGGTTTATTA/TTTTGGTTTATT...TGCAG\|CTA | 0 | 1 | 24.81 |
| 151209559 | GT-AG | 0 | 0.0007590548909869 | 85 | rna-XM_020743729.1 27151049 | 9 | 29540210 | 29540294 | Phalaenopsis equestris 78828 | GAT\|GTAAGTTTTA...TTGCTGTTAATC/TTGCTGTTAATC...TCCAG\|ATT | 0 | 1 | 28.502 |
| 151209560 | GT-AG | 0 | 2.4996580447690563e-05 | 103 | rna-XM_020743729.1 27151049 | 10 | 29540403 | 29540505 | Phalaenopsis equestris 78828 | AAG\|GTAGGCAAGT...ATGTTCTTAGTA/AATGTTCTTAGT...TGAAG\|GTT | 0 | 1 | 30.611 |
| 151209561 | GT-AG | 0 | 4.281533474596137e-05 | 182 | rna-XM_020743729.1 27151049 | 11 | 29540893 | 29541074 | Phalaenopsis equestris 78828 | GTG\|GTTTGTATGC...GATATGTTAATC/GATATGTTAATC...TGCAG\|AGG | 0 | 1 | 38.172 |
| 151209562 | GT-AG | 0 | 0.0017845624223282 | 172 | rna-XM_020743729.1 27151049 | 12 | 29541186 | 29541357 | Phalaenopsis equestris 78828 | ATG\|GTATTTAATC...ATATTTTTATTG/GATATTTTTATT...TGCAG\|GCA | 0 | 1 | 40.34 |
| 151209563 | GT-AG | 0 | 3.068701495356746e-05 | 1795 | rna-XM_020743729.1 27151049 | 13 | 29542144 | 29543938 | Phalaenopsis equestris 78828 | GAG\|GTATAGTCAA...AGATTTTCAGCA/TAGATTTTCAGC...TTCAG\|GTT | 0 | 1 | 55.694 |
| 151209564 | GT-AG | 0 | 1.000000099473604e-05 | 88 | rna-XM_020743729.1 27151049 | 14 | 29544078 | 29544165 | Phalaenopsis equestris 78828 | CAG\|GTAATGTCTT...TTCTTTTTAGTG/TTTCTTTTTAGT...TGCAG\|GCA | 1 | 1 | 58.41 |
| 151209565 | CG-CA | 0 | 1.000000099473604e-05 | 145 | rna-XM_020743729.1 27151049 | 15 | 29545337 | 29545481 | Phalaenopsis equestris 78828 | AGG\|CGGAAGGGGG...GTTTTTTTCATG/GTTTTTTTCATG...GGACA\|GTT | 2 | 1 | 81.285 |
| 151209566 | GT-AG | 0 | 1.000000099473604e-05 | 5646 | rna-XM_020743729.1 27151049 | 16 | 29545693 | 29551338 | Phalaenopsis equestris 78828 | GAG\|GTTAGTCTAC...TTTGTTTTGAGA/ATGCATTTCATT...TGCAG\|GTT | 0 | 1 | 85.407 |
| 151209567 | GT-AG | 0 | 1.000000099473604e-05 | 1960 | rna-XM_020743729.1 27151049 | 17 | 29551820 | 29553779 | Phalaenopsis equestris 78828 | GAG\|GTAAGTAAAT...TGGTTCTTACAT/GTGGTTCTTACA...AAAAG\|GAG | 1 | 1 | 94.804 |
| 151220758 | GT-AG | 0 | 0.003893715133271 | 233 | rna-XM_020743729.1 27151049 | 1 | 29538088 | 29538320 | Phalaenopsis equestris 78828 | CTG\|GTATGTATTC...AGACCCCTAACT/AGACCCCTAACT...CGCAG\|TGA | | 0 | 7.58 |
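In the rows above, `dinucleotide_pair` matches the first two and last two intron bases of `scored_motifs`, where the intron sequence sits between the two `|` characters and the flanking triplets are exonic context. The sketch below illustrates that relationship in Python. The function name `terminal_dinucleotides` is hypothetical, and the three-part layout is an assumption read off these rows, not a documented format.

```python
def terminal_dinucleotides(scored_motifs: str) -> str:
    """Derive an 'XX-YY' pair from a scored_motifs string.

    Assumed layout (inferred from the table rows, not documented):
    upstream_flank|intron_start...motif/motif...intron_end|downstream_flank
    """
    parts = scored_motifs.split("|")
    if len(parts) != 3:
        raise ValueError(f"unexpected scored_motifs layout: {scored_motifs!r}")
    intron = parts[1]
    # First two and last two bases of the intron portion.
    return f"{intron[:2]}-{intron[-2:]}"


# scored_motifs values copied from the table, with their dinucleotide_pair.
examples = {
    "AAG|GTAAGTTGGT...GTTTCTTTCACT/GTTTCTTTCACT...TTTAG|ATG": "GT-AG",
    "AAG|GCAATTCTTT...GGCTTCTAAACA/GGGCTTCTAAAC...AGCAG|AGG": "GC-AG",
    "AGG|CGGAAGGGGG...GTTTTTTTCATG/GTTTTTTTCATG...GGACA|GTT": "CG-CA",
}

for motifs, expected in examples.items():
    print(terminal_dinucleotides(motifs), expected)
```

The non-canonical CG-CA row (ordinal_index 15) is included because it is the kind of entry where cross-checking the two columns is most informative.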
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL,
PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
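The index on `transcript_id` is what lets a per-transcript filter such as `transcript_id = 27151049` avoid a full table scan. The following is a rough sketch using Python's standard `sqlite3` module against an in-memory database. It uses a reduced copy of the schema (only a few columns, no foreign keys), three rows copied from the table plus one hypothetical row for a made-up transcript id 999, so it illustrates the idea rather than reproducing the real database.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Reduced, illustrative copy of the schema: a subset of columns and one index.
conn.executescript(
    """
    CREATE TABLE introns (
        id INTEGER PRIMARY KEY,
        dinucleotide_pair TEXT,
        transcript_id INTEGER,
        ordinal_index INTEGER
    );
    CREATE INDEX idx_introns_transcript_id ON introns (transcript_id);
    """
)

conn.executemany(
    "INSERT INTO introns VALUES (?, ?, ?, ?)",
    [
        (151209552, "GT-AG", 27151049, 2),
        (151209553, "GC-AG", 27151049, 3),
        (151209554, "GT-AG", 27151049, 4),
        (1, "GT-AG", 999, 1),  # hypothetical row for a different transcript
    ],
)

query = (
    "SELECT id, ordinal_index FROM introns "
    "WHERE transcript_id = ? ORDER BY ordinal_index"
)
rows = conn.execute(query, (27151049,)).fetchall()

# The last column of each EXPLAIN QUERY PLAN row is the human-readable detail.
plan = conn.execute("EXPLAIN QUERY PLAN " + query, (27151049,)).fetchall()
plan_text = " ".join(step[-1] for step in plan)

print(rows)
print(plan_text)
```

The exact wording of the plan varies between SQLite versions, so the sketch only prints it; the thing to look for is a mention of `idx_introns_transcript_id`.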