introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
23 rows where transcript_id = 15550548
This data as json, CSV (advanced)
Suggested facets: score, phase
| id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 84004740 | GT-AG | 0 | 1.000000099473604e-05 | 369 | rna-XM_028380844.1 15550548 | 1 | 40636297 | 40636665 | Glycine soja 3848 | AAG|GTACGTAATG...AGTTTATTGATA/TTTATATTTATG...GTTAG|TGC | 1 | 1 | 3.136 |
| 84004741 | GT-AG | 0 | 0.0020694358645437 | 5327 | rna-XM_028380844.1 15550548 | 2 | 40636823 | 40642149 | Glycine soja 3848 | CAT|GTACGTCTCT...TGATTTTTAATG/CATTTCTTCATT...TGCAG|CAT | 2 | 1 | 8.375 |
| 84004742 | GT-AG | 0 | 0.0001967485903693 | 390 | rna-XM_028380844.1 15550548 | 3 | 40642222 | 40642611 | Glycine soja 3848 | AAT|GTAAGTATGG...TTGTCTTTAATT/TATTATTTGATT...TGCAG|TGA | 2 | 1 | 10.777 |
| 84004743 | GT-AG | 0 | 2.0271135660303893e-05 | 277 | rna-XM_028380844.1 15550548 | 4 | 40642681 | 40642957 | Glycine soja 3848 | CAT|GTAATATATC...ATTTCCTAAAAC/GGGAATCTAATT...GTAAG|TTC | 2 | 1 | 13.08 |
| 84004744 | GT-AG | 0 | 0.0012149168699229 | 89 | rna-XM_028380844.1 15550548 | 5 | 40643030 | 40643118 | Glycine soja 3848 | TTT|GTAAGTTTCT...TTTTTCTTTTTG/CTGAATTTAAAA...AATAG|AGT | 2 | 1 | 15.482 |
| 84004745 | GT-AG | 0 | 0.0002849127645914 | 128 | rna-XM_028380844.1 15550548 | 6 | 40643191 | 40643318 | Glycine soja 3848 | ACT|GTAAGCCCTT...CATTCATTACTT/CTTCCATTCATT...TTTAG|GCA | 2 | 1 | 17.885 |
| 84004746 | GT-AG | 0 | 0.0112779237435292 | 265 | rna-XM_028380844.1 15550548 | 7 | 40643391 | 40643655 | Glycine soja 3848 | ACT|GTAAGCTTTG...ATATCTCTAACG/ACGTATTTAACT...TGCAG|TAG | 2 | 1 | 20.287 |
| 84004747 | GT-AG | 0 | 0.00048865611063 | 94 | rna-XM_028380844.1 15550548 | 8 | 40643728 | 40643821 | Glycine soja 3848 | ACT|GTGTGATTAT...TTTCTTTTAACG/GTTGTTTTAATT...TTTAG|TGT | 2 | 1 | 22.689 |
| 84004748 | GT-AG | 0 | 4.292222042687047e-05 | 83 | rna-XM_028380844.1 15550548 | 9 | 40643894 | 40643976 | Glycine soja 3848 | CTT|GTAAGTATGA...TGCTTTTTATTG/ATGCTTTTTATT...TGCAG|GAG | 2 | 1 | 25.092 |
| 84004749 | GT-AG | 0 | 1.000000099473604e-05 | 141 | rna-XM_028380844.1 15550548 | 10 | 40644049 | 40644189 | Glycine soja 3848 | ACT|GTGAGTCAGA...ATCACTTTGCAT/TATTGTATGATT...GCCAG|AAT | 2 | 1 | 27.494 |
| 84004750 | GT-AG | 0 | 1.000000099473604e-05 | 381 | rna-XM_028380844.1 15550548 | 11 | 40644262 | 40644642 | Glycine soja 3848 | ATT|GTAAGTGGTG...CTGACCGTAATT/CATATTCTGACC...TCTAG|AGA | 2 | 1 | 29.897 |
| 84004751 | GT-AG | 0 | 1.000000099473604e-05 | 616 | rna-XM_028380844.1 15550548 | 12 | 40644715 | 40645330 | Glycine soja 3848 | GCT|GTAAGAAATT...ATTGACTTGATA/GTTATATTAATT...GCTAG|TTT | 2 | 1 | 32.299 |
| 84004752 | GT-AG | 0 | 1.095580502000802e-05 | 1991 | rna-XM_028380844.1 15550548 | 13 | 40645394 | 40647384 | Glycine soja 3848 | TGT|GTGAGTTACC...GTGATCTTAGTT/TAGTTTCTAAAT...TACAG|AGA | 2 | 1 | 34.401 |
| 84004753 | GT-AG | 0 | 8.434928518332071e-05 | 99 | rna-XM_028380844.1 15550548 | 14 | 40647454 | 40647552 | Glycine soja 3848 | TGT|GTAAGACTCC...TAAGCCTTGACC/GTAGTTTTAAGA...CTCAG|GAA | 2 | 1 | 36.703 |
| 84004754 | GT-AG | 0 | 0.0005005589020157 | 133 | rna-XM_028380844.1 15550548 | 15 | 40647589 | 40647721 | Glycine soja 3848 | CTT|GTAAGTTTAA...TTTATTTTATTT/CTTTATTTTATT...TTCAG|AGG | 2 | 1 | 37.905 |
| 84004755 | GT-AG | 0 | 2.597145098481073e-05 | 15575 | rna-XM_028380844.1 15550548 | 16 | 40647760 | 40663334 | Glycine soja 3848 | AAA|GTAAGTTGAT...TAGTTGTTAACC/TTGTGATTAATT...TCCAG|CTT | 1 | 1 | 39.173 |
| 84004756 | GT-AG | 0 | 1.000000099473604e-05 | 304 | rna-XM_028380844.1 15550548 | 17 | 40663715 | 40664018 | Glycine soja 3848 | CAG|GTCCATATTT...CATTTCTCAACT/TTTTGTTTGAAT...TGCAG|AGA | 0 | 1 | 51.852 |
| 84004757 | GT-AG | 0 | 1.000000099473604e-05 | 82 | rna-XM_028380844.1 15550548 | 18 | 40664215 | 40664296 | Glycine soja 3848 | CTA|GTGAGTTGTA...TCATCTTTAGCT/TCTGTTCTCATC...TGCAG|ACT | 1 | 1 | 58.392 |
| 84004758 | GT-AG | 0 | 4.31594825190232e-05 | 261 | rna-XM_028380844.1 15550548 | 19 | 40664447 | 40664707 | Glycine soja 3848 | GAG|GTAATCACCA...TTTGTTTTATTA/ATTTGTTTTATT...GGTAG|AAC | 1 | 1 | 63.397 |
| 84004759 | GT-AG | 0 | 0.0002364943860431 | 4519 | rna-XM_028380844.1 15550548 | 20 | 40664827 | 40669345 | Glycine soja 3848 | AAG|GTATATCCTC...TAATGTTTAATG/CTATTTTTCACC...TTTAG|GGT | 0 | 1 | 67.367 |
| 84004760 | GT-AG | 0 | 3.103384367798838e-05 | 105 | rna-XM_028380844.1 15550548 | 21 | 40669557 | 40669661 | Glycine soja 3848 | TTG|GTAATTGTTT...CCTATCTTATTC/TGTTGTTTAATT...ATAAG|CCA | 1 | 1 | 74.408 |
| 84004761 | GT-AG | 0 | 1.000000099473604e-05 | 354 | rna-XM_028380844.1 15550548 | 22 | 40669912 | 40670265 | Glycine soja 3848 | TTA|GTGAGTTTCC...TGTACTGTAATT/TGAATTTTCAAT...TGTAG|TGG | 2 | 1 | 82.749 |
| 84004762 | GT-AG | 0 | 5.807826515485036e-05 | 528 | rna-XM_028380844.1 15550548 | 23 | 40670417 | 40670944 | Glycine soja 3848 | AGG|GTAAATTATG...TTTGCATTAATT/AAATGTTTGATT...ACTAG|GTA | 0 | 1 | 87.788 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL
,PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);