introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
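The column definitions can be cross-checked against one another. The following is a minimal sketch, assuming that genome coordinates are 1-based and inclusive and that the two `|` characters in scored_motifs mark the exon/intron boundaries; both assumptions are inferred from the rows shown on this page, not stated by the database. The helper names `intron_length` and `terminal_dinucleotides` are hypothetical.

```python
def intron_length(start: int, end: int) -> int:
    """Length in base pairs, assuming 1-based inclusive coordinates."""
    return end - start + 1


def terminal_dinucleotides(scored_motifs: str) -> str:
    """Derive a dinucleotide_pair-style string from scored_motifs.

    Assumes the first '|' precedes the first intron base and the last '|'
    follows the last intron base (layout inferred from the sample rows).
    """
    first_bar = scored_motifs.index("|")
    last_bar = scored_motifs.rindex("|")
    five_prime = scored_motifs[first_bar + 1:first_bar + 3]
    three_prime = scored_motifs[last_bar - 2:last_bar]
    return f"{five_prime}-{three_prime}"


# Values taken from the row with id 84004763.
motifs = "AAG|GTACGTAATG...TTTTATTTGATA/TTTTATTTGATA...GTTAG|TGC"
print(intron_length(40737418, 40737556))  # 139, matching the length column
print(terminal_dinucleotides(motifs))     # GT-AG, matching dinucleotide_pair
```

If either assumption does not hold for other rows (for example, a different coordinate convention), the checks would fail there, which makes them a cheap sanity test after importing the data.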
23 rows where transcript_id = 15550549
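The same filter can be reproduced against a local SQLite copy of the table. The sketch below uses Python's standard `sqlite3` module with a cut-down table holding only four of the columns and three of the rows listed on this page; the extra row with transcript_id 42 is made up purely to show that the filter excludes it, and the function name `introns_for_transcript` is hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Cut-down version of the introns table: only the columns this example needs.
conn.execute(
    """CREATE TABLE introns (
           id INTEGER PRIMARY KEY,
           length INTEGER,
           transcript_id INTEGER,
           ordinal_index INTEGER
       )"""
)
conn.executemany(
    "INSERT INTO introns (id, length, transcript_id, ordinal_index) VALUES (?, ?, ?, ?)",
    [
        (84004763, 139, 15550549, 1),
        (84004764, 1786, 15550549, 2),
        (84004765, 402, 15550549, 3),
        (1, 100, 42, 1),  # hypothetical row belonging to another transcript
    ],
)


def introns_for_transcript(connection, transcript_id):
    """Return (id, ordinal_index, length) for one transcript, in intron order."""
    cursor = connection.execute(
        "SELECT id, ordinal_index, length FROM introns "
        "WHERE transcript_id = ? ORDER BY ordinal_index",
        (transcript_id,),
    )
    return cursor.fetchall()


result = introns_for_transcript(conn, 15550549)
for row in result:
    print(row)
```

Passing the transcript id as a bound parameter (`?`) rather than formatting it into the SQL string keeps the query safe to reuse with arbitrary input.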
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 84004763 | GT-AG | 0 | 1.000000099473604e-05 | 139 | rna-XM_028380855.1 15550549 | 1 | 40737418 | 40737556 | Glycine soja 3848 | AAG\|GTACGTAATG...TTTTATTTGATA/TTTTATTTGATA...GTTAG\|TGC | 1 | 1 | 3.136 |
| 84004764 | GT-AG | 0 | 0.001647296505405 | 1786 | rna-XM_028380855.1 15550549 | 2 | 40737714 | 40739499 | Glycine soja 3848 | CAT\|GTACGTCTCT...TGGTTTTTAATG/AATATATTCATT...TGCAG\|CAT | 2 | 1 | 8.375 |
| 84004765 | GT-AG | 0 | 0.0712786200360404 | 402 | rna-XM_028380855.1 15550549 | 3 | 40739572 | 40739973 | Glycine soja 3848 | AAT\|GTATGTATGG...TTGTCTTTAATT/TATTATTTGATT...TGCAG\|TGA | 2 | 1 | 10.777 |
| 84004766 | GT-AG | 0 | 2.0271135660303893e-05 | 278 | rna-XM_028380855.1 15550549 | 4 | 40740043 | 40740320 | Glycine soja 3848 | CAT\|GTAATATATC...ATTTCCTAAAAC/GGGAATCTAATT...GTAAG\|TTC | 2 | 1 | 13.08 |
| 84004767 | GT-AG | 0 | 0.0014211496393531 | 91 | rna-XM_028380855.1 15550549 | 5 | 40740393 | 40740483 | Glycine soja 3848 | TTT\|GTAAGTTTCT...TTTTTCTTTTTG/CTGAATTTAAAA...AATAG\|AAA | 2 | 1 | 15.482 |
| 84004768 | GT-AG | 0 | 0.0002849127645914 | 128 | rna-XM_028380855.1 15550549 | 6 | 40740556 | 40740683 | Glycine soja 3848 | ACT\|GTAAGCCCTT...CATTCATTACTT/CTTCCATTCATT...TTTAG\|GCA | 2 | 1 | 17.885 |
| 84004769 | GT-AG | 0 | 0.0110349164722188 | 268 | rna-XM_028380855.1 15550549 | 7 | 40740756 | 40741023 | Glycine soja 3848 | ACT\|GTAAGCTTTG...ACGTATTTAACT/ACGTATTTAACT...TGCAG\|TAG | 2 | 1 | 20.287 |
| 84004770 | GT-AG | 0 | 0.000530253431643 | 93 | rna-XM_028380855.1 15550549 | 8 | 40741096 | 40741188 | Glycine soja 3848 | ACT\|GTGTGATTGT...GTCGCCTTATCC/GTTGTTTTAATT...TTTAG\|TAT | 2 | 1 | 22.689 |
| 84004771 | GT-AG | 0 | 0.0003950755785773 | 83 | rna-XM_028380855.1 15550549 | 9 | 40741261 | 40741343 | Glycine soja 3848 | CTT\|GTAAGTTTGA...TGCTTTTTATTG/ATGCTTTTTATT...TGCAG\|GAG | 2 | 1 | 25.092 |
| 84004772 | GT-AG | 0 | 1.000000099473604e-05 | 141 | rna-XM_028380855.1 15550549 | 10 | 40741416 | 40741556 | Glycine soja 3848 | ACT\|GTGAGTCAGA...ATCACTTTGCAT/TATTGTATGATT...GCCAG\|AAT | 2 | 1 | 27.494 |
| 84004773 | GT-AG | 0 | 1.000000099473604e-05 | 377 | rna-XM_028380855.1 15550549 | 11 | 40741629 | 40742005 | Glycine soja 3848 | ATT\|GTAAGTGGTG...CTGACCGTAATT/CATATTCTGACC...TCTAG\|AGA | 2 | 1 | 29.897 |
| 84004774 | GT-AG | 0 | 1.000000099473604e-05 | 625 | rna-XM_028380855.1 15550549 | 12 | 40742078 | 40742702 | Glycine soja 3848 | GCT\|GTAAGAAATT...ATTGACTTGATA/GTTATATTAATT...GCTAG\|TTT | 2 | 1 | 32.299 |
| 84004775 | GT-AG | 0 | 1.000000099473604e-05 | 2081 | rna-XM_028380855.1 15550549 | 13 | 40742766 | 40744846 | Glycine soja 3848 | TGT\|GTGAGTTACC...TGTTTCTAAATC/TTGTTTCTAAAT...TACAG\|AGA | 2 | 1 | 34.401 |
| 84004776 | GT-AG | 0 | 8.434928518332071e-05 | 99 | rna-XM_028380855.1 15550549 | 14 | 40744916 | 40745014 | Glycine soja 3848 | TGT\|GTAAGACTCC...TAAGCCTTGACC/GTAGTTTTAAGA...CTCAG\|GAA | 2 | 1 | 36.703 |
| 84004777 | GT-AG | 0 | 0.0005005589020157 | 132 | rna-XM_028380855.1 15550549 | 15 | 40745051 | 40745182 | Glycine soja 3848 | CTT\|GTAAGTTTAA...TTTATTTTATTT/CTTTATTTTATT...TTCAG\|AGG | 2 | 1 | 37.905 |
| 84004778 | GT-AG | 0 | 2.597145098481073e-05 | 13562 | rna-XM_028380855.1 15550549 | 16 | 40745221 | 40758782 | Glycine soja 3848 | AAA\|GTAAGTTGAT...TAGTTGTTAACC/TTGTGATTAATT...TCCAG\|CTT | 1 | 1 | 39.173 |
| 84004779 | GT-AG | 0 | 1.1436174427465498e-05 | 244 | rna-XM_028380855.1 15550549 | 17 | 40759163 | 40759406 | Glycine soja 3848 | CAG\|GTCCATATTT...CATTTCTCAACT/ACATTTCTCAAC...CGCAG\|AGA | 0 | 1 | 51.852 |
| 84004780 | GT-AG | 0 | 1.000000099473604e-05 | 82 | rna-XM_028380855.1 15550549 | 18 | 40759603 | 40759684 | Glycine soja 3848 | CTA\|GTGAGTTGTA...TCATCTTTAGCT/TCTGTTCTCATC...TGCAG\|ACT | 1 | 1 | 58.392 |
| 84004781 | GT-AG | 0 | 0.0002995011841754 | 270 | rna-XM_028380855.1 15550549 | 19 | 40759835 | 40760104 | Glycine soja 3848 | GAG\|GTAATCACCA...GTTTTCTTAATA/TGTTTTCTTAAT...GGTAG\|AAC | 1 | 1 | 63.397 |
| 84004782 | GT-AG | 0 | 0.0002364943860431 | 8207 | rna-XM_028380855.1 15550549 | 20 | 40760224 | 40768430 | Glycine soja 3848 | AAG\|GTATATCCTC...TAATGTTTAATG/CTATTTTTCACC...TTTAG\|GGT | 0 | 1 | 67.367 |
| 84004783 | GT-AG | 0 | 3.103384367798838e-05 | 105 | rna-XM_028380855.1 15550549 | 21 | 40768642 | 40768746 | Glycine soja 3848 | TTG\|GTAATTGTTT...CCTATCTTATTC/TGTTGTTTAATT...ATAAG\|CCA | 1 | 1 | 74.408 |
| 84004784 | GT-AG | 0 | 1.000000099473604e-05 | 354 | rna-XM_028380855.1 15550549 | 22 | 40768997 | 40769350 | Glycine soja 3848 | TTA\|GTGAGTTTCC...TAGTTTATGACT/TGAATTTTCAAT...TGTAG\|TGG | 2 | 1 | 82.749 |
| 84004785 | GT-AG | 0 | 5.807826515485036e-05 | 534 | rna-XM_028380855.1 15550549 | 23 | 40769502 | 40770035 | Glycine soja 3848 | AGG\|GTAAATTATG...TTTGCATTAATT/AAATGTTTGATT...ACTAG\|GTA | 0 | 1 | 87.788 |
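
The start, end, and phase columns in the rows above can be related to one another: the gap between two consecutive introns is an internal exon, and the phase of the next intron should equal the previous phase plus that exon's length, modulo 3. The sketch below checks this for the first four introns. It assumes 1-based inclusive coordinates, a forward-strand transcript (starts increase with ordinal_index, as they do here), and that the exons between these introns are entirely coding, which in_cds = 1 suggests but does not prove. The function name `exon_lengths` is hypothetical.

```python
# (ordinal_index, start, end, phase) for introns 1-4, copied from the table above.
introns = [
    (1, 40737418, 40737556, 1),
    (2, 40737714, 40739499, 2),
    (3, 40739572, 40739973, 2),
    (4, 40740043, 40740320, 2),
]


def exon_lengths(rows):
    """Lengths of the exons lying between consecutive introns.

    Assumes rows are sorted by ordinal_index on the forward strand and that
    coordinates are 1-based inclusive.
    """
    lengths = []
    for (_, _, prev_end, _), (_, next_start, _, _) in zip(rows, rows[1:]):
        lengths.append(next_start - prev_end - 1)
    return lengths


lengths = exon_lengths(introns)
print(lengths)  # [157, 72, 69]

# Phase propagation: each exon shifts the reading-frame offset by its length mod 3.
for (_, _, _, prev_phase), (_, _, _, next_phase), exon_len in zip(
    introns, introns[1:], lengths
):
    predicted = (prev_phase + exon_len) % 3
    print(predicted == next_phase)  # True for each of the three pairs
```

For a reverse-strand transcript the subtraction would need to run in the opposite direction, so the strand should be confirmed from the transcripts table before applying this to other rows.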
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL,
PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
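To see what the indexes buy, the schema can be loaded into an empty in-memory SQLite database and the query planner asked how it would run the transcript filter used on this page. This is a minimal sketch using Python's standard `sqlite3` module; only two of the seven indexes are recreated to keep it short, the referenced transcripts and genomes tables are deliberately left out (SQLite allows creating a table whose foreign-key targets do not exist yet), and the helper name `plan_details` is hypothetical. The exact wording of the plan text varies between SQLite versions, so the code only looks for the index name.

```python
import sqlite3

SCHEMA = """
CREATE TABLE "introns" (
    "id" INTEGER,
    "dinucleotide_pair" TEXT,
    "is_minor" INTEGER,
    "score" REAL,
    "length" INTEGER,
    "transcript_id" INTEGER,
    "ordinal_index" INTEGER,
    "start" INTEGER,
    "end" INTEGER,
    "taxonomy_id" INTEGER,
    "scored_motifs" TEXT,
    "phase" INTEGER,
    "in_cds" INTEGER,
    "relative_position" REAL,
    PRIMARY KEY ([id]),
    FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
    FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_is_minor] ON [introns] ([is_minor]);
"""

conn = sqlite3.connect(":memory:")
conn.executescript(SCHEMA)


def plan_details(connection, sql, params=()):
    """Return the human-readable detail column of EXPLAIN QUERY PLAN."""
    rows = connection.execute("EXPLAIN QUERY PLAN " + sql, params).fetchall()
    return [row[-1] for row in rows]  # detail is the last column


details = plan_details(
    conn, "SELECT * FROM introns WHERE transcript_id = ?", (15550549,)
)
for line in details:
    print(line)

uses_index = any("idx_introns_transcript_id" in line for line in details)
print(uses_index)
```

A filter on a column without an index, such as length, would instead be reported as a full scan of the table, which is the practical difference the CREATE INDEX statements make on a table of this size.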