introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
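The `start`, `end`, and `length` columns appear to use inclusive coordinates: in the rows listed on this page for transcript_id 15550499, `length` equals `end - start + 1`. The following is a minimal Python sketch of that relationship, not a statement of how the database computes it. The helper name `intron_length` is hypothetical, and the sample tuples are copied from the first three rows of that listing.

```python
def intron_length(start: int, end: int) -> int:
    """Length in base pairs, assuming start and end are both inclusive."""
    return end - start + 1


# (start, end, length) copied from the first three rows for transcript_id 15550499
sample_rows = [
    (56450555, 56450641, 87),
    (56450880, 56450993, 114),
    (56451224, 56451825, 602),
]

# Every sample row is consistent with the inclusive-coordinate assumption.
all_consistent = all(
    intron_length(start, end) == length for start, end, length in sample_rows
)
```

If a row ever failed this check, that would suggest a different coordinate convention (for example, a half-open interval) for that record.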
24 rows where transcript_id = 15550499
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 84004222 | GT-AG | 0 | 0.0002357764224501 | 87 | rna-XM_028324342.1 15550499 | 1 | 56450555 | 56450641 | Glycine soja 3848 | AAG\|GTATTACTCT...GAACCTTTACTG/CTGTAGCTAATT...ATTAG\|TGA | 2 | 1 | 3.713 |
| 84004223 | GT-AG | 0 | 0.0092328512081176 | 114 | rna-XM_028324342.1 15550499 | 2 | 56450880 | 56450993 | Glycine soja 3848 | TCT\|GTAAGTTTTC...TGATTCTTATTG/TTGATTCTTATT...TCAAG\|CAG | 0 | 1 | 9.202 |
| 84004224 | GT-AG | 0 | 1.000000099473604e-05 | 602 | rna-XM_028324342.1 15550499 | 3 | 56451224 | 56451825 | Glycine soja 3848 | TAG\|GTGATTTTTA...AACTACTTAATA/TTGTTGCTAACT...TTTAG\|GTT | 2 | 1 | 14.506 |
| 84004225 | GT-AG | 0 | 1.000000099473604e-05 | 1015 | rna-XM_028324342.1 15550499 | 4 | 56452025 | 56453039 | Glycine soja 3848 | AAG\|GTATGGGCCA...GATATTTTATTG/ATTTTATTGATG...GTCAG\|GAT | 0 | 1 | 19.096 |
| 84004226 | GT-AG | 0 | 0.0064560765801764 | 172 | rna-XM_028324342.1 15550499 | 5 | 56453088 | 56453259 | Glycine soja 3848 | GAG\|GTACGCTTTA...TTGTTATTGATG/CATTTACTCACA...TGCAG\|ATA | 0 | 1 | 20.203 |
| 84004227 | GT-AG | 0 | 1.000000099473604e-05 | 115 | rna-XM_028324342.1 15550499 | 6 | 56453368 | 56453482 | Glycine soja 3848 | CAG\|GTGAGGAACA...TTGCTTTTAATT/CCTATTTTCATT...TGCAG\|TTG | 0 | 1 | 22.694 |
| 84004228 | GT-AG | 0 | 0.0057766113832381 | 80 | rna-XM_028324342.1 15550499 | 7 | 56453672 | 56453751 | Glycine soja 3848 | GAA\|GTATGTCCTA...TCATTCTTACAA/ATCATTCTTACA...CACAG\|GCA | 0 | 1 | 27.053 |
| 84004229 | GT-AG | 0 | 1.000000099473604e-05 | 971 | rna-XM_028324342.1 15550499 | 8 | 56454049 | 56455019 | Glycine soja 3848 | AAG\|GTATGAAAAG...TGTTTTTTATAT/ATGTTTTTTATA...TGCAG\|ATT | 0 | 1 | 33.902 |
| 84004230 | GT-AG | 0 | 1.000000099473604e-05 | 419 | rna-XM_028324342.1 15550499 | 9 | 56455122 | 56455540 | Glycine soja 3848 | AAG\|GTGAGATGAA...TGTTTATTGATG/TAAATGTTTATT...TACAG\|AGT | 0 | 1 | 36.255 |
| 84004231 | GT-AG | 0 | 0.0001848824383546 | 352 | rna-XM_028324342.1 15550499 | 10 | 56455676 | 56456027 | Glycine soja 3848 | GAG\|GTTTGTTTCT...CTTTCTATGACA/CTTTACTTCATT...TACAG\|GTT | 0 | 1 | 39.368 |
| 84004232 | GT-AG | 0 | 1.000000099473604e-05 | 701 | rna-XM_028324342.1 15550499 | 11 | 56456253 | 56456953 | Glycine soja 3848 | GAG\|GTAATTTCAT...ATTTACTTAAAA/TAGTCATTTACT...TTCAG\|GTC | 0 | 1 | 44.557 |
| 84004233 | GT-AG | 0 | 0.0008859123599279 | 305 | rna-XM_028324342.1 15550499 | 12 | 56457158 | 56457462 | Glycine soja 3848 | GAG\|GTATTTCTTC...ATATACATGATT/TTGTTGTTCAGC...TGTAG\|CTT | 0 | 1 | 49.262 |
| 84004234 | GT-AG | 0 | 0.0015040070543109 | 374 | rna-XM_028324342.1 15550499 | 13 | 56457634 | 56458007 | Glycine soja 3848 | AAG\|GTTTTTTGTG...GTTTTCTTGAAC/TATTAATTCATT...GACAG\|ATA | 0 | 1 | 53.206 |
| 84004235 | GT-AG | 0 | 2.588981764235507e-05 | 780 | rna-XM_028324342.1 15550499 | 14 | 56458158 | 56458937 | Glycine soja 3848 | CAG\|GTACTTGTCA...TTCCATTTAACG/TTGTATTTCATG...ACCAG\|GTG | 0 | 1 | 56.665 |
| 84004236 | GT-AG | 0 | 2.5961291583987645e-05 | 154 | rna-XM_028324342.1 15550499 | 15 | 56459076 | 56459229 | Glycine soja 3848 | GAG\|GTAAACATCT...TTCCTTGTGATG/TTCCTTGTGATG...TGCAG\|CAT | 0 | 1 | 59.848 |
| 84004237 | GT-AG | 0 | 0.0001084666883635 | 77 | rna-XM_028324342.1 15550499 | 16 | 56459458 | 56459534 | Glycine soja 3848 | GAG\|GTTTGTTATC...AGTTCTTTTACA/AGTTCTTTTACA...CTCAG\|GTA | 0 | 1 | 65.106 |
| 84004238 | GT-AG | 0 | 1.000000099473604e-05 | 172 | rna-XM_028324342.1 15550499 | 17 | 56459688 | 56459859 | Glycine soja 3848 | GAA\|GTGAGAAGCT...TGGCTTCTGACA/TGGCTTCTGACA...TGCAG\|GTA | 0 | 1 | 68.635 |
| 84004239 | GT-AG | 0 | 2.529355196675522e-05 | 114 | rna-XM_028324342.1 15550499 | 18 | 56459987 | 56460100 | Glycine soja 3848 | ATG\|GTGAACTTTG...TTACTGTTAATT/TTACTGTTAATT...TGCAG\|ATG | 1 | 1 | 71.564 |
| 84004240 | GT-AG | 0 | 0.2473563951547327 | 1175 | rna-XM_028324342.1 15550499 | 19 | 56460216 | 56461390 | Glycine soja 3848 | TAG\|GTACCTCTTT...TTATTTTTAGTT/TGAAATCTGATT...TACAG\|GCT | 2 | 1 | 74.216 |
| 84004241 | GT-AG | 0 | 1.000000099473604e-05 | 87 | rna-XM_028324342.1 15550499 | 20 | 56461512 | 56461598 | Glycine soja 3848 | AAG\|GTGTGAGCTT...GTTTCCTTCTGT/GTCAAATTGATT...TGCAG\|GTC | 0 | 1 | 77.006 |
| 84004242 | GT-AG | 0 | 1.000000099473604e-05 | 327 | rna-XM_028324342.1 15550499 | 21 | 56461680 | 56462006 | Glycine soja 3848 | AGG\|GTTAGTTATT...TGCATTTTAAAT/TGCATTTTAAAT...TGTAG\|GCT | 0 | 1 | 78.875 |
| 84004243 | GT-AG | 0 | 1.000000099473604e-05 | 184 | rna-XM_028324342.1 15550499 | 22 | 56462109 | 56462292 | Glycine soja 3848 | AAG\|GTGAGATAGA...TTATTCTTATTC/ATTATTCTTATT...TCCAG\|GAC | 0 | 1 | 81.227 |
| 84004244 | GT-AG | 0 | 1.000000099473604e-05 | 552 | rna-XM_028324342.1 15550499 | 23 | 56462444 | 56462995 | Glycine soja 3848 | CAG\|GTTTAGTAAT...GTAATTTTACTA/AGTAATTTTACT...TGCAG\|CTC | 1 | 1 | 84.709 |
| 84012283 | GT-AG | 0 | 1.000000099473604e-05 | 232 | rna-XM_028324342.1 15550499 | 24 | 56463132 | 56463363 | Glycine soja 3848 | GAG\|GTTAGCATGT...TTTTTTGTATCT/GATACACTAATT...AACAG\|GAG |  | 0 | 87.846 |
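The listing above is the result of filtering `introns` on `transcript_id = 15550499`, sorted by `id`. The following Python sketch shows an equivalent query using the standard-library `sqlite3` module. It is an illustration under stated assumptions: the in-memory table holds only a subset of the columns and three rows copied from the listing, and the function name `introns_for_transcript` is hypothetical.

```python
import sqlite3

# Assumption: a reduced, in-memory copy of the introns table for illustration only.
conn = sqlite3.connect(":memory:")
conn.execute(
    'CREATE TABLE introns ("id" INTEGER PRIMARY KEY, "score" REAL, '
    '"length" INTEGER, "transcript_id" INTEGER, "ordinal_index" INTEGER)'
)

# Three rows copied from the listing (id, score, length, transcript_id, ordinal_index).
sample = [
    (84004222, 0.0002357764224501, 87, 15550499, 1),
    (84004240, 0.2473563951547327, 1175, 15550499, 19),
    (84012283, 1.000000099473604e-05, 232, 15550499, 24),
]
conn.executemany("INSERT INTO introns VALUES (?, ?, ?, ?, ?)", sample)


def introns_for_transcript(conn, transcript_id):
    """Return (id, ordinal_index, length, score) rows for one transcript, sorted by id."""
    cur = conn.execute(
        "SELECT id, ordinal_index, length, score FROM introns "
        "WHERE transcript_id = ? ORDER BY id",
        (transcript_id,),
    )
    return cur.fetchall()


rows = introns_for_transcript(conn, 15550499)

# The row with the highest minor-intron score among the sampled rows.
top = max(rows, key=lambda r: r[3])
```

The parameterized `?` placeholder keeps the transcript identifier out of the SQL string, which is the usual way to pass values to `sqlite3` queries.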
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
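The schema above indexes `transcript_id`, so a filter on that column can be answered from the index rather than a full table scan. The following Python sketch checks this with SQLite's `EXPLAIN QUERY PLAN` on an empty in-memory table. It is a hedged illustration: the table is reduced to three columns, the foreign keys are omitted, and the exact wording of the plan text varies between SQLite versions, so the code only looks for the index name.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Assumption: a reduced version of the schema, keeping only what the query needs.
conn.executescript(
    """
    CREATE TABLE "introns" (
      "id" INTEGER,
      "transcript_id" INTEGER,
      "is_minor" INTEGER,
      PRIMARY KEY ([id])
    );
    CREATE INDEX [idx_introns_transcript_id]
    ON [introns] ([transcript_id]);
    """
)

# Each plan row ends with a human-readable detail string in column 3.
plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT id FROM introns WHERE transcript_id = ?",
    (15550499,),
).fetchall()
plan_text = " ".join(row[3] for row in plan)

# True when the planner chose the transcript_id index for this filter.
uses_index = "idx_introns_transcript_id" in plan_text
```

The same check can be repeated for the other indexed columns (`taxonomy_id`, `phase`, `is_minor`, `dinucleotide_pair`, `score`, `in_cds`) by creating the matching index and changing the `WHERE` clause.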