introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; NULL for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
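
Because phase is NULL for introns outside the coding sequence, filters on that column need `IS NULL` rather than `= NULL`. The following is a minimal sketch using Python's standard `sqlite3` module against a cut-down, in-memory copy of the table; the three rows and their ids are hypothetical placeholders, not records from the database.

```python
import sqlite3

# In-memory stand-in for the introns table (only the columns needed here).
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE introns (id INTEGER PRIMARY KEY, phase INTEGER, in_cds INTEGER)"
)

# Hypothetical rows: two coding introns and one UTR intron with no phase.
conn.executemany(
    "INSERT INTO introns VALUES (?, ?, ?)",
    [(1, 0, 1), (2, 2, 1), (3, None, 0)],
)

# Comparing with '=' against NULL never evaluates to true, so nothing matches.
eq_null = conn.execute(
    "SELECT COUNT(*) FROM introns WHERE phase = NULL"
).fetchone()[0]

# IS NULL is the test that actually finds rows without a phase.
is_null = conn.execute(
    "SELECT COUNT(*) FROM introns WHERE phase IS NULL"
).fetchone()[0]

print(eq_null, is_null)
```

Filtering on `in_cds = 0` may be an alternative for the same purpose, assuming the two columns are kept consistent as the descriptions suggest.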
30 rows where transcript_id = 5530394
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 28483562 | GT-AG | 0 | 1.000000099473604e-05 | 43436 | rna-XM_044275306.1 5530394 | 1 | 511004150 | 511047585 | Bufo gargarizans 30331 | CTG\|GTAATGCTCA...GATGTCTTAACC/GATGTCTTAACC...TCCAG\|GCT | 0 | 1 | 1.116 |
| 28483563 | GT-AG | 0 | 1.000000099473604e-05 | 46219 | rna-XM_044275306.1 5530394 | 2 | 511047824 | 511094042 | Bufo gargarizans 30331 | CCA\|GTGAGTACAC...TACATCCTACTA/ATCCTACTAAGC...TGTAG\|TGC | 1 | 1 | 4.804 |
| 28483564 | GT-AG | 0 | 1.000000099473604e-05 | 793 | rna-XM_044275306.1 5530394 | 3 | 511094128 | 511094920 | Bufo gargarizans 30331 | AAG\|GTAAGAGGTT...CACTGTTTATTT/ACACTGTTTATT...CCTAG\|GTG | 2 | 1 | 6.121 |
| 28483565 | GT-AG | 0 | 0.0101897945584386 | 4210 | rna-XM_044275306.1 5530394 | 4 | 511095005 | 511099214 | Bufo gargarizans 30331 | GAG\|GTAACTTTTT...TGTCCTTTTGTT/TGAGATCTGATG...TCTAG\|TGA | 2 | 1 | 7.423 |
| 28483566 | GT-AG | 0 | 1.000000099473604e-05 | 419 | rna-XM_044275306.1 5530394 | 5 | 511099361 | 511099779 | Bufo gargarizans 30331 | AAG\|GTAAGCACCA...TTTTTTTTTTCT/TTTTCTCTCATT...TACAG\|TTA | 1 | 1 | 9.685 |
| 28483567 | GT-AG | 0 | 1.000000099473604e-05 | 234 | rna-XM_044275306.1 5530394 | 6 | 511099975 | 511100208 | Bufo gargarizans 30331 | TGG\|GTAAGGATCT...TTTTCCCCATCT/ATTATAGTTACC...CTCAG\|GAG | 1 | 1 | 12.707 |
| 28483568 | GT-AG | 0 | 1.000000099473604e-05 | 2349 | rna-XM_044275306.1 5530394 | 7 | 511100398 | 511102746 | Bufo gargarizans 30331 | CTG\|GTAAGTTGCC...ACTCATTTAATA/CAGGAACTCATT...TACAG\|GAG | 1 | 1 | 15.636 |
| 28483569 | GT-AG | 0 | 0.0002532207943881 | 3151 | rna-XM_044275306.1 5530394 | 8 | 511102913 | 511106063 | Bufo gargarizans 30331 | AAA\|GTAAGTTTTT...TCATCCTCAACT/TTTTTTGTCATC...TAAAG\|GTC | 2 | 1 | 18.209 |
| 28483570 | GT-AG | 0 | 1.000000099473604e-05 | 3445 | rna-XM_044275306.1 5530394 | 9 | 511106163 | 511109607 | Bufo gargarizans 30331 | CAG\|GTGAGATCTA...TGATATTTAATT/TTTAATTTAATA...TGCAG\|GCA | 2 | 1 | 19.743 |
| 28483571 | GT-AG | 0 | 0.0002460192140194 | 581 | rna-XM_044275306.1 5530394 | 10 | 511110334 | 511110914 | Bufo gargarizans 30331 | AAG\|GTACAATTGG...TTTTCCTTACTG/CTTTTCCTTACT...TACAG\|GCT | 2 | 1 | 30.993 |
| 28483572 | GT-AG | 0 | 1.000000099473604e-05 | 6602 | rna-XM_044275306.1 5530394 | 11 | 511111138 | 511117739 | Bufo gargarizans 30331 | AAG\|GTTATTTTCT...TTGGTCATAATT/TGTTTATTTATT...TTCAG\|TCT | 0 | 1 | 34.449 |
| 28483573 | GT-AG | 0 | 1.1928048173102534e-05 | 196 | rna-XM_044275306.1 5530394 | 12 | 511117843 | 511118038 | Bufo gargarizans 30331 | CAG\|GTAGATATCC...CTTTCTTTGGCA/AAATCTCTCACT...TACAG\|ACC | 1 | 1 | 36.045 |
| 28483574 | GT-AG | 0 | 0.0150104407863976 | 1203 | rna-XM_044275306.1 5530394 | 13 | 511118149 | 511119351 | Bufo gargarizans 30331 | CTG\|GTATGTTTTT...TTTTCTTTTGTT/TCATTAGTCATT...AAAAG\|GCG | 0 | 1 | 37.75 |
| 28483575 | GT-AG | 0 | 1.000000099473604e-05 | 477 | rna-XM_044275306.1 5530394 | 14 | 511119452 | 511119928 | Bufo gargarizans 30331 | CAA\|GTGAGTGTTA...TACTTTTTACTT/TTACTTTTTACT...TTCAG\|CAA | 1 | 1 | 39.3 |
| 28483576 | GT-AG | 0 | 0.0174072305026316 | 8449 | rna-XM_044275306.1 5530394 | 15 | 511120150 | 511128598 | Bufo gargarizans 30331 | AAG\|GTATACATGC...CTGCCTTTATTG/CATCTACTAATA...AATAG\|GAC | 0 | 1 | 42.724 |
| 28483577 | GT-AG | 0 | 1.000000099473604e-05 | 111 | rna-XM_044275306.1 5530394 | 16 | 511128805 | 511128915 | Bufo gargarizans 30331 | TAT\|GTGAGTACTG...AGGGCATTAATG/TCTGCATTTACC...TACAG\|GAA | 2 | 1 | 45.917 |
| 28483578 | GT-AG | 0 | 1.000000099473604e-05 | 1288 | rna-XM_044275306.1 5530394 | 17 | 511129845 | 511131132 | Bufo gargarizans 30331 | ATG\|GTGAGAATTC...TTTTTTTTATTT/TTTTTTTTTATT...TTAAG\|TGC | 1 | 1 | 60.313 |
| 28483579 | GT-AG | 0 | 1.5428149140419245e-05 | 2815 | rna-XM_044275306.1 5530394 | 18 | 511131313 | 511134127 | Bufo gargarizans 30331 | ATG\|GTATGATGAT...TATTTTCTATAT/GTGTTACTGATG...CTCAG\|GAT | 1 | 1 | 63.102 |
| 28483580 | GT-AG | 0 | 1.5306018563868154e-05 | 6430 | rna-XM_044275306.1 5530394 | 19 | 511134352 | 511140781 | Bufo gargarizans 30331 | GAG\|GTAAGCGTCT...TTTACATTGACC/CTCGTTTTTACA...TACAG\|ATG | 0 | 1 | 66.574 |
| 28483581 | GT-AG | 0 | 1.000000099473604e-05 | 122 | rna-XM_044275306.1 5530394 | 20 | 511140948 | 511141069 | Bufo gargarizans 30331 | TAG\|GTTGGTCATT...TTCTCCCTATTT/TATTTATTTATA...CGTAG\|CCC | 1 | 1 | 69.146 |
| 28483582 | GT-AG | 0 | 1.000000099473604e-05 | 2436 | rna-XM_044275306.1 5530394 | 21 | 511141395 | 511143830 | Bufo gargarizans 30331 | AAG\|GTAAGTGAAA...GTCTTCTTTTTA/TTGTAATTCATG...AACAG\|TAC | 2 | 1 | 74.183 |
| 28483583 | GT-AG | 0 | 1.000000099473604e-05 | 620 | rna-XM_044275306.1 5530394 | 22 | 511144054 | 511144673 | Bufo gargarizans 30331 | CAG\|GTAAGTGATC...TTTTTTTTATCT/GTTGGTTTCAAT...TAAAG\|GTG | 0 | 1 | 77.638 |
| 28483584 | GT-AG | 0 | 1.000000099473604e-05 | 109 | rna-XM_044275306.1 5530394 | 23 | 511144863 | 511144971 | Bufo gargarizans 30331 | GAG\|GTAATGGAAA...ACTTGCTTAATC/ACTTGCTTAATC...TGCAG\|CAT | 0 | 1 | 80.567 |
| 28483585 | GT-AG | 0 | 1.000000099473604e-05 | 3659 | rna-XM_044275306.1 5530394 | 24 | 511145196 | 511148854 | Bufo gargarizans 30331 | TAG\|GTAAGAAACT...TGTCTCTCATTT/ATGTCTCTCATT...CTCAG\|GTT | 2 | 1 | 84.038 |
| 28483586 | GT-AG | 0 | 0.0002824982401668 | 85 | rna-XM_044275306.1 5530394 | 25 | 511148998 | 511149082 | Bufo gargarizans 30331 | AAG\|GTATATAGCT...CTTTTCTTTTCA/TTTCTTTTCATA...TGCAG\|ACT | 1 | 1 | 86.254 |
| 28483587 | GT-AG | 0 | 1.000000099473604e-05 | 1113 | rna-XM_044275306.1 5530394 | 26 | 511149242 | 511150354 | Bufo gargarizans 30331 | CAG\|GTAAATGTGA...CCAATGTTAAAT/CTAGTGCTAATC...TACAG\|ATG | 1 | 1 | 88.718 |
| 28483588 | GT-AG | 0 | 1.000000099473604e-05 | 736 | rna-XM_044275306.1 5530394 | 27 | 511150526 | 511151261 | Bufo gargarizans 30331 | ATG\|GTGAGTAGCT...TTGTTCTAAATG/CTTGTTCTAAAT...TTCAG\|ATG | 1 | 1 | 91.368 |
| 28483589 | GT-AG | 0 | 1.000000099473604e-05 | 1720 | rna-XM_044275306.1 5530394 | 28 | 511151411 | 511153130 | Bufo gargarizans 30331 | CGG\|GTACGTAGGG...TTTTGCTTACTT/CTTTTGTTCATT...TGTAG\|AGT | 0 | 1 | 93.677 |
| 28483590 | GT-AG | 0 | 1.000000099473604e-05 | 2809 | rna-XM_044275306.1 5530394 | 29 | 511153293 | 511156101 | Bufo gargarizans 30331 | AAG\|GTAAAAACTG...TGTTTCTTCGTA/TGTTTACTGATC...CACAG\|GCT | 0 | 1 | 96.188 |
| 28483591 | GT-AG | 0 | 1.000000099473604e-05 | 111 | rna-XM_044275306.1 5530394 | 30 | 511156215 | 511156325 | Bufo gargarizans 30331 | TAG\|GTAAGTTTAC...TATTCATTGAAA/TGTGTATTCATT...TTTAG\|GTT | 2 | 1 | 97.939 |
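
In the rows above, length agrees with end - start + 1, which suggests that start and end are inclusive coordinates (this is inferred from the listed values, not stated by the database). The sketch below loads the first three rows into an in-memory SQLite table with Python's standard `sqlite3` module and checks that relationship; the helper name `length_mismatches` is made up for this example.

```python
import sqlite3

# Cut-down copy of the introns table with only the columns used here.
# "start" and "end" are quoted because END is an SQL keyword.
conn = sqlite3.connect(":memory:")
conn.execute(
    """
    CREATE TABLE introns (
        id INTEGER PRIMARY KEY,
        length INTEGER,
        transcript_id INTEGER,
        ordinal_index INTEGER,
        "start" INTEGER,
        "end" INTEGER
    )
    """
)

# First three rows of the result table for transcript_id = 5530394.
conn.executemany(
    "INSERT INTO introns VALUES (?, ?, ?, ?, ?, ?)",
    [
        (28483562, 43436, 5530394, 1, 511004150, 511047585),
        (28483563, 46219, 5530394, 2, 511047824, 511094042),
        (28483564, 793, 5530394, 3, 511094128, 511094920),
    ],
)


def length_mismatches(connection, transcript_id):
    """Return ids of introns whose length differs from end - start + 1."""
    rows = connection.execute(
        'SELECT id FROM introns '
        'WHERE transcript_id = ? AND length != "end" - "start" + 1 '
        'ORDER BY id',
        (transcript_id,),
    ).fetchall()
    return [row[0] for row in rows]


mismatches = length_mismatches(conn, 5530394)
print(mismatches)
```

An empty list means every loaded row satisfies the inclusive-coordinate relationship; the same query could be run against the full table to test the assumption more broadly.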
CREATE TABLE "introns" (
    "id" INTEGER,
    "dinucleotide_pair" TEXT,
    "is_minor" INTEGER,
    "score" REAL,
    "length" INTEGER,
    "transcript_id" INTEGER,
    "ordinal_index" INTEGER,
    "start" INTEGER,
    "end" INTEGER,
    "taxonomy_id" INTEGER,
    "scored_motifs" TEXT,
    "phase" INTEGER,
    "in_cds" INTEGER,
    "relative_position" REAL,
    PRIMARY KEY ([id]),
    FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
    FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
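
The index on transcript_id is what lets a filter such as `transcript_id = 5530394` avoid a full table scan. As a rough illustration, the sketch below recreates a reduced version of the schema in an in-memory SQLite database (foreign keys and most columns are omitted for brevity, which is a simplification of the DDL above) and asks SQLite for its query plan with `EXPLAIN QUERY PLAN`.

```python
import sqlite3

# Reduced schema: enough columns and indexes to inspect the planner's choice.
conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE introns (
        "id" INTEGER,
        "transcript_id" INTEGER,
        "score" REAL,
        PRIMARY KEY ("id")
    );
    CREATE INDEX idx_introns_transcript_id ON introns (transcript_id);
    CREATE INDEX idx_introns_score ON introns (score);
    """
)

# Each plan row is (id, parent, notused, detail); the detail text names
# the index the planner intends to use, if any.
plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT * FROM introns WHERE transcript_id = ?",
    (5530394,),
).fetchall()
plan_text = " ".join(row[3] for row in plan)
print(plan_text)
```

The exact wording of the plan varies between SQLite versions, so it is safer to look for the index name in the output than to compare the whole string.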