introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, probability (0-100%) that the intron is a minor intron
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
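The column descriptions above can be mirrored as a typed record. The following is a minimal sketch, not part of the database itself: the `Intron` class and the `intron_from_row` helper are hypothetical names introduced here, and the example values are copied from one of the rows listed for transcript 10113145 on this page. The split of `dinucleotide_pair` assumes the conventional order, 5' dinucleotide first and 3' dinucleotide second.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass(frozen=True)
class Intron:
    """One row of the introns table, typed per the column descriptions."""
    id: int
    dinucleotide_pair: str
    is_minor: bool              # stored as INTEGER 1/0
    score: float                # probability of being minor, in percent
    length: int                 # base pairs
    transcript_id: int
    ordinal_index: int
    start: int
    end: int
    taxonomy_id: int
    scored_motifs: str
    phase: Optional[int]        # None (SQL NULL) outside the coding sequence
    in_cds: bool                # stored as INTEGER 1/0
    relative_position: float

    def terminal_dinucleotides(self) -> Tuple[str, str]:
        """Split e.g. 'GT-AG' into (5' dinucleotide, 3' dinucleotide)."""
        five_prime, three_prime = self.dinucleotide_pair.split("-")
        return five_prime, three_prime


def intron_from_row(row: tuple) -> Intron:
    """Hypothetical helper: convert a raw 14-column row into an Intron."""
    (id_, pair, is_minor, score, length, transcript_id, ordinal_index,
     start, end, taxonomy_id, motifs, phase, in_cds, rel_pos) = row
    return Intron(id_, pair, bool(is_minor), score, length, transcript_id,
                  ordinal_index, start, end, taxonomy_id, motifs,
                  phase, bool(in_cds), rel_pos)


# Values taken from the row with id 55534469 on this page.
example = intron_from_row((
    55534469, "GT-AG", 0, 0.1379280511130578, 320, 10113145, 2,
    2215786, 2216105, 3662,
    "TAG|GTCTCATTCT...TTTTCCTTACTG/ATTTTCCTTACT...TTCAG|ATA", 1, 1, 3.784,
))
print(example.terminal_dinucleotides())
```

Keeping `phase` as `Optional[int]` preserves the distinction between phase 0 and a null phase, which a plain integer default would lose.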
33 rows where transcript_id = 10113145
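The filter shown here, `transcript_id = 10113145`, corresponds to a simple parameterized query. The sketch below runs it with Python's standard `sqlite3` module against an in-memory table that holds only four columns and three rows copied from this page; the function name `introns_for_transcript` and the fourth row (id -1, transcript 0) are hypothetical and exist only to show that non-matching rows are excluded.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Reduced, illustrative version of the introns table (4 of the 14 columns).
conn.execute(
    'CREATE TABLE introns ("id" INTEGER PRIMARY KEY, "transcript_id" INTEGER, '
    '"ordinal_index" INTEGER, "length" INTEGER)'
)
rows = [
    (55534469, 10113145, 2, 320),   # from this page
    (55534470, 10113145, 3, 250),   # from this page
    (55540890, 10113145, 1, 117),   # from this page
    (-1, 0, 1, 100),                # HYPOTHETICAL row for another transcript
]
conn.executemany("INSERT INTO introns VALUES (?, ?, ?, ?)", rows)


def introns_for_transcript(conn, transcript_id):
    """Return (id, ordinal_index, length) for one transcript, ordered by id."""
    cursor = conn.execute(
        "SELECT id, ordinal_index, length FROM introns "
        "WHERE transcript_id = ? ORDER BY id",
        (transcript_id,),
    )
    return cursor.fetchall()


result = introns_for_transcript(conn, 10113145)
print(result)
```

Ordering by `id` matches the listing on this page, which is why the first intron of the transcript (id 55540890) appears last; ordering by `ordinal_index` instead would list introns in transcript order.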
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 55534469 | GT-AG | 0 | 0.1379280511130578 | 320 | rna-XM_023079539.1 10113145 | 2 | 2215786 | 2216105 | Cucurbita moschata 3662 | TAG\|GTCTCATTCT...TTTTCCTTACTG/ATTTTCCTTACT...TTCAG\|ATA | 1 | 1 | 3.784 |
| 55534470 | GT-AG | 0 | 0.0001021563906998 | 250 | rna-XM_023079539.1 10113145 | 3 | 2215454 | 2215703 | Cucurbita moschata 3662 | CAG\|GTGTTTCTCC...AAGTCTTTGACA/AACTTTTTAAAT...TATAG\|TGA | 2 | 1 | 5.282 |
| 55534471 | GT-AG | 0 | 0.0181411220894588 | 133 | rna-XM_023079539.1 10113145 | 4 | 2215098 | 2215230 | Cucurbita moschata 3662 | CAT\|GTATGTCTTT...GTTATCTTAAGA/AGATCTCTCATT...GACAG\|AAA | 0 | 1 | 9.358 |
| 55534472 | GT-AG | 0 | 6.400045145390162e-05 | 83 | rna-XM_023079539.1 10113145 | 5 | 2214893 | 2214975 | Cucurbita moschata 3662 | TAG\|GTATGTGCCG...TCTTTTATGACC/TCTTTTATGACC...TGCAG\|TAC | 2 | 1 | 11.588 |
| 55534473 | GT-AG | 0 | 0.0003521933736882 | 1027 | rna-XM_023079539.1 10113145 | 6 | 2213807 | 2214833 | Cucurbita moschata 3662 | GAG\|GTACATTGCT...CTCATCTTGAAA/CTTATTCTCATC...TGCAG\|TAC | 1 | 1 | 12.667 |
| 55534474 | GT-AG | 0 | 3.0456093141700903e-05 | 84 | rna-XM_023079539.1 10113145 | 7 | 2213665 | 2213748 | Cucurbita moschata 3662 | TAG\|GTAGTTTGCG...ATTTCCTTCTAT/TTATTAGTCAAC...TGTAG\|CAA | 2 | 1 | 13.727 |
| 55534475 | GT-AG | 0 | 0.0110793507244515 | 112 | rna-XM_023079539.1 10113145 | 8 | 2213468 | 2213579 | Cucurbita moschata 3662 | CAG\|GTATGCAGTT...TCTACCTTGATT/TCTACCTTGATT...TTCAG\|GTT | 0 | 1 | 15.281 |
| 55534476 | GT-AG | 0 | 4.59376792578385e-05 | 132 | rna-XM_023079539.1 10113145 | 9 | 2213108 | 2213239 | Cucurbita moschata 3662 | AAG\|GTACATATTC...ATTGTTATGAAT/ATTGTTATGAAT...AACAG\|GGT | 0 | 1 | 19.448 |
| 55534477 | GT-AG | 0 | 0.0098446594021487 | 97 | rna-XM_023079539.1 10113145 | 10 | 2212952 | 2213048 | Cucurbita moschata 3662 | ACG\|GTATATATTA...CTCTCGTTATTT/TCGTTATTTACA...TCAAG\|AGG | 2 | 1 | 20.526 |
| 55534478 | GT-AG | 0 | 1.000000099473604e-05 | 82 | rna-XM_023079539.1 10113145 | 11 | 2212800 | 2212881 | Cucurbita moschata 3662 | AAG\|GTTGGTTGTC...TTTTGTTTATCA/TTGTTTATCATT...TACAG\|ATG | 0 | 1 | 21.806 |
| 55534479 | GT-AG | 0 | 1.000000099473604e-05 | 968 | rna-XM_023079539.1 10113145 | 12 | 2211763 | 2212730 | Cucurbita moschata 3662 | CAG\|GTAAATCATG...AAGTCCTTTCCC/TGAATTCTGAAA...TGCAG\|GGT | 0 | 1 | 23.067 |
| 55534480 | GT-AG | 0 | 1.000000099473604e-05 | 88 | rna-XM_023079539.1 10113145 | 13 | 2211552 | 2211639 | Cucurbita moschata 3662 | CAG\|GTGCGCATAC...CTGATCTTGAAT/TGTTTTCTGATC...GACAG\|TAT | 0 | 1 | 25.315 |
| 55534481 | GT-AG | 0 | 0.0181436523312029 | 108 | rna-XM_023079539.1 10113145 | 14 | 2211390 | 2211497 | Cucurbita moschata 3662 | AAA\|GTATGTATCT...TTTCTTTTAATT/TTTCTTTTAATT...TTCAG\|GGT | 0 | 1 | 26.302 |
| 55534482 | GT-AG | 0 | 0.0003160361621975 | 98 | rna-XM_023079539.1 10113145 | 15 | 2211203 | 2211300 | Cucurbita moschata 3662 | GAA\|GTATGACTCT...AATTCCTTTTTT/TGGAATTTCATT...TCCAG\|GAT | 2 | 1 | 27.929 |
| 55534483 | GT-AG | 0 | 0.0002243241850033 | 97 | rna-XM_023079539.1 10113145 | 16 | 2211009 | 2211105 | Cucurbita moschata 3662 | ACG\|GTATGGTCCT...TCTTTCTTCTCT/TCTGTGCTTATT...TGTAG\|GTA | 0 | 1 | 29.702 |
| 55534484 | GT-AG | 0 | 1.000000099473604e-05 | 102 | rna-XM_023079539.1 10113145 | 17 | 2210808 | 2210909 | Cucurbita moschata 3662 | CAA\|GTGGGTCTTA...TTTTCTTTACCA/TTTTTCTTTACC...ACCAG\|TGT | 0 | 1 | 31.512 |
| 55534485 | GT-AG | 0 | 1.000000099473604e-05 | 143 | rna-XM_023079539.1 10113145 | 18 | 2210464 | 2210606 | Cucurbita moschata 3662 | GAG\|GTGATTAACA...TAGTTTTTATCA/GTAGTTTTTATC...TACAG\|TTC | 0 | 1 | 35.186 |
| 55534486 | GT-AG | 0 | 0.000977959791102 | 100 | rna-XM_023079539.1 10113145 | 19 | 2210253 | 2210352 | Cucurbita moschata 3662 | AAG\|GTAGTCTCTC...GACATTTTGACA/ATTTCTCTAATG...TACAG\|ACT | 0 | 1 | 37.214 |
| 55534487 | GT-AG | 0 | 0.0004290333299393 | 112 | rna-XM_023079539.1 10113145 | 20 | 2209956 | 2210067 | Cucurbita moschata 3662 | ACG\|GTATTATAGA...CATTTTCTAATG/CATTTTCTAATG...GACAG\|ATA | 2 | 1 | 40.596 |
| 55534488 | GT-AG | 0 | 0.008341994707105 | 452 | rna-XM_023079539.1 10113145 | 21 | 2208886 | 2209337 | Cucurbita moschata 3662 | CAG\|GTATCAGTCC...ATAACCTTACAC/CTATTTCTAATT...GCCAG\|GTT | 2 | 1 | 51.892 |
| 55534489 | GT-AG | 0 | 0.0011192226113941 | 584 | rna-XM_023079539.1 10113145 | 22 | 2207884 | 2208467 | Cucurbita moschata 3662 | TTG\|GTATGATGAG...TAGTTCTTAATG/TTAGTTCTTAAT...GCCAG\|GTT | 0 | 1 | 59.532 |
| 55534490 | GT-AG | 0 | 1.64523520785501e-05 | 105 | rna-XM_023079539.1 10113145 | 23 | 2207539 | 2207643 | Cucurbita moschata 3662 | CAA\|GTACGTCAAA...TATTTATTAGAT/AGCATATTTATT...TACAG\|ATG | 0 | 1 | 63.919 |
| 55534491 | GT-AG | 0 | 7.479748880493439e-05 | 95 | rna-XM_023079539.1 10113145 | 24 | 2207320 | 2207414 | Cucurbita moschata 3662 | AGG\|GTAAGCATTT...TACTCCTCAAAA/GAATATCTAATG...TGCAG\|TTA | 1 | 1 | 66.185 |
| 55534492 | GT-AG | 0 | 0.0001244388716911 | 145 | rna-XM_023079539.1 10113145 | 25 | 2207026 | 2207170 | Cucurbita moschata 3662 | GAG\|GTTTGTTGTC...AATTTCTCAACT/CAATTTCTCAAC...CACAG\|GGC | 0 | 1 | 68.909 |
| 55534493 | GT-AG | 0 | 0.0001810623742882 | 135 | rna-XM_023079539.1 10113145 | 26 | 2206534 | 2206668 | Cucurbita moschata 3662 | AAG\|GTATTAATTT...ACCTTTTTGAAT/TTTGAATTGATA...TTTAG\|GAG | 0 | 1 | 75.434 |
| 55534494 | GT-AG | 0 | 1.000000099473604e-05 | 621 | rna-XM_023079539.1 10113145 | 27 | 2205746 | 2206366 | Cucurbita moschata 3662 | AAG\|GTGAGCATTA...CATTCTTTTGCT/ATTATTATCATT...CTCAG\|AGA | 2 | 1 | 78.487 |
| 55534495 | GT-AG | 0 | 1.5563248691645842e-05 | 118 | rna-XM_023079539.1 10113145 | 28 | 2205371 | 2205488 | Cucurbita moschata 3662 | AAG\|GTCATCTTTT...ATAAGCTTGACC/ATAAGCTTGACC...TGTAG\|GCC | 1 | 1 | 83.184 |
| 55534496 | GT-AG | 0 | 0.0001385336724012 | 1368 | rna-XM_023079539.1 10113145 | 29 | 2203833 | 2205200 | Cucurbita moschata 3662 | AAG\|GTATTGTACT...TATCTCTTATGT/ATTTGTTTAAAT...TGTAG\|CTT | 0 | 1 | 86.291 |
| 55534497 | GT-AG | 0 | 1.000000099473604e-05 | 135 | rna-XM_023079539.1 10113145 | 30 | 2203578 | 2203712 | Cucurbita moschata 3662 | CAG\|GTTTGAAGCT...TTTGCATTATAA/TAAGTGCTCATT...TGTAG\|AAA | 0 | 1 | 88.485 |
| 55534498 | GT-AG | 0 | 0.0001450566571556 | 81 | rna-XM_023079539.1 10113145 | 31 | 2203360 | 2203440 | Cucurbita moschata 3662 | GAG\|GTACACAGAT...TAGTTCTTTTTT/TTTGTATTGAAA...AACAG\|GCC | 2 | 1 | 90.989 |
| 55534499 | GT-AG | 0 | 0.0079239786688323 | 317 | rna-XM_023079539.1 10113145 | 32 | 2202709 | 2203025 | Cucurbita moschata 3662 | AAG\|GTATCAATCA...ATTGTTTCAATC/AATTGTTTCAAT...TTCAG\|GTT | 0 | 1 | 97.094 |
| 55534500 | GT-AG | 0 | 0.0477352783241923 | 557 | rna-XM_023079539.1 10113145 | 33 | 2202056 | 2202612 | Cucurbita moschata 3662 | CAG\|GTATTCATTC...TATTTCTTTTCT/TCTTTTCTGATA...TTCAG\|ATC | 0 | 1 | 98.848 |
| 55540890 | GT-AG | 0 | 0.0005873269068312 | 117 | rna-XM_023079539.1 10113145 | 1 | 2216312 | 2216428 | Cucurbita moschata 3662 | CTC\|GTACTGTTTC...TTTTCCCCATTC/TCCCCATTCATT...TGCAG\|AAG |  | 0 | 1.846 |
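In the rows listed here, `length` appears to equal `end - start + 1`, which would mean `start` and `end` are inclusive coordinates. This is inferred from the listed values and is not stated by the source. The sketch below checks that relationship on four rows copied from this page; the helper name `inclusive_length` is hypothetical. It also checks that `start` decreases as `ordinal_index` increases, which would be consistent with a transcript on the reverse strand, again an inference rather than a documented fact.

```python
# (ordinal_index, start, end, length) copied from four rows on this page.
rows = [
    (1, 2216312, 2216428, 117),
    (2, 2215786, 2216105, 320),
    (3, 2215454, 2215703, 250),
    (29, 2203833, 2205200, 1368),
]


def inclusive_length(start: int, end: int) -> int:
    """Length of an interval whose start and end are both included."""
    return end - start + 1


# Does the stored length match the inclusive-interval formula for each row?
length_matches = [
    inclusive_length(start, end) == length
    for _, start, end, length in rows
]

# Are genomic starts strictly decreasing when sorted by ordinal_index?
starts_in_transcript_order = [start for _, start, _, _ in sorted(rows)]
starts_descending = all(
    earlier > later
    for earlier, later in zip(starts_in_transcript_order,
                              starts_in_transcript_order[1:])
)

print(length_matches, starts_descending)
```

If the same check were run across the whole table, any row failing it would be worth inspecting before relying on `length` for downstream filtering.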
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
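The indexes above determine which filters SQLite can answer without scanning the whole table. As a rough illustration, the sketch below recreates the schema in an empty in-memory database with Python's standard `sqlite3` module and asks SQLite for its query plan. The helper name `plan_details` is hypothetical, the referenced `transcripts` and `genomes` tables are deliberately not created (SQLite accepts the declaration and does not enforce foreign keys by default), and the exact wording of the plan text varies between SQLite versions.

```python
import sqlite3

SCHEMA = """
CREATE TABLE "introns" (
  "id" INTEGER, "dinucleotide_pair" TEXT, "is_minor" INTEGER,
  "score" REAL, "length" INTEGER, "transcript_id" INTEGER,
  "ordinal_index" INTEGER, "start" INTEGER, "end" INTEGER,
  "taxonomy_id" INTEGER, "scored_motifs" TEXT, "phase" INTEGER,
  "in_cds" INTEGER, "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
)
"""
# The seven single-column indexes listed in the schema above.
INDEXED_COLUMNS = ["transcript_id", "taxonomy_id", "phase", "is_minor",
                   "dinucleotide_pair", "score", "in_cds"]

conn = sqlite3.connect(":memory:")
conn.execute(SCHEMA)
for column in INDEXED_COLUMNS:
    conn.execute(
        f"CREATE INDEX [idx_introns_{column}] ON [introns] ([{column}])"
    )


def plan_details(sql, params=()):
    """Return the human-readable 'detail' column of EXPLAIN QUERY PLAN."""
    return [row[3] for row in conn.execute("EXPLAIN QUERY PLAN " + sql, params)]


# Filter on an indexed column: the plan should name the matching index.
indexed_plan = plan_details(
    "SELECT * FROM introns WHERE transcript_id = ?", (10113145,))
# Filter on a column with no index (length): expect a full table scan.
unindexed_plan = plan_details(
    "SELECT * FROM introns WHERE length > ?", (1000,))

print(indexed_plan)
print(unindexed_plan)
```

Note that `length`, `start`, `end`, `ordinal_index`, and `relative_position` have no index in this schema, so filters on those columns alone are likely to be slower on a table of this size unless combined with an indexed column.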