introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
27 rows where transcript_id = 10113166
This data as json, CSV (advanced)
Suggested facets: dinucleotide_pair, score, length, phase
| id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 55534802 | GT-AG | 0 | 0.0018554830469415 | 907 | rna-XM_023080516.1 10113166 | 1 | 6397111 | 6398017 | Cucurbita moschata 3662 | CTG|GTATGTTGTT...CTCACATTAGCT/AGAGAACTCACA...ATTAG|GTC | 0 | 1 | 13.633 |
| 55534803 | GT-AG | 0 | 1.000000099473604e-05 | 88 | rna-XM_023080516.1 10113166 | 2 | 6398270 | 6398357 | Cucurbita moschata 3662 | AAG|GTTTGTGTAT...ACCTCTTTGTTA/TATTGTGTCACT...TGTAG|ATT | 0 | 1 | 20.252 |
| 55534804 | GT-AG | 0 | 0.0021420070074738 | 118 | rna-XM_023080516.1 10113166 | 3 | 6398529 | 6398646 | Cucurbita moschata 3662 | CAG|GTGTGCTTGT...TTTTCTTTGATA/TTTTCTTTGATA...TATAG|TAT | 0 | 1 | 24.744 |
| 55534805 | GT-AG | 0 | 0.0007507337557048 | 188 | rna-XM_023080516.1 10113166 | 4 | 6398698 | 6398885 | Cucurbita moschata 3662 | AAG|GTATTGTACC...TGTCCTTTGAAA/CATAATATGATG...ATTAG|ATG | 0 | 1 | 26.084 |
| 55534806 | GT-AG | 0 | 0.0048508733643259 | 112 | rna-XM_023080516.1 10113166 | 5 | 6398979 | 6399090 | Cucurbita moschata 3662 | AAG|GTACTTTTGG...TTTCTCTTGATT/TTTCTCTTGATT...TATAG|GAG | 0 | 1 | 28.526 |
| 55534807 | GT-AG | 0 | 0.0148873951532491 | 409 | rna-XM_023080516.1 10113166 | 6 | 6399187 | 6399595 | Cucurbita moschata 3662 | GTG|GTATGCACCT...TGTCTCTTGAAC/ATTGTATTGACA...GTTAG|ATC | 0 | 1 | 31.048 |
| 55534808 | GT-AG | 0 | 3.907242776085566e-05 | 78 | rna-XM_023080516.1 10113166 | 7 | 6399647 | 6399724 | Cucurbita moschata 3662 | CAG|GTAGCACTCA...ATTATTTTATGT/GATTATTTTATG...AACAG|GGG | 0 | 1 | 32.388 |
| 55534809 | GT-AG | 0 | 1.000000099473604e-05 | 261 | rna-XM_023080516.1 10113166 | 8 | 6399844 | 6400104 | Cucurbita moschata 3662 | TGG|GTAAGAACTG...TGTATTTCAACA/GTGTATTTCAAC...TGCAG|TTA | 2 | 1 | 35.514 |
| 55534810 | GT-AG | 0 | 0.4254657504081139 | 640 | rna-XM_023080516.1 10113166 | 9 | 6400214 | 6400853 | Cucurbita moschata 3662 | CAA|GTATGCTCCT...TTCTTTTTAATC/ATTTTCTTTATT...GGTAG|GTT | 0 | 1 | 38.377 |
| 55534811 | GT-AG | 0 | 1.000000099473604e-05 | 186 | rna-XM_023080516.1 10113166 | 10 | 6401049 | 6401234 | Cucurbita moschata 3662 | AAG|GTAATGAACT...ATATCATTCACA/ATATCATTCACA...TCAAG|GTT | 0 | 1 | 43.499 |
| 55534812 | GT-AG | 0 | 6.130671459775669e-05 | 96 | rna-XM_023080516.1 10113166 | 11 | 6401307 | 6401402 | Cucurbita moschata 3662 | CAG|GTATTTGATC...GATTGTTTAAAT/CTCATTTTCATG...AATAG|GTT | 0 | 1 | 45.39 |
| 55534813 | GT-AG | 0 | 1.000000099473604e-05 | 859 | rna-XM_023080516.1 10113166 | 12 | 6401470 | 6402328 | Cucurbita moschata 3662 | AAG|GTCTGGGTTA...TTTCTCTTATTC/CATTTTCTAATT...CACAG|GAG | 1 | 1 | 47.15 |
| 55534814 | GT-AG | 0 | 1.000000099473604e-05 | 76 | rna-XM_023080516.1 10113166 | 13 | 6402472 | 6402547 | Cucurbita moschata 3662 | CAG|GTTAGACCAA...TAAAACTTAAAG/TTAAAACTTAAA...TTTAG|GTT | 0 | 1 | 50.906 |
| 55534815 | GC-AG | 0 | 1.000000099473604e-05 | 79 | rna-XM_023080516.1 10113166 | 14 | 6402620 | 6402698 | Cucurbita moschata 3662 | CAG|GCATGTGGAG...CTTTCCTTTCTT/CTTCTACTGACT...TATAG|GCT | 0 | 1 | 52.797 |
| 55534816 | GT-AG | 0 | 1.000000099473604e-05 | 1432 | rna-XM_023080516.1 10113166 | 15 | 6402828 | 6404259 | Cucurbita moschata 3662 | CAG|GTATTAGGTT...TTCTCCTTTTTC/AGAATTCTGATG...TGTAG|GTT | 0 | 1 | 56.186 |
| 55534817 | GT-AG | 0 | 9.822919184689958e-05 | 499 | rna-XM_023080516.1 10113166 | 16 | 6404416 | 6404914 | Cucurbita moschata 3662 | CAG|GTATTAATGG...CATGTTTTGAAT/TGAATTTTCACA...TTTAG|ATG | 0 | 1 | 60.284 |
| 55534818 | GT-AG | 0 | 0.0006908974615438 | 100 | rna-XM_023080516.1 10113166 | 17 | 6405029 | 6405128 | Cucurbita moschata 3662 | CAG|GTACATTAGT...TTTTTTTTGATG/TATATACTGATT...TATAG|ATT | 0 | 1 | 63.278 |
| 55534819 | GT-AG | 0 | 1.000000099473604e-05 | 68 | rna-XM_023080516.1 10113166 | 18 | 6405228 | 6405295 | Cucurbita moschata 3662 | GAG|GTAATACTCA...ATATCCTTGCTA/CAATTTTGTACT...AACAG|GTT | 0 | 1 | 65.879 |
| 55534820 | GT-AG | 0 | 1.000000099473604e-05 | 466 | rna-XM_023080516.1 10113166 | 19 | 6405374 | 6405839 | Cucurbita moschata 3662 | CAG|GTTAGTTACT...TTTCTTTCAATA/AAGGTTGTCATT...TTTAG|GGC | 0 | 1 | 67.928 |
| 55534821 | GT-AG | 0 | 0.0028998393857484 | 168 | rna-XM_023080516.1 10113166 | 20 | 6405975 | 6406142 | Cucurbita moschata 3662 | AAG|GTATTTTGTT...TAAGCCTTTGCT/GTGTTTCTGACT...GGCAG|GAA | 0 | 1 | 71.474 |
| 55534822 | GT-AG | 0 | 0.0007089314023814 | 187 | rna-XM_023080516.1 10113166 | 21 | 6406218 | 6406404 | Cucurbita moschata 3662 | AAG|GTAGACCGTT...GATTTCTTACCT/TGATTTCTTACC...TGCAG|ATA | 0 | 1 | 73.444 |
| 55534823 | GT-AG | 0 | 0.0002376576386905 | 213 | rna-XM_023080516.1 10113166 | 22 | 6406523 | 6406735 | Cucurbita moschata 3662 | GAG|GTGTGTTTTC...TTTATTTTATTT/TTTTATTTTATT...TTCAG|GTT | 1 | 1 | 76.543 |
| 55534824 | GT-AG | 0 | 0.0010708255271245 | 77 | rna-XM_023080516.1 10113166 | 23 | 6407107 | 6407183 | Cucurbita moschata 3662 | AAT|GTGTGTATTT...TCCACCTTGATG/CTATGTCTTAAA...AATAG|ATG | 0 | 1 | 86.288 |
| 55534825 | GT-AG | 0 | 1.000000099473604e-05 | 83 | rna-XM_023080516.1 10113166 | 24 | 6407247 | 6407329 | Cucurbita moschata 3662 | CAG|GTAGTAATTG...CTTTTCTTAAGT/CTTTTCTTAAGT...ATCAG|GAC | 0 | 1 | 87.943 |
| 55534826 | GT-AG | 0 | 0.0001217025994419 | 278 | rna-XM_023080516.1 10113166 | 25 | 6407402 | 6407679 | Cucurbita moschata 3662 | CAG|GTACTTTCTA...TCTCTTTTGGTT/TTTTGGTTTATC...TGCAG|GCC | 0 | 1 | 89.835 |
| 55534827 | GT-AG | 0 | 0.0112762338947526 | 78 | rna-XM_023080516.1 10113166 | 26 | 6407812 | 6407889 | Cucurbita moschata 3662 | CAG|GTTTTCTTCT...TGGCTTTTAATC/TTTGATTTGATA...CACAG|GAT | 0 | 1 | 93.302 |
| 55534828 | GT-AG | 0 | 0.0246317187655398 | 212 | rna-XM_023080516.1 10113166 | 27 | 6408010 | 6408221 | Cucurbita moschata 3662 | CAG|GTTTCTCTTA...TTTTTCTTCTTT/AAAATGATAATT...TACAG|CAG | 0 | 1 | 96.454 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL
,PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);