introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
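As an illustration of how `dinucleotide_pair` relates to `scored_motifs`, the sketch below recovers the terminal dinucleotides from a motif string. It assumes that the first and last `|` characters mark the exon/intron boundaries, which is consistent with the rows in this table but is not stated in the column descriptions. The function name `terminal_dinucleotides` is hypothetical.

```python
def terminal_dinucleotides(scored_motifs: str) -> str:
    """Return a pair such as 'GT-AG' from a scored_motifs string.

    Assumption (not documented in the source): the first and last '|'
    separate the flanking exon bases from the intron sequence.
    """
    # Drop the upstream exon flank (text before the first '|').
    _upstream_exon, rest = scored_motifs.split("|", 1)
    # Drop the downstream exon flank (text after the last '|').
    intron_part, _downstream_exon = rest.rsplit("|", 1)
    # First two and last two bases of what remains are the intron termini.
    return f"{intron_part[:2]}-{intron_part[-2:]}"


# Motif string copied from the row with id 179733012.
example = "CAG|GTAAAGCTAA...TATTCTTTCATC/TATTCTTTCATC...TACAG|GAG"
pair = terminal_dinucleotides(example)
print(pair)
```

For the example string this yields the same value as that row's `dinucleotide_pair` column.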
31 rows where transcript_id = 32191395
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 179733012 | GT-AG | 0 | 1.000000099473604e-05 | 31325 | rna-XM_047132739.1 32191395 | 1 | 207056351 | 207087675 | Schistocerca americana 7009 | CAG\|GTAAAGCTAA...TATTCTTTCATC/TATTCTTTCATC...TACAG\|GAG | 0 | 1 | 1.251 |
| 179733013 | GT-AG | 0 | 2.2713837718428508e-05 | 8242 | rna-XM_047132739.1 32191395 | 2 | 207047995 | 207056236 | Schistocerca americana 7009 | GAG\|GTTTGTATTT...TTAACATTAATG/GTAAGTTTAACA...TACAG\|GTG | 0 | 1 | 2.89 |
| 179733014 | GT-AG | 0 | 1.000000099473604e-05 | 4354 | rna-XM_047132739.1 32191395 | 3 | 207043509 | 207047862 | Schistocerca americana 7009 | AAG\|GTATTGAAAT...AAGTCATTATTG/TAATATATGATT...TGCAG\|GCA | 0 | 1 | 4.789 |
| 179733015 | GT-AG | 0 | 1.000000099473604e-05 | 2128 | rna-XM_047132739.1 32191395 | 4 | 207041279 | 207043406 | Schistocerca americana 7009 | GAG\|GTGAGCAGAA...TTGACTTTAGTG/TTGTGATTTATT...TTTAG\|GGC | 0 | 1 | 6.255 |
| 179733016 | GT-AG | 0 | 1.000000099473604e-05 | 107 | rna-XM_047132739.1 32191395 | 5 | 207040963 | 207041069 | Schistocerca americana 7009 | CTG\|GTGAGTATGT...GTGCTCTCAATA/CAATATTTTATC...AATAG\|CAA | 2 | 1 | 9.261 |
| 179733017 | GT-AG | 0 | 1.000000099473604e-05 | 11171 | rna-XM_047132739.1 32191395 | 6 | 207029587 | 207040757 | Schistocerca americana 7009 | AAG\|GTATGAGAAA...TTTCTTTTAGTA/TAATATCTTACA...TTCAG\|CGT | 0 | 1 | 12.209 |
| 179733018 | GT-AG | 0 | 0.0001878228067043 | 6469 | rna-XM_047132739.1 32191395 | 7 | 207022940 | 207029408 | Schistocerca americana 7009 | AAG\|GTCTTTTTGT...ACCACCATGATT/CCATGATTAATA...TATAG\|GTT | 1 | 1 | 14.768 |
| 179733019 | GT-AG | 0 | 1.649145029388392e-05 | 7816 | rna-XM_047132739.1 32191395 | 8 | 207014934 | 207022749 | Schistocerca americana 7009 | GCT\|GTAAGTAACT...TTTATTTTACAG/GTTTATTTTACA...TTCAG\|ACC | 2 | 1 | 17.501 |
| 179733020 | GT-AG | 0 | 1.000000099473604e-05 | 79 | rna-XM_047132739.1 32191395 | 9 | 207014710 | 207014788 | Schistocerca americana 7009 | CAG\|GTGTGTGTAA...GAAATTTTAGAG/ATTTCATTCAGA...TACAG\|GGA | 0 | 1 | 19.586 |
| 179733021 | GT-AG | 0 | 7.6790190631568e-05 | 10492 | rna-XM_047132739.1 32191395 | 10 | 207003988 | 207014479 | Schistocerca americana 7009 | GAG\|GTAAGCATTA...ATATTCTTACAT/AATATTCTTACA...TTCAG\|AGA | 2 | 1 | 22.893 |
| 179733022 | GT-AG | 0 | 0.0002474080681982 | 8405 | rna-XM_047132739.1 32191395 | 11 | 206995442 | 207003846 | Schistocerca americana 7009 | ACT\|GTAAGTGTTT...TTTTTTTTATTT/TTTTTTTTCACT...TATAG\|CGC | 2 | 1 | 24.921 |
| 179733023 | GT-AG | 0 | 1.425933862255856e-05 | 734 | rna-XM_047132739.1 32191395 | 12 | 206994536 | 206995269 | Schistocerca americana 7009 | ACT\|GTAAGTATAC...TTCTGTTTATAC/TTTCTGTTTATA...TTCAG\|TTG | 0 | 1 | 27.394 |
| 179733024 | GT-AG | 0 | 1.9719934155257524e-05 | 1617 | rna-XM_047132739.1 32191395 | 13 | 206992794 | 206994410 | Schistocerca americana 7009 | CAG\|GTACAGTTTA...TCACATTTATTT/AGTGCTCTCACA...TTTAG\|GAA | 2 | 1 | 29.192 |
| 179733025 | GT-AG | 0 | 1.000000099473604e-05 | 7558 | rna-XM_047132739.1 32191395 | 14 | 206985046 | 206992603 | Schistocerca americana 7009 | CAG\|GTAAGTGTAA...AGTCTTTTATTT/TAGTCTTTTATT...TTCAG\|GAA | 0 | 1 | 31.924 |
| 179733026 | GT-AG | 0 | 1.000000099473604e-05 | 2071 | rna-XM_047132739.1 32191395 | 15 | 206982824 | 206984894 | Schistocerca americana 7009 | TTG\|GTTGGTAGTT...TACATTTTATCC/TCCTTTTTTACT...TAAAG\|TTG | 1 | 1 | 34.095 |
| 179733027 | GT-AG | 0 | 1.000000099473604e-05 | 24321 | rna-XM_047132739.1 32191395 | 16 | 206958364 | 206982684 | Schistocerca americana 7009 | ATG\|GTAAGTATAA...ACACTTTTATTT/TAAAATTTAACA...TACAG\|GGA | 2 | 1 | 36.094 |
| 179733028 | GT-AG | 0 | 1.000000099473604e-05 | 10102 | rna-XM_047132739.1 32191395 | 17 | 206948154 | 206958255 | Schistocerca americana 7009 | AAA\|GTAAGTGACT...AATTCTTTGAAA/AATTCTTTGAAA...TACAG\|ACA | 2 | 1 | 37.647 |
| 179733029 | GT-AG | 0 | 0.0360765301579314 | 631 | rna-XM_047132739.1 32191395 | 18 | 206947357 | 206947987 | Schistocerca americana 7009 | AAG\|GTATGCTTGC...AAGATTTTAAAT/AAGATTTTAAAT...CCTAG\|GCA | 0 | 1 | 40.035 |
| 179733030 | GT-AG | 0 | 1.000000099473604e-05 | 4130 | rna-XM_047132739.1 32191395 | 19 | 206943037 | 206947166 | Schistocerca americana 7009 | CAG\|GTGAACTATT...TGTGGTTTGAAT/TGTGGTTTGAAT...TTCAG\|TTC | 1 | 1 | 42.767 |
| 179733031 | GT-AG | 0 | 1.202595940902722e-05 | 84 | rna-XM_047132739.1 32191395 | 20 | 206942807 | 206942890 | Schistocerca americana 7009 | TGG\|GTTTGTAATT...TGTGCTCTAACT/TTTGTTTTTAAT...TTTAG\|CCT | 0 | 1 | 44.866 |
| 179733032 | GT-AG | 0 | 0.0009265426436777 | 17990 | rna-XM_047132739.1 32191395 | 21 | 206924703 | 206942692 | Schistocerca americana 7009 | TCT\|GTATGTACTT...GTTCACATATAT/AAAATGCTGATG...TCTAG\|TGC | 0 | 1 | 46.506 |
| 179733033 | GT-AG | 0 | 0.2947813852622322 | 7907 | rna-XM_047132739.1 32191395 | 22 | 206916631 | 206924537 | Schistocerca americana 7009 | CAG\|GTATCTTATT...TATGTCTTCATC/TTCATTTTCACT...TTCAG\|GAG | 0 | 1 | 48.878 |
| 179733034 | GT-AG | 0 | 4.005560789897184e-05 | 10347 | rna-XM_047132739.1 32191395 | 23 | 206906185 | 206916531 | Schistocerca americana 7009 | AAG\|GTATGTAAAT...TGGTGCTTATTT/GTGGTGCTTATT...TAAAG\|GTC | 0 | 1 | 50.302 |
| 179733035 | GT-AG | 0 | 1.9520529817239647e-05 | 10098 | rna-XM_047132739.1 32191395 | 24 | 206895961 | 206906058 | Schistocerca americana 7009 | AAG\|GTATGATACT...GCTCTTGTAATA/AAATTAATAACT...TTCAG\|TCT | 0 | 1 | 52.114 |
| 179733036 | GT-AG | 0 | 1.000000099473604e-05 | 301 | rna-XM_047132739.1 32191395 | 25 | 206895427 | 206895727 | Schistocerca americana 7009 | GCT\|GTAAGTGTAC...AGGATATTATTT/AACATATTTATA...TACAG\|AAA | 2 | 1 | 55.464 |
| 179733037 | GT-AG | 0 | 0.000678064345919 | 13667 | rna-XM_047132739.1 32191395 | 26 | 206881660 | 206895326 | Schistocerca americana 7009 | GTG\|GTATGATTGC...ATATTTTTAAAC/CTGTTCCTCATT...TACAG\|GAC | 0 | 1 | 56.903 |
| 179733038 | GT-AG | 0 | 2.065108137968557e-05 | 2331 | rna-XM_047132739.1 32191395 | 27 | 206878515 | 206880845 | Schistocerca americana 7009 | GTG\|GTAAGTTGAA...TTGTTTTTAAAA/ATGACTCTCACT...TACAG\|ATC | 1 | 1 | 68.608 |
| 179733039 | GT-AG | 0 | 1.9919699494333157e-05 | 4420 | rna-XM_047132739.1 32191395 | 28 | 206873850 | 206878269 | Schistocerca americana 7009 | AAT\|GTAAGTATGA...CTCTCCATAACC/TTCTGTTTGACT...TCCAG\|GGT | 0 | 1 | 72.131 |
| 179733040 | GT-AG | 0 | 1.1939097137485135e-05 | 88 | rna-XM_047132739.1 32191395 | 29 | 206873598 | 206873685 | Schistocerca americana 7009 | TAA\|GTTAGTTTTT...AATTTTTTATTT/TTTTTTTTTATT...TTTAG\|GAA | 2 | 1 | 74.49 |
| 179733041 | GT-AG | 0 | 1.000000099473604e-05 | 24403 | rna-XM_047132739.1 32191395 | 30 | 206848984 | 206873386 | Schistocerca americana 7009 | AAG\|GTAAGGGAGC...TGAATTTTATAG/TTGAATTTTATA...TTTAG\|GCT | 0 | 1 | 77.524 |
| 179733042 | GT-AG | 0 | 0.0003615798396866 | 3933 | rna-XM_047132739.1 32191395 | 31 | 206844952 | 206848884 | Schistocerca americana 7009 | CAG\|GTATATACTG...TTTCTATTAAAT/TTTCTATTAAAT...TTCAG\|GGA | 0 | 1 | 78.947 |
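The rows above suggest that `start` and `end` are inclusive coordinates, so that `length = end - start + 1`. The following sketch checks this on three rows copied from the table, using an in-memory SQLite database with a reduced, illustrative version of the `introns` table (only the columns needed here). Because `end` is an SQL keyword, it is double-quoted in the query.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Reduced table for illustration only; the real schema has more columns.
conn.execute(
    "CREATE TABLE introns ("
    "id INTEGER PRIMARY KEY, length INTEGER, transcript_id INTEGER, "
    'ordinal_index INTEGER, "start" INTEGER, "end" INTEGER)'
)
# (id, length, transcript_id, ordinal_index, start, end) copied from the table.
rows = [
    (179733012, 31325, 32191395, 1, 207056351, 207087675),
    (179733013, 8242, 32191395, 2, 207047995, 207056236),
    (179733016, 107, 32191395, 5, 207040963, 207041069),
]
conn.executemany("INSERT INTO introns VALUES (?, ?, ?, ?, ?, ?)", rows)

# Rows whose stored length disagrees with the inclusive-coordinate formula.
mismatches = conn.execute(
    "SELECT id FROM introns "
    'WHERE transcript_id = ? AND length <> "end" - "start" + 1',
    (32191395,),
).fetchall()

# Start coordinates listed in transcript order (by ordinal_index).
starts = [r[0] for r in conn.execute(
    'SELECT "start" FROM introns ORDER BY ordinal_index'
)]
descending = starts == sorted(starts, reverse=True)

print(mismatches, descending)
```

In this sample the genomic coordinates decrease as `ordinal_index` increases, which would be consistent with a transcript annotated on the minus strand. That is an inference from the data; the table itself has no strand column.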
CREATE TABLE "introns" (
    "id" INTEGER,
    "dinucleotide_pair" TEXT,
    "is_minor" INTEGER,
    "score" REAL,
    "length" INTEGER,
    "transcript_id" INTEGER,
    "ordinal_index" INTEGER,
    "start" INTEGER,
    "end" INTEGER,
    "taxonomy_id" INTEGER,
    "scored_motifs" TEXT,
    "phase" INTEGER,
    "in_cds" INTEGER,
    "relative_position" REAL,
    PRIMARY KEY ([id]),
    FOREIGN KEY ([transcript_id]) REFERENCES [transcripts]([id]),
    FOREIGN KEY ([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
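To illustrate what an index such as `idx_introns_transcript_id` is for, the sketch below builds a cut-down, empty copy of the table in memory and asks SQLite how it would run the filter behind this page (`transcript_id = 32191395`). The reduced column set is an assumption made to keep the example short, and the exact wording of the plan text varies between SQLite versions.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Cut-down schema for illustration; foreign keys and most columns omitted.
conn.executescript("""
CREATE TABLE introns (
    id INTEGER PRIMARY KEY,
    transcript_id INTEGER,
    is_minor INTEGER,
    score REAL
);
CREATE INDEX idx_introns_transcript_id ON introns (transcript_id);
""")

# EXPLAIN QUERY PLAN returns rows whose last field describes each plan step.
plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT * FROM introns WHERE transcript_id = ?",
    (32191395,),
).fetchall()
plan_text = " ".join(str(row[-1]) for row in plan)
print(plan_text)
```

If the plan text names `idx_introns_transcript_id`, the lookup is served by an index search rather than a scan of the whole table.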