introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier of the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
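The column descriptions above can be sanity-checked against a few rows. The sketch below is illustrative only: it builds an in-memory SQLite table with a subset of the documented columns, loads three rows copied from this page, and looks for rows whose `length` differs from `end - start + 1`. That inclusive-coordinate relationship is an assumption inferred from the rows shown here, not something the column documentation states.

```python
import sqlite3

# In-memory stand-in for the real database; only a subset of columns is created.
conn = sqlite3.connect(":memory:")
conn.execute(
    'CREATE TABLE introns ("id" INTEGER PRIMARY KEY, "length" INTEGER, '
    '"start" INTEGER, "end" INTEGER, "phase" INTEGER, "in_cds" INTEGER)'
)

# Three rows copied from the listing on this page.
conn.executemany(
    "INSERT INTO introns VALUES (?, ?, ?, ?, ?, ?)",
    [
        (179852781, 1203, 341212839, 341214041, 1, 1),
        (179852782, 3365, 341209314, 341212678, 2, 1),
        (179852787, 83, 341180376, 341180458, 0, 1),
    ],
)

# Assumption: coordinates are inclusive, so length should equal end - start + 1.
# "end" is quoted because END is an SQL keyword.
mismatches = conn.execute(
    'SELECT "id" FROM introns WHERE "length" <> "end" - "start" + 1'
).fetchall()
print(mismatches)
```

An empty result means every sampled row is consistent with inclusive coordinates; any ids returned would point to rows worth inspecting.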
42 rows where transcript_id = 32210495
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 179852781 | GC-AG | 0 | 1.000000099473604e-05 | 1203 | rna-XM_047245563.1 32210495 | 1 | 341212839 | 341214041 | Schistocerca piceifrons 274613 | TAG\|GCAAGTTGAA...TAGCTCTTGATA/TTATATTTTATA...TGCAG\|ATG | 1 | 1 | 1.368 |
| 179852782 | GT-AG | 0 | 1.000000099473604e-05 | 3365 | rna-XM_047245563.1 32210495 | 2 | 341209314 | 341212678 | Schistocerca piceifrons 274613 | CAT\|GTAAGTTCTC...GTATCATTTACA/ATATCATTTACA...TATAG\|GGC | 2 | 1 | 3.494 |
| 179852783 | GT-AG | 0 | 1.000000099473604e-05 | 10573 | rna-XM_047245563.1 32210495 | 3 | 341198596 | 341209168 | Schistocerca piceifrons 274613 | AAG\|GTAAGCAACA...TGTTTTCTGATG/TTGTTACTAATT...TGCAG\|AAA | 0 | 1 | 5.42 |
| 179852784 | GT-AG | 0 | 1.193077605833826e-05 | 9135 | rna-XM_047245563.1 32210495 | 4 | 341189324 | 341198458 | Schistocerca piceifrons 274613 | CAG\|GTGACTATTT...GCATACTTAGTT/GGCATACTTAGT...TTTAG\|GGA | 2 | 1 | 7.241 |
| 179852785 | GT-AG | 0 | 0.0011575558562997 | 5848 | rna-XM_047245563.1 32210495 | 5 | 341183322 | 341189169 | Schistocerca piceifrons 274613 | AAG\|GTATGTATTC...CATTTATTAATA/CCATTGCTCATT...TTCAG\|TCA | 0 | 1 | 9.287 |
| 179852786 | GT-AG | 0 | 9.54882693773958e-05 | 2557 | rna-XM_047245563.1 32210495 | 6 | 341180632 | 341183188 | Schistocerca piceifrons 274613 | AAG\|GTAGTTTTTA...TAATCTTTGCTT/ATAACTCTAATT...TTCAG\|GTC | 1 | 1 | 11.054 |
| 179852787 | GT-AG | 0 | 0.0001090756165193 | 83 | rna-XM_047245563.1 32210495 | 7 | 341180376 | 341180458 | Schistocerca piceifrons 274613 | CAG\|GTATTGTATA...GTGCCTTTAGGA/AAGTGTCTCAGT...TGCAG\|GTA | 0 | 1 | 13.352 |
| 179852788 | GT-AG | 0 | 1.000000099473604e-05 | 1611 | rna-XM_047245563.1 32210495 | 8 | 341178570 | 341180180 | Schistocerca piceifrons 274613 | AAG\|GTTGGTAAAT...ATTTTCTAAAAT/ATTGAATTAACC...ATTAG\|GCA | 0 | 1 | 15.943 |
| 179852789 | GT-AG | 0 | 1.000000099473604e-05 | 1646 | rna-XM_047245563.1 32210495 | 9 | 341176744 | 341178389 | Schistocerca piceifrons 274613 | TTG\|GTGAGTAGTG...ACAGTTTTAATA/CTTATTTTCATC...CCTAG\|GTT | 0 | 1 | 18.334 |
| 179852790 | GT-AG | 0 | 0.0003515163213614 | 2827 | rna-XM_047245563.1 32210495 | 10 | 341173771 | 341176597 | Schistocerca piceifrons 274613 | CAG\|GTATGTTAAG...TTTTTCTGAAAT/CTTTTTCTGAAA...TGCAG\|GTA | 2 | 1 | 20.274 |
| 179852791 | GT-AG | 0 | 0.0001119936376111 | 1904 | rna-XM_047245563.1 32210495 | 11 | 341171716 | 341173619 | Schistocerca piceifrons 274613 | CAG\|GTGACCAACT...TTTTTTTTAACT/TTTTTTTTAACT...TGCAG\|GAT | 0 | 1 | 22.28 |
| 179852792 | GT-AG | 0 | 1.000000099473604e-05 | 82 | rna-XM_047245563.1 32210495 | 12 | 341171457 | 341171538 | Schistocerca piceifrons 274613 | GAG\|GTAAAAAATT...TGTGTATTAAAA/TGTGTATTAAAA...TTTAG\|GAG | 0 | 1 | 24.631 |
| 179852793 | GT-AG | 0 | 1.000000099473604e-05 | 754 | rna-XM_047245563.1 32210495 | 13 | 341170515 | 341171268 | Schistocerca piceifrons 274613 | CGA\|GTGAGTATCA...CTCTTTTTGTTT/ATGAAGTTCATG...TGTAG\|ACA | 2 | 1 | 27.129 |
| 179852794 | GT-AG | 0 | 0.0003546064238719 | 223 | rna-XM_047245563.1 32210495 | 14 | 341170091 | 341170313 | Schistocerca piceifrons 274613 | TAG\|GTATTGCTTG...ATGTTGTTAATA/ATGTTGTTAATA...CACAG\|CCT | 2 | 1 | 29.799 |
| 179852795 | GT-AG | 0 | 0.0001090260239366 | 228 | rna-XM_047245563.1 32210495 | 15 | 341169693 | 341169920 | Schistocerca piceifrons 274613 | GAA\|GTACAAAAAT...GCTTTCTTAACT/GCTTTCTTAACT...CACAG\|GTC | 1 | 1 | 32.058 |
| 179852796 | GT-AG | 0 | 0.0039092066331524 | 4689 | rna-XM_047245563.1 32210495 | 16 | 341164831 | 341169519 | Schistocerca piceifrons 274613 | TGG\|GTATGCTCAC...CATTTTGTAAAT/ACTTGTATCATT...TTCAG\|GAC | 0 | 1 | 34.356 |
| 179852797 | GT-AG | 0 | 0.0036962761930109 | 758 | rna-XM_047245563.1 32210495 | 17 | 341163922 | 341164679 | Schistocerca piceifrons 274613 | CAG\|GTATTTATAT...GTATTTTTAGTG/TACTGTTTTATT...TTCAG\|ATA | 1 | 1 | 36.362 |
| 179852798 | GT-AG | 0 | 7.202310068686466e-05 | 1306 | rna-XM_047245563.1 32210495 | 18 | 341162338 | 341163643 | Schistocerca piceifrons 274613 | GAG\|GTAGATATTA...CAACCTTTATTT/TTGGAAATCATT...TTCAG\|CAC | 0 | 1 | 40.056 |
| 179852799 | GT-AG | 0 | 1.3463057316297251e-05 | 109 | rna-XM_047245563.1 32210495 | 19 | 341162083 | 341162191 | Schistocerca piceifrons 274613 | CAG\|GTTTTGTCTT...ATTTTATTAATT/ATTTTATTAATT...TTTAG\|AAC | 2 | 1 | 41.995 |
| 179852800 | GT-AG | 0 | 1.000000099473604e-05 | 113 | rna-XM_047245563.1 32210495 | 20 | 341161825 | 341161937 | Schistocerca piceifrons 274613 | CAG\|GTTAGCCTTT...CATGTTTTAAAG/AATAGTTTCATG...TTTAG\|GCT | 0 | 1 | 43.922 |
| 179852801 | GT-AG | 0 | 1.000000099473604e-05 | 86 | rna-XM_047245563.1 32210495 | 21 | 341161629 | 341161714 | Schistocerca piceifrons 274613 | TAA\|GTAAGTGTAA...AGTTTTTTATAG/CAGTTTTTTATA...AACAG\|GTT | 2 | 1 | 45.383 |
| 179852802 | GT-AG | 0 | 1.000000099473604e-05 | 16163 | rna-XM_047245563.1 32210495 | 22 | 341145326 | 341161488 | Schistocerca piceifrons 274613 | CAG\|GTGAGTCCTA...GTTTTGTTGATT/GTTTTGTTGATT...GACAG\|GAA | 1 | 1 | 47.243 |
| 179852803 | GT-AG | 0 | 0.0001800142502132 | 2020 | rna-XM_047245563.1 32210495 | 23 | 341143081 | 341145100 | Schistocerca piceifrons 274613 | ATG\|GTAATCCTTT...AAATTCTAAATT/TAAATTCTAAAT...TTCAG\|GTG | 1 | 1 | 50.232 |
| 179852804 | GT-AG | 0 | 1.000000099473604e-05 | 3165 | rna-XM_047245563.1 32210495 | 24 | 341139793 | 341142957 | Schistocerca piceifrons 274613 | CTG\|GTAAGTGGAA...TATGTCTGAATT/TTATGTCTGAAT...TCCAG\|GCG | 1 | 1 | 51.867 |
| 179852805 | GT-AG | 0 | 1.000000099473604e-05 | 7169 | rna-XM_047245563.1 32210495 | 25 | 341132424 | 341139592 | Schistocerca piceifrons 274613 | AAG\|GTGCTGCAAA...TTTTTTTTATTA/ATTTTTTTTATT...GTCAG\|ATT | 0 | 1 | 54.524 |
| 179852806 | GT-AG | 0 | 1.000000099473604e-05 | 175 | rna-XM_047245563.1 32210495 | 26 | 341132092 | 341132266 | Schistocerca piceifrons 274613 | CAG\|GTAATTGCAT...AAATCTTTATTT/AGGATGTTGATT...ATTAG\|GAC | 1 | 1 | 56.61 |
| 179852807 | GT-AG | 0 | 1.000000099473604e-05 | 105 | rna-XM_047245563.1 32210495 | 27 | 341131781 | 341131885 | Schistocerca piceifrons 274613 | GAG\|GTGAGTTAAC...TTTGTCTTGATC/TTTGTCTTGATC...ATTAG\|GTT | 0 | 1 | 59.346 |
| 179852808 | GT-AG | 0 | 6.292507710943166e-05 | 5596 | rna-XM_047245563.1 32210495 | 28 | 341126024 | 341131619 | Schistocerca piceifrons 274613 | GAG\|GTACAGCTTT...TACTTTTTAATT/TACTTTTTAATT...TGTAG\|GTT | 2 | 1 | 61.485 |
| 179852809 | GT-AG | 0 | 1.000000099473604e-05 | 117 | rna-XM_047245563.1 32210495 | 29 | 341125662 | 341125778 | Schistocerca piceifrons 274613 | TCG\|GTAATACTCT...CTTGTCATGAAT/TTGTAGCTAATA...GACAG\|GAA | 1 | 1 | 64.74 |
| 179852810 | GT-AG | 0 | 1.000000099473604e-05 | 2731 | rna-XM_047245563.1 32210495 | 30 | 341122764 | 341125494 | Schistocerca piceifrons 274613 | GAG\|GTAAAAATAA...TTGATTTTGATT/TTGATTTTGATT...AAAAG\|GCA | 0 | 1 | 66.959 |
| 179852811 | GT-AG | 0 | 0.0003132476056837 | 5147 | rna-XM_047245563.1 32210495 | 31 | 341117504 | 341122650 | Schistocerca piceifrons 274613 | CAG\|GTATAACTTC...CCTTCATTGATT/TTGGTATTGATC...TGCAG\|CCG | 2 | 1 | 68.46 |
| 179852812 | GT-AG | 0 | 1.000000099473604e-05 | 93 | rna-XM_047245563.1 32210495 | 32 | 341117221 | 341117313 | Schistocerca piceifrons 274613 | ACA\|GTGAGTTAAT...ATATTCTTTTTT/ATTGAATTGATA...CAAAG\|GTG | 0 | 1 | 70.984 |
| 179852813 | GT-AG | 0 | 1.000000099473604e-05 | 9103 | rna-XM_047245563.1 32210495 | 33 | 341107911 | 341117013 | Schistocerca piceifrons 274613 | AAG\|GTAATGCACG...TGTGTTTTCATT/TGTGTTTTCATT...TATAG\|GCA | 0 | 1 | 73.735 |
| 179852814 | GT-AG | 0 | 2.9827493590045617e-05 | 4260 | rna-XM_047245563.1 32210495 | 34 | 341103399 | 341107658 | Schistocerca piceifrons 274613 | CAG\|GTAGCTGCAT...TATTCATTACAG/TATGTTCTCATT...CACAG\|GCA | 0 | 1 | 77.083 |
| 179852815 | GT-AG | 0 | 0.0033626863507221 | 124 | rna-XM_047245563.1 32210495 | 35 | 341103133 | 341103256 | Schistocerca piceifrons 274613 | ATG\|GTATGTTGTA...ACATTCTCAACA/TTAAGCCTGACA...TTCAG\|GAA | 1 | 1 | 78.969 |
| 179852816 | GT-AG | 0 | 0.0005353223401785 | 8347 | rna-XM_047245563.1 32210495 | 36 | 341094676 | 341103022 | Schistocerca piceifrons 274613 | AAA\|GTAAATTTTT...TATTCTTTTTTT/GATAACTTCAAG...TTCAG\|GTT | 0 | 1 | 80.43 |
| 179852817 | GT-AG | 0 | 1.107362295694548e-05 | 4821 | rna-XM_047245563.1 32210495 | 37 | 341089732 | 341094552 | Schistocerca piceifrons 274613 | AGG\|GTAATGTTTA...TTTTATTTAACT/TTTTATTTAACT...TGCAG\|GAA | 0 | 1 | 82.065 |
| 179852818 | GT-AG | 0 | 1.000000099473604e-05 | 10302 | rna-XM_047245563.1 32210495 | 38 | 341079234 | 341089535 | Schistocerca piceifrons 274613 | CAG\|GTAAATGACT...ATCATCTTATAA/CATCATCTTATA...TTCAG\|AAG | 1 | 1 | 84.669 |
| 179852819 | GT-AG | 0 | 1.000000099473604e-05 | 10153 | rna-XM_047245563.1 32210495 | 39 | 341068894 | 341079046 | Schistocerca piceifrons 274613 | TGT\|GTGAGTACAT...CAAGTCTTGATT/ATTCTATTTATT...TTTAG\|GGG | 2 | 1 | 87.153 |
| 179852820 | GT-AG | 0 | 1.1288894039417185e-05 | 135 | rna-XM_047245563.1 32210495 | 40 | 341068504 | 341068638 | Schistocerca piceifrons 274613 | CAT\|GTAAGTACTA...TATTTTGTAACC/TATTTTGTAACC...TTCAG\|AGA | 2 | 1 | 90.541 |
| 179852821 | GT-AG | 0 | 2.7112561708065143e-05 | 471 | rna-XM_047245563.1 32210495 | 41 | 341067833 | 341068303 | Schistocerca piceifrons 274613 | CAG\|GTATGACTAT...AAATCCATAATT/CCATAATTAACA...CTTAG\|GGT | 1 | 1 | 93.198 |
| 179852822 | GT-AG | 0 | 1.7076710530775747e-05 | 393 | rna-XM_047245563.1 32210495 | 42 | 341067306 | 341067698 | Schistocerca piceifrons 274613 | CAG\|GTAAATTATT...TGAACTTTATTT/ATGAACTTTATT...TACAG\|GTA | 0 | 1 | 94.978 |
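
The listing above is the result of filtering `introns` on `transcript_id = 32210495`. A minimal sketch of the same filter with Python's `sqlite3` module follows, assuming a local SQLite copy of the table; here an in-memory table with a subset of columns stands in for it. The helper name `introns_for_transcript` and the extra row for transcript 999 are hypothetical, added only for illustration.

```python
import sqlite3

# In-memory stand-in for a local copy of the database (subset of columns).
conn = sqlite3.connect(":memory:")
conn.execute(
    'CREATE TABLE introns ("id" INTEGER PRIMARY KEY, "dinucleotide_pair" TEXT, '
    '"transcript_id" INTEGER, "ordinal_index" INTEGER, "phase" INTEGER)'
)
conn.executemany(
    "INSERT INTO introns VALUES (?, ?, ?, ?, ?)",
    [
        # First three rows of the listing on this page.
        (179852781, "GC-AG", 32210495, 1, 1),
        (179852782, "GT-AG", 32210495, 2, 2),
        (179852783, "GT-AG", 32210495, 3, 0),
        # Hypothetical row for a different transcript, to show the filter working.
        (1, "GT-AG", 999, 1, 0),
    ],
)


def introns_for_transcript(conn, transcript_id):
    """Return all introns of one transcript, ordered by id (hypothetical helper)."""
    return conn.execute(
        'SELECT * FROM introns WHERE "transcript_id" = ? ORDER BY "id"',
        (transcript_id,),
    ).fetchall()


rows = introns_for_transcript(conn, 32210495)

# Count introns per terminal dinucleotide pair within the same transcript.
pair_counts = dict(
    conn.execute(
        'SELECT "dinucleotide_pair", COUNT(*) FROM introns '
        'WHERE "transcript_id" = ? GROUP BY "dinucleotide_pair"',
        (32210495,),
    ).fetchall()
)
print(len(rows), pair_counts)
```

The transcript id is passed as a bound parameter rather than formatted into the SQL string, which keeps the query safe to reuse with other ids.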
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
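
The indexes above exist so that filters on columns such as `transcript_id` do not scan the whole table. One way to check this, sketched here on an empty in-memory table with a reduced set of columns, is to ask SQLite for its query plan. The exact wording of the plan text varies between SQLite versions, so the example only inspects whether the index name appears in it.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Reduced version of the schema: only the columns needed for this check.
conn.executescript(
    """
    CREATE TABLE "introns" (
      "id" INTEGER,
      "transcript_id" INTEGER,
      "phase" INTEGER,
      PRIMARY KEY ([id])
    );
    CREATE INDEX [idx_introns_transcript_id]
    ON [introns] ([transcript_id]);
    """
)

# EXPLAIN QUERY PLAN returns rows whose fourth column is a human-readable detail.
plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT * FROM introns WHERE transcript_id = ?",
    (32210495,),
).fetchall()
plan_details = [row[3] for row in plan]

uses_index = any("idx_introns_transcript_id" in d for d in plan_details)
print(plan_details, uses_index)
```

If `uses_index` is false on a real copy of the database, the planner chose a different access path, which is worth investigating before running large filtered queries.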