introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron (e.g., GT-AG)
- is_minor: INTEGER, 1 if the intron is a minor intron, 0 otherwise
- score: REAL, probability (0-100%) that the intron is minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, 1 if the intron is within the coding sequence, 0 otherwise (e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
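The column definitions above imply a few relationships between fields that can be checked row by row. The following is a minimal sketch, not part of the database itself: `IntronRow` and `check_row` are hypothetical names, and two of the rules are assumptions, namely that `start` and `end` are both inclusive (so `length = end - start + 1`, which holds for the rows listed on this page) and that `phase` is null exactly when `in_cds` is 0.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class IntronRow:
    """Hypothetical container for a subset of the `introns` columns."""
    id: int
    dinucleotide_pair: str
    is_minor: int
    score: float
    length: int
    start: int
    end: int
    phase: Optional[int]
    in_cds: int


def check_row(row: IntronRow) -> List[str]:
    """Return a list of problems; an empty list means the row looks consistent."""
    problems = []
    if row.is_minor not in (0, 1):
        problems.append("is_minor must be 0 or 1")
    if not 0.0 <= row.score <= 100.0:
        problems.append("score must lie in 0-100")
    # Assumption: start and end are both inclusive coordinates.
    if row.length != row.end - row.start + 1:
        problems.append("length != end - start + 1")
    # Assumption: coding introns always carry a phase, non-coding ones never do.
    if row.in_cds == 1 and row.phase not in (0, 1, 2):
        problems.append("coding intron needs phase 0, 1, or 2")
    if row.in_cds == 0 and row.phase is not None:
        problems.append("non-coding intron should have null phase")
    return problems


# Values taken from the row with id 17675555 on this page.
row = IntronRow(17675555, "GT-AG", 0, 0.0001899824005846, 775,
                104246274, 104247048, 1, 1)
print(check_row(row))  # []
```

Returning a list of messages rather than raising on the first failure makes it easier to report every inconsistency in a row at once.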
32 rows where transcript_id = 3555647
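The filter above corresponds to a simple parameterized query. As a rough sketch, assuming a local SQLite copy of the table, the lookup could be written with Python's standard `sqlite3` module; the in-memory table here is a cut-down stand-in with only four columns, the third inserted row is made up for contrast, and `introns_for_transcript` is a hypothetical helper name.

```python
import sqlite3

# Cut-down stand-in for the real table (assumption: only four columns kept).
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE introns (id INTEGER PRIMARY KEY, transcript_id INTEGER, "
    "ordinal_index INTEGER, length INTEGER)"
)
conn.executemany(
    "INSERT INTO introns VALUES (?, ?, ?, ?)",
    [
        (17675555, 3555647, 2, 775),   # from this page
        (17675556, 3555647, 3, 100),   # from this page
        (1, 42, 1, 500),               # hypothetical row from another transcript
    ],
)


def introns_for_transcript(conn, transcript_id):
    """Return (id, ordinal_index, length) for one transcript, in intron order."""
    cur = conn.execute(
        "SELECT id, ordinal_index, length FROM introns "
        "WHERE transcript_id = ? ORDER BY ordinal_index",
        (transcript_id,),
    )
    return cur.fetchall()


rows = introns_for_transcript(conn, 3555647)
print(rows)  # [(17675555, 2, 775), (17675556, 3, 100)]
```

Ordering by `ordinal_index` rather than `id` keeps the introns in transcript order even when ids are not assigned sequentially.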
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 17675555 | GT-AG | 0 | 0.0001899824005846 | 775 | rna-XM_038325036.2 3555647 | 2 | 104246274 | 104247048 | Arvicola amphibius 1047088 | AAA\|GTAAGTTTTG...ACTGTTTTACTC/GTTTTACTCATA...TTCAG\|ATA | 1 | 1 | 3.484 |
| 17675556 | GT-AG | 0 | 1.000000099473604e-05 | 100 | rna-XM_038325036.2 3555647 | 3 | 104246072 | 104246171 | Arvicola amphibius 1047088 | CAG\|GTAAGGAATT...CCCACTTTCATC/CCCACTTTCATC...TTCAG\|GTG | 1 | 1 | 5.315 |
| 17675557 | GT-AG | 0 | 2.7700997764079995e-05 | 2301 | rna-XM_038325036.2 3555647 | 4 | 104243594 | 104245894 | Arvicola amphibius 1047088 | GGA\|GTAAGTAGGA...TCTGCCTTACTT/TTCTTTCTAATT...TATAG\|ATT | 1 | 1 | 8.493 |
| 17675558 | GT-AG | 0 | 0.0153863333102862 | 3513 | rna-XM_038325036.2 3555647 | 5 | 104239941 | 104243453 | Arvicola amphibius 1047088 | AAT\|GTATGTATTT...ATGTCCTTTGTC/CTTGAACTTAGA...TGCAG\|CCT | 0 | 1 | 11.007 |
| 17675559 | GT-AG | 0 | 1.000000099473604e-05 | 1127 | rna-XM_038325036.2 3555647 | 6 | 104238706 | 104239832 | Arvicola amphibius 1047088 | GAG\|GTTAGTAAAA...GGTGTTTTATCC/TGGTGTTTTATC...TTCAG\|GAA | 0 | 1 | 12.947 |
| 17675560 | GT-AG | 0 | 0.0005222753271643 | 1785 | rna-XM_038325036.2 3555647 | 7 | 104236740 | 104238524 | Arvicola amphibius 1047088 | CTA\|GTAAGCACTT...TTTTTCTTTTCT/CTTTTTTTCTTT...GAAAG\|GCA | 1 | 1 | 16.197 |
| 17675561 | GT-AG | 0 | 1.000000099473604e-05 | 1390 | rna-XM_038325036.2 3555647 | 8 | 104235186 | 104236575 | Arvicola amphibius 1047088 | ATG\|GTAAGGCCTT...TTCATTTTAATC/TTCATTTTAATC...TCCAG\|GGC | 0 | 1 | 19.142 |
| 17675562 | GT-AG | 0 | 2.056950239135568e-05 | 2693 | rna-XM_038325036.2 3555647 | 9 | 104232355 | 104235047 | Arvicola amphibius 1047088 | TCG\|GTAGGTGTGG...ATTATTTTAATT/ATTATTTTAATT...TTTAG\|ACC | 0 | 1 | 21.62 |
| 17675563 | GT-AG | 0 | 0.0003299888626444 | 5512 | rna-XM_038325036.2 3555647 | 10 | 104226704 | 104232215 | Arvicola amphibius 1047088 | TTG\|GTACATATAA...ATTGTTTTAAAA/AATCTACTCATT...CTTAG\|GTT | 1 | 1 | 24.116 |
| 17675564 | GT-AG | 0 | 1.000000099473604e-05 | 7030 | rna-XM_038325036.2 3555647 | 11 | 104219581 | 104226610 | Arvicola amphibius 1047088 | CAG\|GTAAGAAAGT...TTCCTCTTGTCT/ACAATAGTCAAA...CATAG\|CTC | 1 | 1 | 25.786 |
| 17675565 | GT-AG | 0 | 1.000000099473604e-05 | 6753 | rna-XM_038325036.2 3555647 | 12 | 104212687 | 104219439 | Arvicola amphibius 1047088 | TAG\|GTAAGTGCCA...GGATCCTTGAAC/TTGAACTTAACT...AATAG\|CTT | 1 | 1 | 28.317 |
| 17675566 | GT-AG | 0 | 0.0015999908329038 | 2280 | rna-XM_038325036.2 3555647 | 13 | 104210273 | 104212552 | Arvicola amphibius 1047088 | GAG\|GTAATTTTTT...GCTTCCTTGATG/GCTTCCTTGATG...TTCAG\|AGG | 0 | 1 | 30.724 |
| 17675567 | GT-AG | 0 | 1.5561849055247522e-05 | 4139 | rna-XM_038325036.2 3555647 | 14 | 104205946 | 104210084 | Arvicola amphibius 1047088 | CAA\|GTAAGTTTAT...TTTCTTTTGGTC/TGTACATTTATT...TTTAG\|GAA | 2 | 1 | 34.099 |
| 17675568 | GT-AG | 0 | 2.083842010200852e-05 | 3737 | rna-XM_038325036.2 3555647 | 15 | 104202092 | 104205828 | Arvicola amphibius 1047088 | GAG\|GTAAGCTGAA...TTGTCGTTACTG/AAAGTATTAATA...TTCAG\|ATT | 2 | 1 | 36.2 |
| 17675569 | GT-AG | 0 | 1.000000099473604e-05 | 1138 | rna-XM_038325036.2 3555647 | 16 | 104200655 | 104201792 | Arvicola amphibius 1047088 | CAG\|GTCAGTTTTA...TACTTTCTGACT/TACTTTCTGACT...TGCAG\|GTA | 1 | 1 | 41.569 |
| 17675570 | GT-AG | 0 | 0.0005026131213887 | 695 | rna-XM_038325036.2 3555647 | 17 | 104199846 | 104200540 | Arvicola amphibius 1047088 | AAG\|GTATGTGTGA...TCTTTTTTAACT/TCTTTTTTAACT...TTTAG\|GAT | 1 | 1 | 43.616 |
| 17675571 | GT-AG | 0 | 1.956751660563618e-05 | 824 | rna-XM_038325036.2 3555647 | 18 | 104198883 | 104199706 | Arvicola amphibius 1047088 | TAG\|GTATGAGTTT...ACTTCCTTTTTT/CCTTTTTTAAGT...CGCAG\|GAA | 2 | 1 | 46.112 |
| 17675572 | GT-AG | 0 | 0.0008412519037338 | 1596 | rna-XM_038325036.2 3555647 | 19 | 104197109 | 104198704 | Arvicola amphibius 1047088 | AAG\|GTATAGTTTC...CGTTTCTTTTCT/ATATTATTAAAA...TCTAG\|GTC | 0 | 1 | 49.309 |
| 17675573 | GT-AG | 0 | 1.000000099473604e-05 | 7377 | rna-XM_038325036.2 3555647 | 20 | 104189567 | 104196943 | Arvicola amphibius 1047088 | CAG\|GTAGAGAAAG...CTGTTTTTGCTG/GTTTATCTGATG...TGCAG\|AGC | 0 | 1 | 52.272 |
| 17675574 | GT-AG | 0 | 1.000000099473604e-05 | 611 | rna-XM_038325036.2 3555647 | 21 | 104188773 | 104189383 | Arvicola amphibius 1047088 | CAG\|GTGACGTTCT...TGGTCTTTCATT/TGGTCTTTCATT...TTTAG\|ATC | 0 | 1 | 55.558 |
| 17675575 | GT-AG | 0 | 0.0003837553007511 | 4705 | rna-XM_038325036.2 3555647 | 22 | 104183914 | 104188618 | Arvicola amphibius 1047088 | TAG\|GTACAGTATA...GTGTTCTTAATT/GTGTTCTTAATT...TTCAG\|TTG | 1 | 1 | 58.323 |
| 17675576 | GT-AG | 0 | 1.000000099473604e-05 | 1523 | rna-XM_038325036.2 3555647 | 23 | 104181961 | 104183483 | Arvicola amphibius 1047088 | ATC\|GTGAGTTATG...TGTTTCTTCATT/CATTTTCTGACT...TCTAG\|TCT | 2 | 1 | 66.044 |
| 17675577 | GT-AG | 0 | 1.000000099473604e-05 | 895 | rna-XM_038325036.2 3555647 | 24 | 104180836 | 104181730 | Arvicola amphibius 1047088 | AAA\|GTAAGTACAT...ATGGCATTACTT/CAAACTTTCATG...CTTAG\|CTC | 1 | 1 | 70.174 |
| 17675578 | GT-AG | 0 | 1.000000099473604e-05 | 396 | rna-XM_038325036.2 3555647 | 25 | 104180173 | 104180568 | Arvicola amphibius 1047088 | CAG\|GTAGGAATGA...CTGTCCTCACCA/CCTGTCCTCACC...TACAG\|GTG | 1 | 1 | 74.969 |
| 17675579 | GT-AG | 0 | 1.000000099473604e-05 | 1332 | rna-XM_038325036.2 3555647 | 26 | 104178683 | 104180014 | Arvicola amphibius 1047088 | CCG\|GTAATGTCAT...TGATCCTTTGTC/GTAGTAATAACC...TGCAG\|GTG | 0 | 1 | 77.806 |
| 17675580 | GT-AG | 0 | 0.0010883309487829 | 1217 | rna-XM_038325036.2 3555647 | 27 | 104177322 | 104178538 | Arvicola amphibius 1047088 | CAG\|GTATGTGTGT...ATTGCTTTAACA/ATAAAACTCACA...TTCAG\|AAT | 0 | 1 | 80.391 |
| 17675581 | GT-AG | 0 | 1.000000099473604e-05 | 3099 | rna-XM_038325036.2 3555647 | 28 | 104174080 | 104177178 | Arvicola amphibius 1047088 | CAG\|GTAAGCCAGG...AGTATCCTAACT/AGTATCCTAACT...CTCAG\|ACA | 2 | 1 | 82.959 |
| 17675582 | GT-AG | 0 | 1.000000099473604e-05 | 1584 | rna-XM_038325036.2 3555647 | 29 | 104172274 | 104173857 | Arvicola amphibius 1047088 | AGG\|GTGAGCGAAA...GCTTCTTTCTTC/ACAGGCGTGACA...TATAG\|CAT | 2 | 1 | 86.946 |
| 17675583 | GT-AG | 0 | 1.000000099473604e-05 | 442 | rna-XM_038325036.2 3555647 | 30 | 104171590 | 104172031 | Arvicola amphibius 1047088 | CTG\|GTGAGTGCAA...GTTGTCTTCTTT/ACTGTTGTGAGC...TTCAG\|ATG | 1 | 1 | 91.291 |
| 17675584 | GT-AG | 0 | 0.0089081137819441 | 4011 | rna-XM_038325036.2 3555647 | 31 | 104167424 | 104171434 | Arvicola amphibius 1047088 | CAG\|GTACCTAAAT...TTGTCCTTACTG/TTTGTCCTTACT...TTCAG\|GTG | 0 | 1 | 94.074 |
| 17675585 | GT-AG | 0 | 6.029921566510127e-05 | 99 | rna-XM_038325036.2 3555647 | 32 | 104167213 | 104167311 | Arvicola amphibius 1047088 | CAG\|GTAATCTGTG...ACAGCCTCACCA/TCAACACTCATT...TCTAG\|ATA | 1 | 1 | 96.085 |
| 17692012 | GT-AG | 0 | 1.000000099473604e-05 | 11700 | rna-XM_038325036.2 3555647 | 1 | 104247153 | 104258852 | Arvicola amphibius 1047088 | GCG\|GTAAGTCGGG...GGTTTCTCAAAA/TGGTTTCTCAAA...TACAG\|CTT |  | 0 | 2.119 |
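The `scored_motifs` values in the table follow a regular delimiter pattern (`|`, `...`, and `/`), so they can be split mechanically. The sketch below is only one possible reading: `parse_scored_motifs` and the dictionary keys are hypothetical names inferred from the delimiters, not documented field names, and the biological meaning of the two inner segments is left open. It also derives the terminal dinucleotides from the first two and last two bases of the intron portion, which matches the `dinucleotide_pair` column for the example row.

```python
def parse_scored_motifs(motifs: str) -> dict:
    """Split a scored_motifs string on its delimiters.

    The key names are assumptions inferred from the layout of the string,
    not names documented by the database.
    """
    # "|" separates the flanking exon bases from the intron portion.
    upstream_flank, intron_part, downstream_flank = motifs.split("|")
    # "..." marks sequence elided between the scored segments.
    five_prime, middle, three_prime = intron_part.split("...")
    # "/" separates two internal segments.
    internal_a, internal_b = middle.split("/")
    return {
        "upstream_flank": upstream_flank,
        "five_prime": five_prime,
        "internal_a": internal_a,
        "internal_b": internal_b,
        "three_prime": three_prime,
        "downstream_flank": downstream_flank,
        # First two and last two intron bases, joined like the table column.
        "dinucleotide_pair": f"{five_prime[:2]}-{three_prime[-2:]}",
    }


# scored_motifs value of the row with id 17675555.
example = "AAA|GTAAGTTTTG...ACTGTTTTACTC/GTTTTACTCATA...TTCAG|ATA"
parsed = parse_scored_motifs(example)
print(parsed["dinucleotide_pair"])  # GT-AG
```

The tuple unpacking raises `ValueError` on any string that does not match the expected shape, which is a deliberately strict choice for a field whose format is only inferred.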
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL,
PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
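To see how the foreign keys and indexes in this schema behave, a reduced version can be loaded into an in-memory SQLite database. This is a sketch under stated assumptions: the `transcripts` and `genomes` tables are hypothetical one-column stubs (their real definitions are not shown here), only three `introns` columns and one index are kept, and the identifiers are written without the bracket quoting. Note that SQLite enforces foreign keys only when `PRAGMA foreign_keys = ON` is set on the connection.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# SQLite leaves foreign key enforcement off unless it is enabled per connection.
conn.execute("PRAGMA foreign_keys = ON")
conn.executescript("""
CREATE TABLE genomes (taxonomy_id INTEGER PRIMARY KEY);  -- hypothetical stub
CREATE TABLE transcripts (id INTEGER PRIMARY KEY);       -- hypothetical stub
CREATE TABLE introns (
    id INTEGER,
    transcript_id INTEGER,
    taxonomy_id INTEGER,
    PRIMARY KEY (id),
    FOREIGN KEY (transcript_id) REFERENCES transcripts (id),
    FOREIGN KEY (taxonomy_id) REFERENCES genomes (taxonomy_id)
);
CREATE INDEX idx_introns_transcript_id ON introns (transcript_id);
""")

conn.execute("INSERT INTO genomes VALUES (1047088)")
conn.execute("INSERT INTO transcripts VALUES (3555647)")
conn.execute("INSERT INTO introns VALUES (17675555, 3555647, 1047088)")

# An intron pointing at a transcript that does not exist should be rejected.
try:
    conn.execute("INSERT INTO introns VALUES (1, 999, 1047088)")
    fk_rejected = False
except sqlite3.IntegrityError:
    fk_rejected = True
print("foreign key violation rejected:", fk_rejected)

# Ask the planner how it would resolve a lookup by transcript_id.
plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT id FROM introns WHERE transcript_id = ?",
    (3555647,),
).fetchall()
detail = plan[0][3]  # the fourth column holds the human-readable plan step
print(detail)
```

The exact wording of the plan text varies between SQLite versions, so it is safer to look for the index name in the output than to compare the whole string.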