introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
26 rows where transcript_id = 23220185
This data as json, CSV (advanced)
Suggested facets: score, length, phase
| id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 126296361 | GT-AG | 0 | 0.000423624384071 | 141 | rna-gnl|WGS:JACVTC|H8356DRAFT_mRNA1616397 23220185 | 1 | 1521033 | 1521173 | Neocallimastix sp. jgi-2020a 2767002 | AAG|GTTTTGTTTT...TAATTTTTAATA/AAATTTTTAATT...AACAG|AGA | 1 | 1 | 1.323 |
| 126296362 | GT-AG | 0 | 0.0003839455162003 | 471 | rna-gnl|WGS:JACVTC|H8356DRAFT_mRNA1616397 23220185 | 2 | 1521389 | 1521859 | Neocallimastix sp. jgi-2020a 2767002 | AAT|GTAATTTTAT...TTTTTTTTTATG/TTTTTTTTTATG...TAAAG|CTT | 0 | 1 | 6.794 |
| 126296363 | GT-AG | 0 | 0.0015327843240632 | 137 | rna-gnl|WGS:JACVTC|H8356DRAFT_mRNA1616397 23220185 | 3 | 1521973 | 1522109 | Neocallimastix sp. jgi-2020a 2767002 | GCA|GTATGAATTA...TTATTATTATTA/ATTATTATTATT...ATAAG|GCT | 2 | 1 | 9.669 |
| 126296364 | GT-AG | 0 | 8.241614821756916e-05 | 111 | rna-gnl|WGS:JACVTC|H8356DRAFT_mRNA1616397 23220185 | 4 | 1522234 | 1522344 | Neocallimastix sp. jgi-2020a 2767002 | GAA|GTAAATTATA...ATGTTTATAATC/AATGTACTAATT...AACAG|ACT | 0 | 1 | 12.824 |
| 126296365 | GT-AG | 0 | 1.000000099473604e-05 | 154 | rna-gnl|WGS:JACVTC|H8356DRAFT_mRNA1616397 23220185 | 5 | 1522544 | 1522697 | Neocallimastix sp. jgi-2020a 2767002 | GAA|GTAATAAAAT...AATTTTTTAAAT/TATTTGTTTATT...TAAAG|ATG | 1 | 1 | 17.888 |
| 126296366 | GT-AG | 0 | 0.0002167463137785 | 69 | rna-gnl|WGS:JACVTC|H8356DRAFT_mRNA1616397 23220185 | 6 | 1522749 | 1522817 | Neocallimastix sp. jgi-2020a 2767002 | AAG|GTATAAATAT...CAAATTTTAATT/CAAATTTTAATT...AATAG|ATA | 1 | 1 | 19.186 |
| 126296367 | GT-AG | 0 | 0.0079861108642248 | 217 | rna-gnl|WGS:JACVTC|H8356DRAFT_mRNA1616397 23220185 | 7 | 1523015 | 1523231 | Neocallimastix sp. jgi-2020a 2767002 | TAT|GTATGTATTA...ATTTATTTATTT/TATTTATTTATT...TAAAG|GGA | 0 | 1 | 24.198 |
| 126296368 | GT-AG | 0 | 1.000000099473604e-05 | 151 | rna-gnl|WGS:JACVTC|H8356DRAFT_mRNA1616397 23220185 | 8 | 1523277 | 1523427 | Neocallimastix sp. jgi-2020a 2767002 | AAT|GTAAATATAT...ATAATAATAATA/ATAATAATAATA...AAAAG|GAT | 0 | 1 | 25.344 |
| 126296369 | GT-AG | 0 | 4.518064393306399e-05 | 64 | rna-gnl|WGS:JACVTC|H8356DRAFT_mRNA1616397 23220185 | 9 | 1523548 | 1523611 | Neocallimastix sp. jgi-2020a 2767002 | GAA|GTAAGTTTAG...CAATTATTAATG/TTTGTACTCATA...GATAG|TTC | 0 | 1 | 28.397 |
| 126296370 | GT-AG | 0 | 1.000000099473604e-05 | 146 | rna-gnl|WGS:JACVTC|H8356DRAFT_mRNA1616397 23220185 | 10 | 1523758 | 1523903 | Neocallimastix sp. jgi-2020a 2767002 | AAG|GTGAAATTTA...ATAATTTTATTT/TTTTATTTAATA...TACAG|AGC | 2 | 1 | 32.112 |
| 126296371 | GT-AG | 0 | 0.0512128287520344 | 84 | rna-gnl|WGS:JACVTC|H8356DRAFT_mRNA1616397 23220185 | 11 | 1524136 | 1524219 | Neocallimastix sp. jgi-2020a 2767002 | GAA|GTATGTATAT...TTTTTTTTAATT/TTTTTTTTAATT...TAAAG|AGA | 0 | 1 | 38.015 |
| 126296372 | GT-AG | 0 | 1.000000099473604e-05 | 87 | rna-gnl|WGS:JACVTC|H8356DRAFT_mRNA1616397 23220185 | 12 | 1524327 | 1524413 | Neocallimastix sp. jgi-2020a 2767002 | TGG|GTAAATGATT...ATTTTGTTAATA/TTAATATTAACC...TAAAG|TCG | 2 | 1 | 40.738 |
| 126296373 | GT-AG | 0 | 9.996987441994882e-05 | 156 | rna-gnl|WGS:JACVTC|H8356DRAFT_mRNA1616397 23220185 | 13 | 1524543 | 1524698 | Neocallimastix sp. jgi-2020a 2767002 | TAA|GTAAATTAAT...TAATTTATAACT/ATATTACTAATA...AAAAG|TTA | 2 | 1 | 44.02 |
| 126296374 | GT-AG | 0 | 0.0009418797660061 | 120 | rna-gnl|WGS:JACVTC|H8356DRAFT_mRNA1616397 23220185 | 14 | 1524835 | 1524954 | Neocallimastix sp. jgi-2020a 2767002 | ACT|GTAATATTTT...TATATTTTAATA/CTTGATTTCATT...TACAG|ACA | 0 | 1 | 47.481 |
| 126296375 | GT-AG | 0 | 1.000000099473604e-05 | 93 | rna-gnl|WGS:JACVTC|H8356DRAFT_mRNA1616397 23220185 | 15 | 1525148 | 1525240 | Neocallimastix sp. jgi-2020a 2767002 | AAG|GTAAAGTTAT...CCATACTTAATA/TTAATATTAATA...TTTAG|GTA | 1 | 1 | 52.392 |
| 126296376 | GT-AG | 0 | 1.000000099473604e-05 | 112 | rna-gnl|WGS:JACVTC|H8356DRAFT_mRNA1616397 23220185 | 16 | 1525339 | 1525450 | Neocallimastix sp. jgi-2020a 2767002 | GAG|GTAAATGATT...TATATCTTATAT/TTTTTATTAATA...TTAAG|GTT | 0 | 1 | 54.885 |
| 126296377 | GT-AG | 0 | 0.0110653866956739 | 102 | rna-gnl|WGS:JACVTC|H8356DRAFT_mRNA1616397 23220185 | 17 | 1525599 | 1525700 | Neocallimastix sp. jgi-2020a 2767002 | CAA|GTATATTATA...AATACTATAAAA/AATTAAATAATT...TTTAG|ATG | 1 | 1 | 58.651 |
| 126296378 | GT-AG | 0 | 3.03036897591132e-05 | 107 | rna-gnl|WGS:JACVTC|H8356DRAFT_mRNA1616397 23220185 | 18 | 1525793 | 1525899 | Neocallimastix sp. jgi-2020a 2767002 | GAG|GTACATAAGA...ATTTATTTAAAT/ATAAATTTCATT...ATAAG|ATT | 0 | 1 | 60.992 |
| 126296379 | GT-AG | 0 | 0.0007860188298301 | 125 | rna-gnl|WGS:JACVTC|H8356DRAFT_mRNA1616397 23220185 | 19 | 1526275 | 1526399 | Neocallimastix sp. jgi-2020a 2767002 | GAA|GTAATTTTAA...AATCTTTTATTT/CTTTTATTTATT...ATTAG|AAA | 0 | 1 | 70.534 |
| 126296380 | GT-AG | 0 | 0.0001649843755192 | 182 | rna-gnl|WGS:JACVTC|H8356DRAFT_mRNA1616397 23220185 | 20 | 1526539 | 1526720 | Neocallimastix sp. jgi-2020a 2767002 | TGA|GTAATAATTT...TTTTTTTTAAAT/TTTTTTTTAAAT...ATAAG|AAA | 1 | 1 | 74.071 |
| 126296381 | GT-AG | 0 | 1.000000099473604e-05 | 115 | rna-gnl|WGS:JACVTC|H8356DRAFT_mRNA1616397 23220185 | 21 | 1526843 | 1526957 | Neocallimastix sp. jgi-2020a 2767002 | AAA|GTAAAAATAT...TACTTTTTATTA/TATTTATTCATT...ATTAG|ATT | 0 | 1 | 77.176 |
| 126296382 | GT-AG | 0 | 1.000000099473604e-05 | 80 | rna-gnl|WGS:JACVTC|H8356DRAFT_mRNA1616397 23220185 | 22 | 1527052 | 1527131 | Neocallimastix sp. jgi-2020a 2767002 | CTG|GTAAATAAAG...ATTTTATTAATA/ATTTTATTAATA...CATAG|ATG | 1 | 1 | 79.567 |
| 126296383 | GT-AG | 0 | 0.0001294078808522 | 98 | rna-gnl|WGS:JACVTC|H8356DRAFT_mRNA1616397 23220185 | 23 | 1527263 | 1527360 | Neocallimastix sp. jgi-2020a 2767002 | CAT|GTAAATTATT...ATAATTTTAAAA/ATATATTTAATT...TATAG|TAT | 0 | 1 | 82.901 |
| 126296384 | GT-AG | 0 | 0.0007745983930412 | 115 | rna-gnl|WGS:JACVTC|H8356DRAFT_mRNA1616397 23220185 | 24 | 1527541 | 1527655 | Neocallimastix sp. jgi-2020a 2767002 | GAA|GTAAGTTGTT...ATATTTTTAACT/ATATTTTTAACT...AAAAG|ATA | 0 | 1 | 87.481 |
| 126296385 | GT-AG | 0 | 0.2231284188459996 | 103 | rna-gnl|WGS:JACVTC|H8356DRAFT_mRNA1616397 23220185 | 25 | 1527809 | 1527911 | Neocallimastix sp. jgi-2020a 2767002 | AAA|GTATATTTCA...CTTTTTTTAAAA/CTTTTTTTAAAA...AATAG|GGT | 0 | 1 | 91.374 |
| 126296386 | GT-AG | 0 | 1.000000099473604e-05 | 263 | rna-gnl|WGS:JACVTC|H8356DRAFT_mRNA1616397 23220185 | 26 | 1528093 | 1528355 | Neocallimastix sp. jgi-2020a 2767002 | AAG|GTAAAATATA...TTATTATTATTA/TTATTGCTAATT...AAAAG|GAT | 1 | 1 | 95.98 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL
,PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);