introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in coding sequence (0, 1, or 2; null for introns outside of coding sequence)
- in_cds: INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
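As a rough illustration of how the `phase` and `in_cds` columns interact, the following Python sketch loads a few rows into an in-memory SQLite table and counts coding introns by phase. It uses only a subset of the columns. The first three sample rows are copied from the listing for transcript 25387432; the fourth row (id 1, a non-coding intron with a null phase and a score of 0.5) is hypothetical and is there only to show how null phases behave.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Reduced version of the introns table: only the columns this sketch needs.
conn.execute(
    "CREATE TABLE introns ("
    "id INTEGER PRIMARY KEY, is_minor INTEGER, score REAL, "
    "phase INTEGER, in_cds INTEGER)"
)

rows = [
    (140016423, 0, 1.000000099473604e-05, 1, 1),
    (140016424, 0, 1.000000099473604e-05, 0, 1),
    (140016427, 0, 1.000000099473604e-05, 2, 1),
    (1, 0, 0.5, None, 0),  # hypothetical UTR intron: phase is NULL, in_cds is 0
]
conn.executemany("INSERT INTO introns VALUES (?, ?, ?, ?, ?)", rows)

# Count coding introns per phase; the in_cds filter excludes NULL-phase rows.
phase_counts = dict(
    conn.execute(
        "SELECT phase, COUNT(*) FROM introns WHERE in_cds = 1 GROUP BY phase"
    ).fetchall()
)

# NULL phases must be matched with IS NULL, not "= NULL".
no_phase = conn.execute(
    "SELECT COUNT(*) FROM introns WHERE phase IS NULL"
).fetchone()[0]

print(phase_counts, no_phase)
```

Filtering on `in_cds = 1` before grouping keeps the null-phase introns out of the phase tally, which is usually what a phase distribution is meant to describe.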
25 rows where transcript_id = 25387432
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 140016423 | GT-AG | 0 | 1.000000099473604e-05 | 869 | rna-XM_040246884.1 25387432 | 1 | 58683164 | 58684032 | Oryx dammah 59534 | TCA\|GTAAGAATGT...TTTCTCTCGTCT/ATCCTGCTCATC...TTCAG\|AAC | 1 | 1 | 0.953 |
| 140016424 | GT-AG | 0 | 1.000000099473604e-05 | 5787 | rna-XM_040246884.1 25387432 | 2 | 58684092 | 58689878 | Oryx dammah 59534 | CCG\|GTGAGTATAT...TGTTCTTTTACT/TGTTCTTTTACT...AACAG\|GAT | 0 | 1 | 2.768 |
| 140016425 | GT-AG | 0 | 3.200609168459575e-05 | 827 | rna-XM_040246884.1 25387432 | 3 | 58689993 | 58690819 | Oryx dammah 59534 | CAG\|GTATTACCTT...CTGTCCTTTCCG/CGCTTTCTGACC...CCCAG\|CTG | 0 | 1 | 6.273 |
| 140016426 | GT-AG | 0 | 1.000000099473604e-05 | 1588 | rna-XM_040246884.1 25387432 | 4 | 58690919 | 58692506 | Oryx dammah 59534 | ATG\|GTGAAGTGGC...ATTTCCTGAACA/TATTTCCTGAAC...TTTAG\|AAA | 0 | 1 | 9.317 |
| 140016427 | GT-AG | 0 | 1.000000099473604e-05 | 2707 | rna-XM_040246884.1 25387432 | 5 | 58692569 | 58695275 | Oryx dammah 59534 | CAG\|GTAAGTGCTT...AAATTCTCAGAG/TCAAGGTTCACT...TGCAG\|GAT | 2 | 1 | 11.224 |
| 140016428 | GT-AG | 0 | 1.000000099473604e-05 | 357 | rna-XM_040246884.1 25387432 | 6 | 58695349 | 58695705 | Oryx dammah 59534 | ACG\|GTGAGTACCT...CCTTCCTTATCT/GCCTTCCTTATC...CACAG\|GCT | 0 | 1 | 13.469 |
| 140016429 | GT-AG | 0 | 1.000000099473604e-05 | 1235 | rna-XM_040246884.1 25387432 | 7 | 58695777 | 58697011 | Oryx dammah 59534 | TGG\|GTGAGTGTTC...TGGACCTGTACT/CGTATATGCATT...TCTAG\|TGA | 2 | 1 | 15.652 |
| 140016430 | GT-AG | 0 | 1.000000099473604e-05 | 1467 | rna-XM_040246884.1 25387432 | 8 | 58697095 | 58698561 | Oryx dammah 59534 | ATG\|GTGAGTTTTT...GGACTCTTATTC/AGGACTCTTATT...TTCAG\|GAG | 1 | 1 | 18.204 |
| 140016431 | GT-AG | 0 | 0.0005198701303158 | 1221 | rna-XM_040246884.1 25387432 | 9 | 58698721 | 58699941 | Oryx dammah 59534 | CTG\|GTATGTCTTT...GAATATTTAGTG/TATTTAGTGATT...CTTAG\|GAC | 1 | 1 | 23.093 |
| 140016432 | GT-AG | 0 | 2.753405328022425e-05 | 15601 | rna-XM_040246884.1 25387432 | 10 | 58700031 | 58715631 | Oryx dammah 59534 | TGG\|GTAAGTTAAA...CTTTTTTTGATT/CTTTTTTTGATT...TCTAG\|ATT | 0 | 1 | 25.83 |
| 140016433 | GT-AG | 0 | 0.0017814937048792 | 14337 | rna-XM_040246884.1 25387432 | 11 | 58715829 | 58730165 | Oryx dammah 59534 | CAG\|GTAGGCTTCT...TGTACCTTACTC/TTGTACCTTACT...CCCAG\|CTC | 2 | 1 | 31.888 |
| 140016434 | GT-AG | 0 | 1.000000099473604e-05 | 406 | rna-XM_040246884.1 25387432 | 12 | 58730179 | 58730584 | Oryx dammah 59534 | CAG\|GTAGAGGACA...TCTGGCTTATGT/GAGAAGTTCATT...TCCAG\|AAA | 0 | 1 | 32.288 |
| 140016435 | GT-AG | 0 | 1.000000099473604e-05 | 1912 | rna-XM_040246884.1 25387432 | 13 | 58730751 | 58732662 | Oryx dammah 59534 | AAG\|GTAAGTAATC...CTCCTCTTGTCT/TTCATCCACACG...GCTAG\|GTG | 1 | 1 | 37.392 |
| 140016436 | GT-AG | 0 | 1.000000099473604e-05 | 707 | rna-XM_040246884.1 25387432 | 14 | 58732852 | 58733558 | Oryx dammah 59534 | TTG\|GTGGGTGCCC...CTTTGCTTACTT/ACTTTGCTTATT...TTCAG\|TGA | 1 | 1 | 43.204 |
| 140016437 | GT-AG | 0 | 2.353592436436905e-05 | 4713 | rna-XM_040246884.1 25387432 | 15 | 58733675 | 58738387 | Oryx dammah 59534 | GAT\|GTAAGTGTTC...TCAGCCCTAACT/TCAGCCCTAACT...TCCAG\|GAT | 0 | 1 | 46.771 |
| 140016438 | GT-AG | 0 | 1.000000099473604e-05 | 2365 | rna-XM_040246884.1 25387432 | 16 | 58738553 | 58740917 | Oryx dammah 59534 | ATG\|GTTAGTGTGT...TCTTCCTTTTCC/TCCTCTCTCACT...CGAAG\|CTG | 0 | 1 | 51.845 |
| 140016439 | GT-AG | 0 | 1.000000099473604e-05 | 394 | rna-XM_040246884.1 25387432 | 17 | 58741030 | 58741423 | Oryx dammah 59534 | AAG\|GTAAGGGGAG...TCCCCCTTGCCC/ACTGTAAAGACC...TGCAG\|AGA | 1 | 1 | 55.289 |
| 140016440 | GT-AG | 0 | 1.000000099473604e-05 | 364 | rna-XM_040246884.1 25387432 | 18 | 58741530 | 58741893 | Oryx dammah 59534 | CAG\|GTATGAGGGA...TCACTCTTGGCT/TTAATCATCACT...TACAG\|GTC | 2 | 1 | 58.549 |
| 140016441 | GT-AG | 0 | 1.000000099473604e-05 | 518 | rna-XM_040246884.1 25387432 | 19 | 58742042 | 58742559 | Oryx dammah 59534 | CTG\|GTGGGCAACA...GCAGCCTGGAAT/GGAATGTAGATG...TGCAG\|GCC | 0 | 1 | 63.1 |
| 140016442 | GC-AG | 0 | 1.000000099473604e-05 | 1607 | rna-XM_040246884.1 25387432 | 20 | 58742744 | 58744350 | Oryx dammah 59534 | AAG\|GCATGTGAGG...AAGCTCTTAGCC/CAAGCTCTTAGC...TGCAG\|CTG | 1 | 1 | 68.758 |
| 140016443 | GT-AG | 0 | 1.000000099473604e-05 | 1260 | rna-XM_040246884.1 25387432 | 21 | 58744575 | 58745834 | Oryx dammah 59534 | CAG\|GTAAGTGCTC...ATCTCCTTGGTG/AATCTCCTAAGA...TCCAG\|ACG | 0 | 1 | 75.646 |
| 140016444 | GT-AG | 0 | 1.000000099473604e-05 | 1318 | rna-XM_040246884.1 25387432 | 22 | 58745919 | 58747236 | Oryx dammah 59534 | CAG\|GTAAGACCAG...TCCTCCTTACCC/TTCCTCCTTACC...TCCAG\|GAC | 0 | 1 | 78.229 |
| 140016445 | GT-AG | 0 | 1.000000099473604e-05 | 171 | rna-XM_040246884.1 25387432 | 23 | 58747483 | 58747653 | Oryx dammah 59534 | CAG\|GTGTGGCGGG...AGATCCTGAGTT/TGTAGACTAATG...TTCAG\|ATT | 0 | 1 | 85.793 |
| 140016446 | GT-AG | 0 | 1.000000099473604e-05 | 778 | rna-XM_040246884.1 25387432 | 24 | 58747793 | 58748570 | Oryx dammah 59534 | TAG\|GTAATATTCT...TGGCTCTTTGTT/CCTCCCTCCAGG...TACAG\|GTG | 1 | 1 | 90.068 |
| 140016447 | GT-AG | 0 | 1.000000099473604e-05 | 95 | rna-XM_040246884.1 25387432 | 25 | 58748702 | 58748796 | Oryx dammah 59534 | CAG\|GTGAGTGCAA...TTAGCCTTTGCC/CTGTGGATAAAA...TATAG\|ATT | 0 | 1 | 94.096 |
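In the rows listed here, `length` equals `end - start + 1`, which suggests that `start` and `end` are inclusive coordinates. This is inferred from the values, not stated in the column descriptions. Under the same reading, and assuming consecutive `ordinal_index` values are adjacent introns of the transcript, the gap between one intron's `end` and the next intron's `start` would be the length of the exon between them. A minimal Python sketch of both checks, using the first three rows:

```python
# (ordinal_index, start, end, length) copied from the first three rows.
introns = [
    (1, 58683164, 58684032, 869),
    (2, 58684092, 58689878, 5787),
    (3, 58689993, 58690819, 827),
]


def inclusive_length(start, end):
    """Length of an interval whose start and end are both included."""
    return end - start + 1


# Check that the stored length agrees with the inclusive-coordinate reading.
lengths_match = all(
    inclusive_length(start, end) == length
    for _, start, end, length in introns
)

# Assumed exon length between adjacent introns: the bases strictly between
# the end of one intron and the start of the next.
exon_lengths = [
    nxt[1] - cur[2] - 1
    for cur, nxt in zip(introns, introns[1:])
]

print(lengths_match, exon_lengths)
```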
CREATE TABLE "introns" (
    "id" INTEGER,
    "dinucleotide_pair" TEXT,
    "is_minor" INTEGER,
    "score" REAL,
    "length" INTEGER,
    "transcript_id" INTEGER,
    "ordinal_index" INTEGER,
    "start" INTEGER,
    "end" INTEGER,
    "taxonomy_id" INTEGER,
    "scored_motifs" TEXT,
    "phase" INTEGER,
    "in_cds" INTEGER,
    "relative_position" REAL,
    PRIMARY KEY ([id]),
    FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
    FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id] ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase] ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor] ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair] ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score] ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds] ON [introns] ([in_cds]);
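The index on `transcript_id` is what lets a filter such as `transcript_id = 25387432` avoid a full table scan. The Python sketch below is an illustration under simplifying assumptions: it creates a reduced, empty copy of the table (a subset of columns, no foreign keys) in an in-memory database and asks SQLite for its query plan. The exact wording of the plan text varies between SQLite versions, so the sketch only looks for the index name.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Reduced schema for illustration; the real table has more columns and indexes.
conn.executescript(
    """
    CREATE TABLE introns (
        id INTEGER PRIMARY KEY,
        transcript_id INTEGER,
        ordinal_index INTEGER,
        phase INTEGER
    );
    CREATE INDEX idx_introns_transcript_id ON introns (transcript_id);
    """
)

# EXPLAIN QUERY PLAN returns rows whose fourth field is a readable description.
plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT * FROM introns WHERE transcript_id = ?",
    (25387432,),
).fetchall()
plan_text = " ".join(row[3] for row in plan)

uses_index = "idx_introns_transcript_id" in plan_text
print(plan_text)
```

Running the same `EXPLAIN QUERY PLAN` against the full database is a quick way to confirm that a filter on any of the indexed columns is actually served by its index.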