introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
24 rows where transcript_id = 23988670
This data as json, CSV (advanced)
Suggested facets: score, phase
| id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 130512849 | GT-AG | 0 | 0.001059426862732 | 1492 | rna-gnl|WGS:VZSG|NOTPEN_R09847_mrna 23988670 | 1 | 4254907 | 4256398 | Nothoprocta pentlandii 2585814 | CAG|GTACCAATGA...CTATTTTTACTG/GCTATTTTTACT...CCTAG|GTT | 0 | 1 | 2.458 |
| 130512850 | GT-AG | 0 | 1.000000099473604e-05 | 418 | rna-gnl|WGS:VZSG|NOTPEN_R09847_mrna 23988670 | 2 | 4254326 | 4254743 | Nothoprocta pentlandii 2585814 | CAG|GTAAGAGCAG...TCTTTTTTAATT/TCTTTTTTAATT...TTTAG|CAA | 1 | 1 | 5.24 |
| 130512851 | GT-AG | 0 | 1.000000099473604e-05 | 1931 | rna-gnl|WGS:VZSG|NOTPEN_R09847_mrna 23988670 | 3 | 4252264 | 4254194 | Nothoprocta pentlandii 2585814 | CAG|GTAATATTCT...TTTTCCTTGTTT/CTTGTTTTCAAT...CATAG|GTT | 0 | 1 | 7.476 |
| 130512852 | GT-AG | 0 | 9.35443706911994e-05 | 537 | rna-gnl|WGS:VZSG|NOTPEN_R09847_mrna 23988670 | 4 | 4251592 | 4252128 | Nothoprocta pentlandii 2585814 | AAG|GTAGGTTGCC...GTACTTTTAACT/TAACTTTTGACT...CACAG|ATC | 0 | 1 | 9.78 |
| 130512853 | GT-AG | 0 | 0.0219162557077576 | 1103 | rna-gnl|WGS:VZSG|NOTPEN_R09847_mrna 23988670 | 5 | 4250328 | 4251430 | Nothoprocta pentlandii 2585814 | CAG|GTATGTTTTA...TTTGACTTGATG/CTGTATTTGACT...TTAAG|ATA | 2 | 1 | 12.528 |
| 130512854 | GT-AG | 0 | 5.260159383867505e-05 | 1029 | rna-gnl|WGS:VZSG|NOTPEN_R09847_mrna 23988670 | 6 | 4249130 | 4250158 | Nothoprocta pentlandii 2585814 | CAG|GTAAGCATTC...GACTCTTTGAAC/TTTGAACTGATA...TGTAG|ATA | 0 | 1 | 15.412 |
| 130512855 | GT-AG | 0 | 1.000000099473604e-05 | 3392 | rna-gnl|WGS:VZSG|NOTPEN_R09847_mrna 23988670 | 7 | 4245265 | 4248656 | Nothoprocta pentlandii 2585814 | TAG|GTAAATCTTC...TTATTTTTCATT/TTATTTTTCATT...TTTAG|GTT | 2 | 1 | 23.485 |
| 130512856 | GT-AG | 0 | 0.0010094385417496 | 457 | rna-gnl|WGS:VZSG|NOTPEN_R09847_mrna 23988670 | 8 | 4244675 | 4245131 | Nothoprocta pentlandii 2585814 | GAG|GTAACTTCAG...AATCTCTCAATT/TATTCTTTCAAT...TGCAG|GTT | 0 | 1 | 25.755 |
| 130512857 | GT-AG | 0 | 0.0001031902866046 | 98 | rna-gnl|WGS:VZSG|NOTPEN_R09847_mrna 23988670 | 9 | 4244334 | 4244431 | Nothoprocta pentlandii 2585814 | AAG|GTATGAAGCT...CTTTTCTTAAAA/TTTGTTTTGAAT...CTAAG|ATC | 0 | 1 | 29.903 |
| 130512858 | GT-AG | 0 | 1.000000099473604e-05 | 1924 | rna-gnl|WGS:VZSG|NOTPEN_R09847_mrna 23988670 | 10 | 4242255 | 4244178 | Nothoprocta pentlandii 2585814 | TAG|GTAGGAGGCA...AAAGTATTGATT/AAAGTATTGATT...TTTAG|GTA | 2 | 1 | 32.548 |
| 130512859 | GT-AG | 0 | 5.866561802965285e-05 | 449 | rna-gnl|WGS:VZSG|NOTPEN_R09847_mrna 23988670 | 11 | 4241673 | 4242121 | Nothoprocta pentlandii 2585814 | GTT|GTAGGTATTT...CAATGTTTAAAT/TAGTAATTAATA...TATAG|GGT | 0 | 1 | 34.818 |
| 130512860 | GT-AG | 0 | 1.000000099473604e-05 | 1112 | rna-gnl|WGS:VZSG|NOTPEN_R09847_mrna 23988670 | 12 | 4240485 | 4241596 | Nothoprocta pentlandii 2585814 | TTG|GTAAGAGCTT...ATTATGTTATTT/TGTATGTTCATT...CTCAG|GTG | 1 | 1 | 36.115 |
| 130512861 | GT-AG | 0 | 1.000000099473604e-05 | 1538 | rna-gnl|WGS:VZSG|NOTPEN_R09847_mrna 23988670 | 13 | 4238807 | 4240344 | Nothoprocta pentlandii 2585814 | GCT|GTCAGTATCA...TTTTTTTTTTCC/ATTTTGCTGATA...TCTAG|ATT | 0 | 1 | 38.505 |
| 130512862 | GT-AG | 0 | 1.000000099473604e-05 | 813 | rna-gnl|WGS:VZSG|NOTPEN_R09847_mrna 23988670 | 14 | 4237814 | 4238626 | Nothoprocta pentlandii 2585814 | CAG|GTGATGAGTC...TTTTGCTTGATT/CTTGATTTCATT...CTTAG|ATT | 0 | 1 | 41.577 |
| 130512863 | GT-AG | 0 | 1.000000099473604e-05 | 377 | rna-gnl|WGS:VZSG|NOTPEN_R09847_mrna 23988670 | 15 | 4237223 | 4237599 | Nothoprocta pentlandii 2585814 | TTG|GTAAGAACAG...ACTTTTCTGACT/ACTTTTCTGACT...TTAAG|TTG | 1 | 1 | 45.23 |
| 130512864 | GT-AG | 0 | 1.000000099473604e-05 | 1680 | rna-gnl|WGS:VZSG|NOTPEN_R09847_mrna 23988670 | 16 | 4235389 | 4237068 | Nothoprocta pentlandii 2585814 | AAG|GTGAGTACCT...CAGGTTTTGTTC/TTTGTTCTAAGG...TTTAG|GTA | 2 | 1 | 47.858 |
| 130512865 | GT-AG | 0 | 6.41794934314902e-05 | 281 | rna-gnl|WGS:VZSG|NOTPEN_R09847_mrna 23988670 | 17 | 4234925 | 4235205 | Nothoprocta pentlandii 2585814 | AAG|GTAACAGTCT...ATTTTTTTATAT/AATTTTTTTATA...CTTAG|ACT | 2 | 1 | 50.981 |
| 130512866 | GT-AG | 0 | 1.000000099473604e-05 | 104 | rna-gnl|WGS:VZSG|NOTPEN_R09847_mrna 23988670 | 18 | 4234715 | 4234818 | Nothoprocta pentlandii 2585814 | CAG|GTAAAATAGA...ATTCTCTTCTCA/TCTCTTCTCACC...CCCAG|ATA | 0 | 1 | 52.791 |
| 130512867 | GT-AG | 0 | 0.008966939273893 | 251 | rna-gnl|WGS:VZSG|NOTPEN_R09847_mrna 23988670 | 19 | 4234288 | 4234538 | Nothoprocta pentlandii 2585814 | CAG|GTATGTATTT...TTTCTTTTAACA/TTTCTTTTAACA...TACAG|ATA | 2 | 1 | 55.795 |
| 130512868 | GT-AG | 0 | 1.000000099473604e-05 | 1445 | rna-gnl|WGS:VZSG|NOTPEN_R09847_mrna 23988670 | 20 | 4232059 | 4233503 | Nothoprocta pentlandii 2585814 | AAG|GTAAGTAGCA...TACTTTTTATCT/TTTTATCTAACT...TGAAG|GTC | 0 | 1 | 69.176 |
| 130512869 | GT-AG | 0 | 3.3768819830623486e-05 | 3902 | rna-gnl|WGS:VZSG|NOTPEN_R09847_mrna 23988670 | 21 | 4228001 | 4231902 | Nothoprocta pentlandii 2585814 | ATG|GTAAGATTTA...TTTTTCTTATTT/ATTTTTCTTATT...TACAG|ACA | 0 | 1 | 71.838 |
| 130512870 | GT-AG | 0 | 1.000000099473604e-05 | 1842 | rna-gnl|WGS:VZSG|NOTPEN_R09847_mrna 23988670 | 22 | 4225279 | 4227120 | Nothoprocta pentlandii 2585814 | CTG|GTGAGTTGTC...CTTCTTTTACTT/CTTGTGTTTACT...ATCAG|ATT | 1 | 1 | 86.858 |
| 130512871 | GT-AG | 0 | 1.000000099473604e-05 | 2541 | rna-gnl|WGS:VZSG|NOTPEN_R09847_mrna 23988670 | 23 | 4222469 | 4225009 | Nothoprocta pentlandii 2585814 | GAG|GTTAGTGCTA...TTTTTCTAATTG/CTTTTTCTAATT...AACAG|CTC | 0 | 1 | 91.449 |
| 130512872 | GT-AG | 0 | 1.000000099473604e-05 | 295 | rna-gnl|WGS:VZSG|NOTPEN_R09847_mrna 23988670 | 24 | 4221836 | 4222130 | Nothoprocta pentlandii 2585814 | TAG|GTAAGAATAA...TTCACCTTAACT/TTCACCTTAACT...TGCAG|TCC | 2 | 1 | 97.218 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL
,PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);