introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
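In the rows shown on this page, `length` equals `end - start + 1`, which suggests that `start` and `end` are inclusive coordinates. The following is a minimal sketch of that check, using values copied from three rows of the table. The helper name `intron_length` is hypothetical, and the inclusive-coordinate convention is inferred from the data rather than stated by the database.

```python
def intron_length(start: int, end: int) -> int:
    """Length in base pairs, ASSUMING inclusive start/end coordinates."""
    return end - start + 1


# (start, end, length) copied from three rows of this page
rows = [
    (18749593, 18749881, 289),
    (18748789, 18749511, 723),
    (18750061, 18752402, 2342),
]

# Collect any row whose stored length disagrees with the computed one
mismatches = [r for r in rows if intron_length(r[0], r[1]) != r[2]]
print(mismatches)  # prints []
```

An empty list means the three sampled rows are consistent with the inclusive-coordinate assumption; it does not prove the convention holds for every row in the database.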
36 rows where transcript_id = 3982020
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 20494992 | GT-AG | 0 | 1.000000099473604e-05 | 289 | rna-XM_036849454.1 3982020 | 2 | 18749593 | 18749881 | Balaenoptera musculus 9771 | CAG\|GTGGGAGCCT...ACTCTCTTTTCC/CCCAGCCAAACC...GGCAG\|CGA | 2 | 1 | 5.395 |
| 20494993 | GT-AG | 0 | 1.000000099473604e-05 | 723 | rna-XM_036849454.1 3982020 | 3 | 18748789 | 18749511 | Balaenoptera musculus 9771 | GTG\|GTAAGTCAGG...CCACCCCTGACA/CCACCCCTGACA...CGCAG\|GGA | 2 | 1 | 7.239 |
| 20494994 | GT-AG | 0 | 1.000000099473604e-05 | 724 | rna-XM_036849454.1 3982020 | 4 | 18747905 | 18748628 | Balaenoptera musculus 9771 | CAG\|GTGAGAGGGA...CGGCTCATAGAA/CCCCGGCTCATA...TCCAG\|ACA | 0 | 1 | 10.881 |
| 20494995 | GT-AG | 0 | 1.000000099473604e-05 | 3689 | rna-XM_036849454.1 3982020 | 5 | 18744058 | 18747746 | Balaenoptera musculus 9771 | CTG\|GTAAGCCCCC...GGATCCTTCCCA/CACAGAATCACA...TACAG\|GTA | 2 | 1 | 14.478 |
| 20494996 | GT-AG | 0 | 1.000000099473604e-05 | 1634 | rna-XM_036849454.1 3982020 | 6 | 18742328 | 18743961 | Balaenoptera musculus 9771 | GAG\|GTAAGACCAT...ATCTCCCTCCCT/GTGGAGCCCAGC...TCCAG\|ATG | 2 | 1 | 16.663 |
| 20494997 | GT-AG | 0 | 1.000000099473604e-05 | 1744 | rna-XM_036849454.1 3982020 | 7 | 18740495 | 18742238 | Balaenoptera musculus 9771 | GCA\|GTGAGTCCTG...TGAGCCTTCTCC/TGGGCTCTGAGC...TTCAG\|CTT | 1 | 1 | 18.689 |
| 20494998 | GT-AG | 0 | 1.000000099473604e-05 | 479 | rna-XM_036849454.1 3982020 | 8 | 18739971 | 18740449 | Balaenoptera musculus 9771 | AAA\|GTAAGGAGGC...CTGCCCTAAATT/TCTGCCCTAAAT...CCCAG\|GAC | 1 | 1 | 19.713 |
| 20494999 | GT-AG | 0 | 1.000000099473604e-05 | 1519 | rna-XM_036849454.1 3982020 | 9 | 18738311 | 18739829 | Balaenoptera musculus 9771 | ATG\|GTGAGGACCC...TCACCCTGGACC/CACGAGCTCACC...TGCAG\|GGA | 1 | 1 | 22.923 |
| 20495000 | GT-AG | 0 | 1.000000099473604e-05 | 627 | rna-XM_036849454.1 3982020 | 10 | 18737522 | 18738148 | Balaenoptera musculus 9771 | GAG\|GTGAGGCGGG...ACTCTCTTCCCA/TCTCTTCCCACC...TTCAG\|ATG | 1 | 1 | 26.611 |
| 20495001 | GT-AG | 0 | 1.000000099473604e-05 | 973 | rna-XM_036849454.1 3982020 | 11 | 18736407 | 18737379 | Balaenoptera musculus 9771 | GCT\|GTGAGTCGGT...GAGATGTGAGTC/GGAGATGTGAGT...CACAG\|GTG | 2 | 1 | 29.843 |
| 20495002 | GT-AG | 0 | 1.000000099473604e-05 | 685 | rna-XM_036849454.1 3982020 | 12 | 18735522 | 18736206 | Balaenoptera musculus 9771 | CAG\|GTGGGCACAG...CACCCCTTAGTT/TCTCCCCTAACA...ACTAG\|AGA | 1 | 1 | 34.396 |
| 20495003 | GT-AG | 0 | 1.000000099473604e-05 | 385 | rna-XM_036849454.1 3982020 | 13 | 18735083 | 18735467 | Balaenoptera musculus 9771 | AAG\|GTACGTGTGA...CGCCCCTGGATG/GGGTCTCTGATG...TCCAG\|ACT | 1 | 1 | 35.625 |
| 20495004 | GT-AG | 0 | 1.000000099473604e-05 | 2763 | rna-XM_036849454.1 3982020 | 14 | 18732192 | 18734954 | Balaenoptera musculus 9771 | AAG\|GTACGGAGCC...AACCACTTAACC/AACCACTTAACC...CACAG\|TCG | 0 | 1 | 38.539 |
| 20495005 | GT-AG | 0 | 0.0039408470993575 | 2593 | rna-XM_036849454.1 3982020 | 15 | 18729415 | 18732007 | Balaenoptera musculus 9771 | CAG\|GTACCTGTGT...AGCATCTTCACT/ACACTTCTGATT...TTCAG\|CCA | 1 | 1 | 42.727 |
| 20495006 | GT-AG | 0 | 1.000000099473604e-05 | 1962 | rna-XM_036849454.1 3982020 | 16 | 18727278 | 18729239 | Balaenoptera musculus 9771 | CAG\|GTGAGTCTGC...TTGGCCTTTCCT/CCTTTCCTGAAG...CCCAG\|GTT | 2 | 1 | 46.711 |
| 20495007 | GT-AG | 0 | 1.000000099473604e-05 | 397 | rna-XM_036849454.1 3982020 | 17 | 18726759 | 18727155 | Balaenoptera musculus 9771 | TTG\|GTAAGTCCAA...ACCCCCTTGCCT/GTGGCCCTCACC...TGTAG\|CTA | 1 | 1 | 49.488 |
| 20495008 | GT-AG | 0 | 1.000000099473604e-05 | 235 | rna-XM_036849454.1 3982020 | 18 | 18726336 | 18726570 | Balaenoptera musculus 9771 | AAG\|GTGAGGCTGG...CGTCCCCTGACT/CGTCCCCTGACT...TCAAG\|GTC | 0 | 1 | 53.767 |
| 20495009 | GT-AG | 0 | 1.000000099473604e-05 | 1132 | rna-XM_036849454.1 3982020 | 19 | 18725077 | 18726208 | Balaenoptera musculus 9771 | CAG\|GTGAGCGGCT...ATCTCCTTCTCT/TGGCTATCCATC...CACAG\|GAC | 1 | 1 | 56.658 |
| 20495010 | GT-AG | 0 | 1.000000099473604e-05 | 2747 | rna-XM_036849454.1 3982020 | 20 | 18722151 | 18724897 | Balaenoptera musculus 9771 | AGG\|GTGAGTCCCT...TCACTCTTGTCC/TCGGTGCTCACT...CCCAG\|ATT | 0 | 1 | 60.733 |
| 20495011 | GT-AG | 0 | 1.000000099473604e-05 | 914 | rna-XM_036849454.1 3982020 | 21 | 18721122 | 18722035 | Balaenoptera musculus 9771 | CAG\|GTAGGGGCAG...TGTTCCTAAATG/TTGTTCCTAAAT...CTCAG\|ATG | 1 | 1 | 63.351 |
| 20495012 | GT-AG | 0 | 1.000000099473604e-05 | 718 | rna-XM_036849454.1 3982020 | 22 | 18720247 | 18720964 | Balaenoptera musculus 9771 | CAA\|GTAAGTGAGG...CCCCTCTCAGAG/GCCCCTCTCAGA...TGCAG\|GTC | 2 | 1 | 66.925 |
| 20495013 | GT-AG | 0 | 1.000000099473604e-05 | 1502 | rna-XM_036849454.1 3982020 | 23 | 18718635 | 18720136 | Balaenoptera musculus 9771 | AAG\|GTGATGCCCC...TCTTCCTTGCTT/GGATATGTCATT...CCCAG\|AGC | 1 | 1 | 69.429 |
| 20495014 | GT-AG | 0 | 1.000000099473604e-05 | 695 | rna-XM_036849454.1 3982020 | 24 | 18717895 | 18718589 | Balaenoptera musculus 9771 | CAG\|GTAAGCGGTC...CCACCCTTTCCT/TCCCTGCCCACC...CCCAG\|TGA | 1 | 1 | 70.453 |
| 20495015 | GT-AG | 0 | 1.000000099473604e-05 | 1294 | rna-XM_036849454.1 3982020 | 25 | 18716464 | 18717757 | Balaenoptera musculus 9771 | CCG\|GTAATTGGGG...TGGGTTTTGTTC/TGGGAATGGAAA...CCCAG\|AAC | 0 | 1 | 73.572 |
| 20495016 | GT-AG | 0 | 1.000000099473604e-05 | 139 | rna-XM_036849454.1 3982020 | 26 | 18716180 | 18716318 | Balaenoptera musculus 9771 | ACG\|GTCAGCTGTG...TCCTCCTGGTTC/TCCTGGTTCACG...CACAG\|AGT | 1 | 1 | 76.872 |
| 20495017 | GT-AG | 0 | 1.000000099473604e-05 | 406 | rna-XM_036849454.1 3982020 | 27 | 18715714 | 18716119 | Balaenoptera musculus 9771 | AGG\|GTAAGAACTG...GGCGCTCTGACC/GGCGCTCTGACC...CATAG\|GTC | 1 | 1 | 78.238 |
| 20495018 | GC-AG | 0 | 1.000000099473604e-05 | 1076 | rna-XM_036849454.1 3982020 | 28 | 18714570 | 18715645 | Balaenoptera musculus 9771 | AAG\|GCAAAGCCCC...GTCTCCTCACTC/AGTCTCCTCACT...CTCAG\|GTG | 0 | 1 | 79.786 |
| 20495019 | GT-AG | 0 | 1.000000099473604e-05 | 1751 | rna-XM_036849454.1 3982020 | 29 | 18712708 | 18714458 | Balaenoptera musculus 9771 | GAG\|GTAAGTGCAG...ACCGCCTTCTCA/CTGGAACTCATG...TCCAG\|CTT | 0 | 1 | 82.313 |
| 20495020 | GT-AG | 0 | 1.000000099473604e-05 | 516 | rna-XM_036849454.1 3982020 | 30 | 18712104 | 18712619 | Balaenoptera musculus 9771 | AAG\|GTGGGAGCTG...GAGCCCTTCATC/CATCAGCTGACC...TCCAG\|CCC | 1 | 1 | 84.316 |
| 20495021 | GT-AG | 0 | 1.000000099473604e-05 | 244 | rna-XM_036849454.1 3982020 | 31 | 18711821 | 18712064 | Balaenoptera musculus 9771 | GTG\|GTAAGCAAGC...TCACCCTTCTCC/GGAAGACTCACC...CCCAG\|CCC | 1 | 1 | 85.204 |
| 20495022 | GT-AG | 0 | 0.0001185854497075 | 1153 | rna-XM_036849454.1 3982020 | 32 | 18710559 | 18711711 | Balaenoptera musculus 9771 | CAA\|GTAGGTTGGC...ATTTCTTTGCCC/TTTGCCCTCACG...TTCAG\|AGA | 2 | 1 | 87.685 |
| 20495023 | GT-AG | 0 | 1.000000099473604e-05 | 223 | rna-XM_036849454.1 3982020 | 33 | 18710172 | 18710394 | Balaenoptera musculus 9771 | AAG\|GTGAGCTCTC...GATGCCCTGACG/GATGCCCTGACG...CATAG\|CTT | 1 | 1 | 91.418 |
| 20495024 | GT-AG | 0 | 0.0017085483171547 | 177 | rna-XM_036849454.1 3982020 | 34 | 18709958 | 18710134 | Balaenoptera musculus 9771 | GAA\|GTAAGTTTCC...TGTCCTTTAACA/CTTTTTCCCATT...TTCAG\|AGC | 2 | 1 | 92.26 |
| 20495025 | GT-AG | 0 | 1.000000099473604e-05 | 1616 | rna-XM_036849454.1 3982020 | 35 | 18708319 | 18709934 | Balaenoptera musculus 9771 | AGA\|GTAAGTCCAG...TCGTTTTTCCCT/AAAACATTCATC...TCCAG\|ATC | 1 | 1 | 92.784 |
| 20495026 | GT-AG | 0 | 1.000000099473604e-05 | 1012 | rna-XM_036849454.1 3982020 | 36 | 18707251 | 18708262 | Balaenoptera musculus 9771 | AAG\|GTACTGACTG...GTTCCCTTCCTG/GGTGGACTGAGC...AACAG\|ACC | 0 | 1 | 94.059 |
| 20508177 | GT-AG | 0 | 1.000000099473604e-05 | 2342 | rna-XM_036849454.1 3982020 | 1 | 18750061 | 18752402 | Balaenoptera musculus 9771 | TCT\|GTGAGTCACC...TGCACCTTGGCT/GGTGGAGTGACC...GGCAG\|GTT | | 0 | 2.345 |
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL,
PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
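
The table on this page is sorted by `id`, which places the first intron of the transcript (`ordinal_index` 1, `id` 20508177) in the last row. A minimal sketch of retrieving the introns in transcript order, using Python's standard `sqlite3` module, is shown here. It assumes a trimmed-down, in-memory copy of the `introns` table holding only five of the columns and three rows copied from this page; it is not the database's own access method.

```python
import sqlite3

# In-memory stand-in for the real database (ASSUMPTION: reduced column set)
conn = sqlite3.connect(":memory:")
conn.execute(
    """
    CREATE TABLE introns (
        id INTEGER PRIMARY KEY,
        dinucleotide_pair TEXT,
        length INTEGER,
        transcript_id INTEGER,
        ordinal_index INTEGER
    )
    """
)
conn.execute("CREATE INDEX idx_introns_transcript_id ON introns (transcript_id)")

# Three rows copied from this page: (id, dinucleotide_pair, length,
# transcript_id, ordinal_index)
sample = [
    (20494992, "GT-AG", 289, 3982020, 2),
    (20494993, "GT-AG", 723, 3982020, 3),
    (20508177, "GT-AG", 2342, 3982020, 1),
]
conn.executemany("INSERT INTO introns VALUES (?, ?, ?, ?, ?)", sample)

# Filter by transcript and sort by position in the transcript, not by id
ordered = conn.execute(
    "SELECT ordinal_index, id, length FROM introns "
    "WHERE transcript_id = ? ORDER BY ordinal_index",
    (3982020,),
).fetchall()
print(ordered)
# prints [(1, 20508177, 2342), (2, 20494992, 289), (3, 20494993, 723)]
```

Passing the transcript identifier as a bound parameter (`?`) rather than formatting it into the SQL string keeps the query safe to reuse with arbitrary input.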