introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
40 rows where transcript_id = 3981966
This data as json, CSV (advanced)
Suggested facets: is_minor, score, phase
| id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 20493442 | GT-AG | 0 | 1.000000099473604e-05 | 9966 | rna-XM_036831471.1 3981966 | 1 | 4255218 | 4265183 | Balaenoptera musculus 9771 | CAG|GTGAGCGGGA...GTCTCCCTGAAG/CCACTTCTGATT...TCCAG|AAG | 1 | 1 | 1.329 |
| 20493443 | GT-AG | 0 | 1.000000099473604e-05 | 7655 | rna-XM_036831471.1 3981966 | 2 | 4247435 | 4255089 | Balaenoptera musculus 9771 | GAG|GTGAGTGGAG...TTGTCCTCACCG/GTTGTCCTCACC...TCCAG|GGG | 0 | 1 | 3.482 |
| 20493444 | GT-AG | 0 | 0.0010547983012041 | 2424 | rna-XM_036831471.1 3981966 | 3 | 4244831 | 4247254 | Balaenoptera musculus 9771 | AAG|GTAACCCTGA...CACCTCTCATTC/CCACCTCTCATT...TGTAG|GAA | 0 | 1 | 6.51 |
| 20493445 | GT-AG | 0 | 1.000000099473604e-05 | 591 | rna-XM_036831471.1 3981966 | 4 | 4244121 | 4244711 | Balaenoptera musculus 9771 | CAG|GTGGGCCTCG...CTGTCCTCACCC/GCTGTCCTCACC...CACAG|GCC | 2 | 1 | 8.511 |
| 20493446 | GT-AG | 0 | 1.000000099473604e-05 | 1410 | rna-XM_036831471.1 3981966 | 5 | 4242472 | 4243881 | Balaenoptera musculus 9771 | AGG|GTAAGGCTGC...TGAGCCGTGGCG/GCGCTGGTTATT...TTCAG|GGC | 1 | 1 | 12.532 |
| 20493447 | GT-AG | 0 | 1.000000099473604e-05 | 1226 | rna-XM_036831471.1 3981966 | 6 | 4241121 | 4242346 | Balaenoptera musculus 9771 | TCG|GTGAGTGTGA...CTCCCCTGCACC/TCTGGGCTCACA...CACAG|AGC | 0 | 1 | 14.634 |
| 20493448 | GT-AG | 0 | 1.000000099473604e-05 | 607 | rna-XM_036831471.1 3981966 | 7 | 4240390 | 4240996 | Balaenoptera musculus 9771 | GAA|GTAAGACTCC...CCCTCCATGACT/CCCTCCATGACT...CCCAG|TTG | 1 | 1 | 16.72 |
| 20493449 | GT-AG | 0 | 0.0004579130145028 | 179 | rna-XM_036831471.1 3981966 | 8 | 4240044 | 4240222 | Balaenoptera musculus 9771 | TGT|GTAAGCCCTT...TCCACCTGGACT/GGTGCCTCCACC...TCCAG|GAG | 0 | 1 | 19.529 |
| 20493450 | GT-AG | 0 | 1.000000099473604e-05 | 1076 | rna-XM_036831471.1 3981966 | 9 | 4238746 | 4239821 | Balaenoptera musculus 9771 | ACT|GTGAGTGTTA...AAGCCCTGAACT/TAAGCCCTGAAC...CACAG|TGT | 0 | 1 | 23.263 |
| 20493451 | GT-AG | 0 | 1.000000099473604e-05 | 213 | rna-XM_036831471.1 3981966 | 10 | 4238326 | 4238538 | Balaenoptera musculus 9771 | CAG|GTGAGGCACC...GTCCCCTTCGCG/CCTCCTGTAACC...AGCAG|CTG | 0 | 1 | 26.745 |
| 20493452 | GT-AG | 0 | 1.000000099473604e-05 | 2522 | rna-XM_036831471.1 3981966 | 11 | 4235592 | 4238113 | Balaenoptera musculus 9771 | CAG|GTGTGTGCTC...GGACTCTTGCCC/CACCTGTTTACC...TGCAG|CTT | 2 | 1 | 30.311 |
| 20493453 | GT-AG | 0 | 1.000000099473604e-05 | 85 | rna-XM_036831471.1 3981966 | 12 | 4235375 | 4235459 | Balaenoptera musculus 9771 | CAG|GTGGGCCACG...CAGGCCTGAGCT/TCTGCTCTCATC...CCCAG|GGA | 2 | 1 | 32.532 |
| 20493454 | GT-AG | 0 | 1.000000099473604e-05 | 1151 | rna-XM_036831471.1 3981966 | 13 | 4234115 | 4235265 | Balaenoptera musculus 9771 | GAT|GTGAGCGGGG...TACGTCTGAACC/GTACGTCTGAAC...TGCAG|CCC | 0 | 1 | 34.365 |
| 20493455 | GT-AG | 0 | 1.000000099473604e-05 | 87 | rna-XM_036831471.1 3981966 | 14 | 4233836 | 4233922 | Balaenoptera musculus 9771 | GAG|GTTTGAGGCC...GCAGCCTCAGCG/AGCAGCCTCAGC...CCCAG|GGC | 0 | 1 | 37.595 |
| 20493456 | GT-AG | 0 | 1.000000099473604e-05 | 2849 | rna-XM_036831471.1 3981966 | 15 | 4230786 | 4233634 | Balaenoptera musculus 9771 | AAG|GTGAGCACTC...CCTCCCTTCCCC/TCTCCTCCCAAA...TCCAG|AAA | 0 | 1 | 40.976 |
| 20493457 | GT-AG | 0 | 1.000000099473604e-05 | 90 | rna-XM_036831471.1 3981966 | 16 | 4230558 | 4230647 | Balaenoptera musculus 9771 | AAG|GTAGGAAGTG...TGCCCTCTGACA/TGCCCTCTGACA...CCAAG|TTC | 0 | 1 | 43.297 |
| 20493458 | GT-AG | 0 | 1.000000099473604e-05 | 814 | rna-XM_036831471.1 3981966 | 17 | 4229622 | 4230435 | Balaenoptera musculus 9771 | CAA|GTGAGTCAGG...AGGGCTTTGGCC/CAGGCTCTCATC...TGCAG|CAA | 2 | 1 | 45.349 |
| 20493459 | GT-AG | 0 | 1.000000099473604e-05 | 380 | rna-XM_036831471.1 3981966 | 18 | 4229068 | 4229447 | Balaenoptera musculus 9771 | GAA|GTGAGCAGCC...CTTGCCTTTCCC/GGTGTATCCAGC...GGCAG|GAA | 2 | 1 | 48.276 |
| 20493460 | GT-AG | 0 | 1.000000099473604e-05 | 1225 | rna-XM_036831471.1 3981966 | 19 | 4227701 | 4228925 | Balaenoptera musculus 9771 | GTG|GTAGGTCCCC...GAGCCCCCAATG/CAATGCCCCACC...CACAG|GAG | 0 | 1 | 50.664 |
| 20493461 | GT-AG | 0 | 1.000000099473604e-05 | 2704 | rna-XM_036831471.1 3981966 | 20 | 4224865 | 4227568 | Balaenoptera musculus 9771 | CAG|GTGAGCTGCC...CATACCATGTCC/CAGTGCCTCAGG...TGCAG|ATG | 0 | 1 | 52.885 |
| 20493462 | GT-AG | 0 | 0.0943202056100971 | 1348 | rna-XM_036831471.1 3981966 | 21 | 4223399 | 4224746 | Balaenoptera musculus 9771 | ACG|GTACCCCCGC...TGCCCCTCACTC/CCCTCACTCACC...CTCAG|CCC | 1 | 1 | 54.87 |
| 20493463 | GT-AG | 0 | 1.000000099473604e-05 | 1058 | rna-XM_036831471.1 3981966 | 22 | 4222216 | 4223273 | Balaenoptera musculus 9771 | CAG|GTGTGTGGCC...GGGTGCTAAGCT/TGGGTGCTAAGC...CGCAG|GCC | 0 | 1 | 56.972 |
| 20493464 | GT-AG | 0 | 1.000000099473604e-05 | 185 | rna-XM_036831471.1 3981966 | 23 | 4221799 | 4221983 | Balaenoptera musculus 9771 | AGG|GTGGGTATGC...GCCCCCCCAGCT/CGGCCCCCCACC...TACAG|GCA | 1 | 1 | 60.875 |
| 20493465 | GT-AG | 0 | 1.000000099473604e-05 | 252 | rna-XM_036831471.1 3981966 | 24 | 4221436 | 4221687 | Balaenoptera musculus 9771 | CAG|GTAGGGGCCC...GGTCCCCCAACT/CAACTCTGCACC...CCCAG|GCG | 1 | 1 | 62.742 |
| 20493466 | GT-AG | 0 | 1.000000099473604e-05 | 522 | rna-XM_036831471.1 3981966 | 25 | 4220741 | 4221262 | Balaenoptera musculus 9771 | GTG|GTAAGCCCCG...CCCGCCCTCACG/CCCGCCCTCACG...CCCAG|GAG | 0 | 1 | 65.652 |
| 20493467 | GT-AG | 0 | 1.000000099473604e-05 | 461 | rna-XM_036831471.1 3981966 | 26 | 4220105 | 4220565 | Balaenoptera musculus 9771 | AGG|GTGCGTGTCC...TCTGCCTGACCC/CTCTGCCTGACC...CACAG|AGT | 1 | 1 | 68.595 |
| 20493468 | GT-AG | 0 | 1.000000099473604e-05 | 200 | rna-XM_036831471.1 3981966 | 27 | 4219812 | 4220011 | Balaenoptera musculus 9771 | AGA|GTGAGTCTCA...GGACCCTGGGCC/GGTTGGGTCAGG...CCCAG|GTG | 1 | 1 | 70.16 |
| 20493469 | GT-AG | 0 | 1.000000099473604e-05 | 307 | rna-XM_036831471.1 3981966 | 28 | 4219416 | 4219722 | Balaenoptera musculus 9771 | GAG|GTGGGGCCTG...CATCTCTGAATT/CGCTCCCTCACC...CCCAG|GTG | 0 | 1 | 71.657 |
| 20493470 | GT-AG | 0 | 1.000000099473604e-05 | 425 | rna-XM_036831471.1 3981966 | 29 | 4218857 | 4219281 | Balaenoptera musculus 9771 | TAG|GTAAAGGGGC...ACTCCCTGAGCC/CACTCCCTGAGC...CCCAG|AGC | 2 | 1 | 73.911 |
| 20493471 | GT-AG | 0 | 1.000000099473604e-05 | 385 | rna-XM_036831471.1 3981966 | 30 | 4218327 | 4218711 | Balaenoptera musculus 9771 | AAG|GTGGGTGAGT...CAGTTCTGAGCT/TCAGTTCTGAGC...CCCAG|GTT | 0 | 1 | 76.35 |
| 20493472 | GT-AG | 0 | 1.000000099473604e-05 | 2042 | rna-XM_036831471.1 3981966 | 31 | 4216125 | 4218166 | Balaenoptera musculus 9771 | CAG|GTCTGGGCTC...TCTCCCTTGATC/ACCTGTCTCACC...TTCAG|ACA | 1 | 1 | 79.041 |
| 20493473 | GT-AG | 0 | 1.000000099473604e-05 | 244 | rna-XM_036831471.1 3981966 | 32 | 4215801 | 4216044 | Balaenoptera musculus 9771 | CAG|GTATGGAAGA...GGCCCCGCAGCT/CTCCTGTTGAGG...CCTAG|GCC | 0 | 1 | 80.387 |
| 20493474 | GT-AG | 0 | 1.000000099473604e-05 | 5517 | rna-XM_036831471.1 3981966 | 33 | 4210151 | 4215667 | Balaenoptera musculus 9771 | AAG|GTGCGGGCTT...CACACTTTGTCT/TGTGAACACACT...TGCAG|AAG | 1 | 1 | 82.624 |
| 20493475 | GT-AG | 0 | 1.000000099473604e-05 | 631 | rna-XM_036831471.1 3981966 | 34 | 4209430 | 4210060 | Balaenoptera musculus 9771 | CAG|GTTTGGGGCT...AGTGCCTTTGGG/TTGGGATGGATA...CACAG|ATG | 1 | 1 | 84.138 |
| 20493476 | GT-AG | 0 | 1.000000099473604e-05 | 269 | rna-XM_036831471.1 3981966 | 35 | 4209023 | 4209291 | Balaenoptera musculus 9771 | CAG|GTACTGGGGT...CTCCCCTTGGCT/GGGCACCTCATT...CCCAG|AGC | 1 | 1 | 86.459 |
| 20493477 | GT-AG | 1 | 99.33516531766227 | 1628 | rna-XM_036831471.1 3981966 | 36 | 4207286 | 4208913 | Balaenoptera musculus 9771 | GAC|GTATCCTTGG...TTGCCCTTAACG/TTTGCCCTTAAC...CCCAG|GCA | 2 | 1 | 88.293 |
| 20493478 | GT-AG | 0 | 1.000000099473604e-05 | 2137 | rna-XM_036831471.1 3981966 | 37 | 4205016 | 4207152 | Balaenoptera musculus 9771 | AAG|GTCAGTCCTT...CTCTCCTCCCCC/CCAGCGCTCAGT...CTCAG|CTG | 0 | 1 | 90.53 |
| 20493479 | GT-AG | 0 | 1.000000099473604e-05 | 1134 | rna-XM_036831471.1 3981966 | 38 | 4203686 | 4204819 | Balaenoptera musculus 9771 | AGG|GTGAGCGAGC...CTCCCCTTCCCT/CCGCGCCTCCCC...CCCAG|TCC | 1 | 1 | 93.827 |
| 20493480 | GT-AG | 0 | 1.000000099473604e-05 | 114 | rna-XM_036831471.1 3981966 | 39 | 4203408 | 4203521 | Balaenoptera musculus 9771 | CAG|GTCAGGCCCG...GGACCTTTAAAC/TTAAACCTGAGT...TCCAG|GGC | 0 | 1 | 96.585 |
| 20493481 | GT-AG | 0 | 1.000000099473604e-05 | 105 | rna-XM_036831471.1 3981966 | 40 | 4203188 | 4203292 | Balaenoptera musculus 9771 | CCG|GTAGGTGCCC...CTCTCCTCTCTT/GGCAAGCCGAGC...CGCAG|ATA | 1 | 1 | 98.52 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL
,PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);