introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
29 rows where transcript_id = 3982012
This data as json, CSV (advanced)
Suggested facets: dinucleotide_pair, score, phase, in_cds
| id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 20494813 | GT-AG | 0 | 1.000000099473604e-05 | 873 | rna-XM_036846234.1 3982012 | 3 | 81970210 | 81971082 | Balaenoptera musculus 9771 | TAG|GTGAGTGAGT...TTTTTCTTTTTT/TGAATACTTAAA...TCTAG|TGT | 2 | 1 | 11.932 |
| 20494814 | GT-AG | 0 | 1.000000099473604e-05 | 2205 | rna-XM_036846234.1 3982012 | 4 | 81971258 | 81973462 | Balaenoptera musculus 9771 | AAG|GTTATTATTT...TAATGCTTAATG/TTCTTTTTCATT...TTCAG|ATG | 0 | 1 | 15.514 |
| 20494815 | GT-AG | 0 | 0.0107373588441427 | 4482 | rna-XM_036846234.1 3982012 | 5 | 81973625 | 81978106 | Balaenoptera musculus 9771 | AAA|GTATGTTTCT...ATTAATTTATTT/GATTAATTTATT...TTTAG|GTT | 0 | 1 | 18.829 |
| 20494816 | GT-AG | 0 | 1.000000099473604e-05 | 1207 | rna-XM_036846234.1 3982012 | 6 | 81978214 | 81979420 | Balaenoptera musculus 9771 | GAG|GTAAAACTTA...CAGCCTTTAAAA/AATCTCTTCAAC...TATAG|AGA | 2 | 1 | 21.019 |
| 20494817 | GT-AG | 0 | 0.0011633100164206 | 9818 | rna-XM_036846234.1 3982012 | 7 | 81979550 | 81989367 | Balaenoptera musculus 9771 | GAG|GTATGTGTTT...CATTCTTTACTT/CTTTTATTAATT...TTTAG|AGC | 2 | 1 | 23.659 |
| 20494818 | GT-AG | 0 | 1.000000099473604e-05 | 1411 | rna-XM_036846234.1 3982012 | 8 | 81989465 | 81990875 | Balaenoptera musculus 9771 | AAG|GTAAGAGGCA...ATCCACTTAAAA/CTGAAACTTATG...TTTAG|GTT | 0 | 1 | 25.645 |
| 20494819 | GT-AG | 0 | 2.31001945088026e-05 | 930 | rna-XM_036846234.1 3982012 | 9 | 81990998 | 81991927 | Balaenoptera musculus 9771 | AAG|GTACAGTATC...TATGTTTTATTT/TTTATTTTCATT...CATAG|GCA | 2 | 1 | 28.142 |
| 20494820 | GT-AG | 0 | 0.0001130304211828 | 2660 | rna-XM_036846234.1 3982012 | 10 | 81992220 | 81994879 | Balaenoptera musculus 9771 | CCA|GTAAGTATGT...TTGTTTTTAAAC/TTGTTTTTAAAC...TGCAG|TTA | 0 | 1 | 34.118 |
| 20494821 | GT-AG | 0 | 3.174620872585153e-05 | 861 | rna-XM_036846234.1 3982012 | 11 | 81995005 | 81995865 | Balaenoptera musculus 9771 | GAG|GTACGGTTTT...ATATTTTTATAG/ATCTTTCTAATA...GCCAG|GAT | 2 | 1 | 36.676 |
| 20494822 | GT-AG | 0 | 1.000000099473604e-05 | 6039 | rna-XM_036846234.1 3982012 | 12 | 81996027 | 82002065 | Balaenoptera musculus 9771 | GAG|GTAAGTATAT...TATTCTTTAAGT/TTTATGTTTATT...AACAG|AAA | 1 | 1 | 39.971 |
| 20494823 | GT-AG | 0 | 0.0034032242052753 | 1139 | rna-XM_036846234.1 3982012 | 13 | 82002287 | 82003425 | Balaenoptera musculus 9771 | GAA|GTAAGCTTGA...AATTTTTTATTT/TTTTTATTTATT...TTCAG|ATG | 0 | 1 | 44.494 |
| 20494824 | GT-AG | 0 | 1.000000099473604e-05 | 477 | rna-XM_036846234.1 3982012 | 14 | 82003537 | 82004013 | Balaenoptera musculus 9771 | ATG|GTAAAAATTA...TCGTTCTTAATT/TCGTTCTTAATT...TCTAG|GAA | 0 | 1 | 46.766 |
| 20494825 | GT-AG | 0 | 1.000000099473604e-05 | 18262 | rna-XM_036846234.1 3982012 | 15 | 82004146 | 82022407 | Balaenoptera musculus 9771 | AGA|GTAAGATATT...TGAAAATTAATA/TGAAAATTAATA...TTTAG|CTT | 0 | 1 | 49.468 |
| 20494826 | GT-AG | 0 | 1.000000099473604e-05 | 171 | rna-XM_036846234.1 3982012 | 16 | 82022543 | 82022713 | Balaenoptera musculus 9771 | GGG|GTAAAATCAT...ATATTTTTGCTA/AAAATATTTATA...TTCAG|GAA | 0 | 1 | 52.231 |
| 20494827 | GC-AG | 0 | 1.000000099473604e-05 | 5938 | rna-XM_036846234.1 3982012 | 17 | 82022792 | 82028729 | Balaenoptera musculus 9771 | AAG|GCTAGTATGC...TTTTCCTTTTTT/TATTGGTTGATT...CTTAG|GTC | 0 | 1 | 53.827 |
| 20494828 | GT-AG | 0 | 0.0001214511259546 | 5093 | rna-XM_036846234.1 3982012 | 18 | 82028847 | 82033939 | Balaenoptera musculus 9771 | GAA|GTAAATTAAC...TGCTTTTTATTT/TTGCTTTTTATT...GTTAG|GTA | 0 | 1 | 56.222 |
| 20494829 | GT-AG | 0 | 0.0087558433505472 | 2906 | rna-XM_036846234.1 3982012 | 19 | 82034108 | 82037013 | Balaenoptera musculus 9771 | CAG|GTATATATTA...AAATACTTAACT/CTGTGTTTGAAT...ACAAG|GTA | 0 | 1 | 59.66 |
| 20494830 | GT-AG | 0 | 1.000000099473604e-05 | 2911 | rna-XM_036846234.1 3982012 | 20 | 82037157 | 82040067 | Balaenoptera musculus 9771 | CAG|GTAAAATGAA...TTTATTTTATTT/TTATTATTTATT...ATTAG|GCA | 2 | 1 | 62.587 |
| 20494831 | GT-AG | 0 | 1.000000099473604e-05 | 217 | rna-XM_036846234.1 3982012 | 21 | 82040232 | 82040448 | Balaenoptera musculus 9771 | CTG|GTAAGCAACA...TAATTCTAAAAT/ATAATTCTAAAA...AACAG|TGA | 1 | 1 | 65.944 |
| 20494832 | GT-AG | 0 | 1.000000099473604e-05 | 5556 | rna-XM_036846234.1 3982012 | 22 | 82040663 | 82046218 | Balaenoptera musculus 9771 | TAG|GTGAGTTAAA...TTCACTTTGATA/ATCTTTTTCACT...TGCAG|GGA | 2 | 1 | 70.323 |
| 20494833 | GT-AG | 0 | 1.000000099473604e-05 | 3427 | rna-XM_036846234.1 3982012 | 23 | 82046319 | 82049745 | Balaenoptera musculus 9771 | CAA|GTTGGTTTTC...GACAGCTTAATG/ATAAAATTGAAT...CTTAG|GTT | 0 | 1 | 72.37 |
| 20494834 | GT-AG | 0 | 0.0002617312620775 | 1232 | rna-XM_036846234.1 3982012 | 24 | 82049863 | 82051094 | Balaenoptera musculus 9771 | CAG|GTAAATTATT...TTTTCTTTAGTT/TTTTTCTTTAGT...TTCAG|ATA | 0 | 1 | 74.765 |
| 20494835 | GT-AG | 0 | 5.059401657197975e-05 | 1599 | rna-XM_036846234.1 3982012 | 25 | 82051239 | 82052837 | Balaenoptera musculus 9771 | AGT|GTAAGCAAAT...TTTATTTTGAAA/TATCATTTTATT...TTCAG|GTT | 0 | 1 | 77.712 |
| 20494836 | GT-AG | 0 | 1.000000099473604e-05 | 2541 | rna-XM_036846234.1 3982012 | 26 | 82053042 | 82055582 | Balaenoptera musculus 9771 | AAA|GTAATGCCTA...TAACCTTTACTT/CCTTTACTTAAT...TTAAG|GAA | 0 | 1 | 81.887 |
| 20494837 | GT-AG | 0 | 1.000000099473604e-05 | 4088 | rna-XM_036846234.1 3982012 | 27 | 82055721 | 82059808 | Balaenoptera musculus 9771 | GAG|GTAAATAAAA...CTTTCCTTTTCA/TTCCTTTTCAAT...TGTAG|ATG | 0 | 1 | 84.711 |
| 20494838 | GT-AG | 0 | 1.000000099473604e-05 | 9513 | rna-XM_036846234.1 3982012 | 28 | 82060010 | 82069522 | Balaenoptera musculus 9771 | AAA|GTAAGTAAAT...AAATACGTAATC/ATTCATTTAAGA...TACAG|GAA | 0 | 1 | 88.825 |
| 20494839 | GT-AG | 0 | 0.0007854356561165 | 13541 | rna-XM_036846234.1 3982012 | 29 | 82069991 | 82083531 | Balaenoptera musculus 9771 | CAG|GTATGTATTT...AAGTTCTTTTCC/TAAAATATTACG...TCTAG|AGC | 0 | 1 | 98.404 |
| 20508168 | GT-AG | 0 | 1.000000099473604e-05 | 883 | rna-XM_036846234.1 3982012 | 1 | 81967470 | 81968352 | Balaenoptera musculus 9771 | TCG|GTGGGTCTCT...TTTGCCTAAATT/TTTTCTTTCACC...TATAG|GGA | 0 | 5.424 | |
| 20508169 | GT-AG | 0 | 1.000000099473604e-05 | 1536 | rna-XM_036846234.1 3982012 | 2 | 81968537 | 81970072 | Balaenoptera musculus 9771 | AAG|GTGAGAGTTT...GATTCTTTCAGG/AAGAAATTCATT...TTTAG|GAA | 0 | 9.19 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL
,PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);