introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, probability (0-100%) that the intron is minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
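As a rough illustration of how these columns fit together, the sketch below models one row as a Python dataclass and checks the stated relationship between phase and in_cds. The names `Intron` and `phase_is_consistent` are hypothetical and not part of the database. The rule that phase is null exactly when in_cds is 0 is an assumption read off the column descriptions.

```python
from dataclasses import dataclass, replace
from typing import Optional


@dataclass(frozen=True)
class Intron:
    """One row of the introns table (hypothetical helper type)."""
    id: int
    dinucleotide_pair: str
    is_minor: int             # 1 = minor intron, 0 = not
    score: float              # probability of being minor, in percent (0-100)
    length: int               # base pairs
    transcript_id: int
    ordinal_index: int
    start: int
    end: int
    taxonomy_id: int
    scored_motifs: str
    phase: Optional[int]      # 0, 1, 2, or None outside the coding sequence
    in_cds: int               # 1 = inside the coding sequence, 0 = outside
    relative_position: float


def phase_is_consistent(intron: Intron) -> bool:
    """Assumed rule: phase is 0/1/2 inside the CDS and None outside it."""
    if intron.in_cds == 1:
        return intron.phase in (0, 1, 2)
    return intron.phase is None


# Values copied from the row with id 122606120 (transcript 22607847).
row = Intron(
    id=122606120, dinucleotide_pair="GT-AG", is_minor=0,
    score=1.000000099473604e-05, length=28054, transcript_id=22607847,
    ordinal_index=2, start=47230151, end=47258204, taxonomy_id=10093,
    scored_motifs="TAT|GTAAGTAAAT...GGAACCTTACTA/ACCTTACTAATA...TCCAG|GGC",
    phase=0, in_cds=1, relative_position=2.463,
)

# A made-up variant of the same row, flagged as outside the CDS but keeping a phase.
inconsistent = replace(row, in_cds=0)

print(phase_is_consistent(row), phase_is_consistent(inconsistent))
```

A frozen dataclass is used here only so that rows behave like immutable records; a plain tuple or `sqlite3.Row` would serve equally well.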
37 rows where transcript_id = 22607847
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 122606120 | GT-AG | 0 | 1.000000099473604e-05 | 28054 | rna-XM_029539287.1 22607847 | 2 | 47230151 | 47258204 | Mus pahari 10093 | TAT\|GTAAGTAAAT...GGAACCTTACTA/ACCTTACTAATA...TCCAG\|GGC | 0 | 1 | 2.463 |
| 122606121 | GT-AG | 0 | 1.000000099473604e-05 | 9094 | rna-XM_029539287.1 22607847 | 3 | 47258353 | 47267446 | Mus pahari 10093 | CTG\|GTAAGTATCT...ATTTTCTTTGCT/TCTTTCTTCAAT...TTTAG\|GTC | 1 | 1 | 4.199 |
| 122606122 | GT-AG | 0 | 1.000000099473604e-05 | 8169 | rna-XM_029539287.1 22607847 | 4 | 47267744 | 47275912 | Mus pahari 10093 | ATG\|GTGAGAAGCC...CAGGCTTTAATT/TTAATTTTCACG...CACAG\|ACG | 1 | 1 | 7.682 |
| 122606123 | GT-AG | 0 | 1.000000099473604e-05 | 17345 | rna-XM_029539287.1 22607847 | 5 | 47276097 | 47293441 | Mus pahari 10093 | TGA\|GTGAGTATCC...GATAGATTAACA/GATAGATTAACA...TGCAG\|GGA | 2 | 1 | 9.84 |
| 122606124 | GT-AG | 0 | 1.000000099473604e-05 | 4883 | rna-XM_029539287.1 22607847 | 6 | 47293635 | 47298517 | Mus pahari 10093 | ATG\|GTAAGGAAAA...CAATTTTTAAAA/TTTCTTTTCATG...CCTAG\|AAA | 0 | 1 | 12.104 |
| 122606125 | GT-AG | 0 | 1.000000099473604e-05 | 3462 | rna-XM_029539287.1 22607847 | 7 | 47301555 | 47305016 | Mus pahari 10093 | CAG\|GTAAGCAAAG...TTTTTGTTAATT/TTTTTGTTAATT...TTCAG\|AAA | 1 | 1 | 47.725 |
| 122606126 | GT-AG | 0 | 0.0001804349999216 | 45089 | rna-XM_029539287.1 22607847 | 8 | 47305142 | 47350230 | Mus pahari 10093 | GCG\|GTAAGCAGCA...GTTTCCTTATTG/AGTTTCCTTATT...TCCAG\|ATC | 0 | 1 | 49.191 |
| 122606127 | GT-AG | 0 | 0.007646106363745 | 3248 | rna-XM_029539287.1 22607847 | 9 | 47350307 | 47353554 | Mus pahari 10093 | CAG\|GTATGCATGC...AAATTTTTAAAA/AAATTTTTAAAA...TTCAG\|AAT | 1 | 1 | 50.082 |
| 122606128 | GT-AG | 0 | 1.000000099473604e-05 | 8666 | rna-XM_029539287.1 22607847 | 10 | 47353692 | 47362357 | Mus pahari 10093 | AAG\|GTACTGGATG...TTGGTTTCAACC/GTTGGTTTCAAC...TACAG\|CCT | 0 | 1 | 51.689 |
| 122606129 | GT-AG | 0 | 1.000000099473604e-05 | 3361 | rna-XM_029539287.1 22607847 | 11 | 47362729 | 47366089 | Mus pahari 10093 | GAG\|GTAAGACAGA...TCTCCATTACTG/TAGTGATTCATC...TTCAG\|TTC | 2 | 1 | 56.04 |
| 122606130 | GT-AG | 0 | 1.000000099473604e-05 | 3630 | rna-XM_029539287.1 22607847 | 12 | 47366144 | 47369773 | Mus pahari 10093 | AAG\|GTACAGAGTT...CTGCCTTTGATG/TTTGATGTCATC...TCCAG\|TAT | 2 | 1 | 56.674 |
| 122606131 | GT-AG | 0 | 0.0001133095473261 | 1651 | rna-XM_029539287.1 22607847 | 13 | 47369840 | 47371490 | Mus pahari 10093 | AAG\|GTATAGTATT...CCATCCTGAGAC/CAGTGACTCAAC...TTCAG\|TTT | 2 | 1 | 57.448 |
| 122606132 | GT-AG | 0 | 1.000000099473604e-05 | 1961 | rna-XM_029539287.1 22607847 | 14 | 47371681 | 47373641 | Mus pahari 10093 | CAG\|GTAAGGCTAA...TTTTTTTTATTT/TTTTTTTTTATT...TGTAG\|ACA | 0 | 1 | 59.676 |
| 122606133 | GT-AG | 0 | 1.000000099473604e-05 | 7961 | rna-XM_029539287.1 22607847 | 15 | 47373751 | 47381711 | Mus pahari 10093 | ACA\|GTGAGTATAT...CTAACTTTACCT/TGTGTGCTAACT...TCTAG\|ATT | 1 | 1 | 60.955 |
| 122606134 | GT-AG | 0 | 1.000000099473604e-05 | 1819 | rna-XM_029539287.1 22607847 | 16 | 47381755 | 47383573 | Mus pahari 10093 | GAG\|GTACGAGACG...ACATTTTTAGCC/CACTATTTAACA...TTCAG\|TAT | 2 | 1 | 61.459 |
| 122606135 | GT-AG | 0 | 1.000000099473604e-05 | 7196 | rna-XM_029539287.1 22607847 | 17 | 47383707 | 47390902 | Mus pahari 10093 | AAA\|GTAAGTACCT...TGATTCTTTTCT/TAACTGTTGATT...GGCAG\|GAA | 0 | 1 | 63.019 |
| 122606136 | GT-AG | 0 | 1.000000099473604e-05 | 15003 | rna-XM_029539287.1 22607847 | 18 | 47391078 | 47406080 | Mus pahari 10093 | CAA\|GTAAGAGACA...ATTTTCTTGTTG/CTTGTTGTTACC...TCCAG\|GCT | 1 | 1 | 65.072 |
| 122606137 | GT-AG | 0 | 1.000000099473604e-05 | 2406 | rna-XM_029539287.1 22607847 | 19 | 47406149 | 47408554 | Mus pahari 10093 | AAG\|GTAAGATATT...TTTACCTTCTTT/TGAGTTTTCAGT...TTTAG\|CAG | 0 | 1 | 65.869 |
| 122606138 | GT-AG | 0 | 1.000000099473604e-05 | 3404 | rna-XM_029539287.1 22607847 | 20 | 47408622 | 47412025 | Mus pahari 10093 | AGT\|GTAAGTAGCT...GATACTTCGATT/CTTCGATTAATA...TGCAG\|CTT | 1 | 1 | 66.655 |
| 122606139 | GT-AG | 0 | 1.000000099473604e-05 | 1456 | rna-XM_029539287.1 22607847 | 21 | 47412162 | 47413617 | Mus pahari 10093 | TGG\|GTAAGTGGGG...ACCTATTTGACA/ACCTATTTGACA...TTTAG\|AGT | 2 | 1 | 68.25 |
| 122606140 | GT-AG | 0 | 1.000000099473604e-05 | 558 | rna-XM_029539287.1 22607847 | 22 | 47413731 | 47414288 | Mus pahari 10093 | AGG\|GTAAGAGGAA...TTCTTTTTTTCT/TTTTTCTGTATT...TTGAG\|GAG | 1 | 1 | 69.575 |
| 122606141 | GT-AG | 0 | 1.000000099473604e-05 | 656 | rna-XM_029539287.1 22607847 | 23 | 47414442 | 47415097 | Mus pahari 10093 | ACG\|GTGAGGTGCT...ATGAGCTTAACT/ATGAGCTTAACT...CATAG\|AGT | 1 | 1 | 71.37 |
| 122606142 | GT-AG | 0 | 1.000000099473604e-05 | 1522 | rna-XM_029539287.1 22607847 | 24 | 47415349 | 47416870 | Mus pahari 10093 | CAG\|GTGAGCAGCG...ATACCCTGAATT/TATACCCTGAAT...AGCAG\|TTC | 0 | 1 | 74.314 |
| 122606143 | GT-AG | 0 | 0.000362860431126 | 112 | rna-XM_029539287.1 22607847 | 25 | 47416997 | 47417108 | Mus pahari 10093 | AAG\|GTACCGACCT...CCTTCCTTCATG/AGTTTATTCACT...TGCAG\|AAG | 0 | 1 | 75.792 |
| 122606144 | GT-AG | 0 | 1.000000099473604e-05 | 1063 | rna-XM_029539287.1 22607847 | 26 | 47417227 | 47418289 | Mus pahari 10093 | AAG\|GTGAGTCTCC...TTGGTCTTCTCT/GAGCCATTAAAG...CCAAG\|ACA | 1 | 1 | 77.176 |
| 122606145 | GT-AG | 0 | 1.000000099473604e-05 | 2080 | rna-XM_029539287.1 22607847 | 27 | 47418539 | 47420618 | Mus pahari 10093 | AAG\|GTAAGACTGC...GATTTCATGACA/CTGAACCTGATT...TGCAG\|AGG | 1 | 1 | 80.096 |
| 122606146 | GT-AG | 0 | 4.275136822576971e-05 | 649 | rna-XM_029539287.1 22607847 | 28 | 47420696 | 47421344 | Mus pahari 10093 | TTG\|GTAAGCTGAA...CACGTCTTATAT/ATTTTGTTTACT...TCCAG\|GAC | 0 | 1 | 80.999 |
| 122606147 | GT-AG | 0 | 1.000000099473604e-05 | 152 | rna-XM_029539287.1 22607847 | 29 | 47421527 | 47421678 | Mus pahari 10093 | CCT\|GTGAGTGAGC...CCTGCTTTATTT/GTTTGTTTCACC...GTCAG\|GAA | 2 | 1 | 83.134 |
| 122606148 | GT-AG | 0 | 1.000000099473604e-05 | 2884 | rna-XM_029539287.1 22607847 | 30 | 47421762 | 47424645 | Mus pahari 10093 | AAG\|GTAATGGAAA...TGATCCACAACT/GGTTTCTCCATT...TGCAG\|AAC | 1 | 1 | 84.107 |
| 122606149 | GT-AG | 0 | 1.000000099473604e-05 | 3114 | rna-XM_029539287.1 22607847 | 31 | 47424841 | 47427954 | Mus pahari 10093 | AAG\|GTAATTAACA...AGTGTTTTGTTT/ATCCAGCTGAGT...TGCAG\|TGG | 1 | 1 | 86.395 |
| 122606150 | GT-AG | 0 | 2.1357333477855856e-05 | 1047 | rna-XM_029539287.1 22607847 | 32 | 47428117 | 47429163 | Mus pahari 10093 | AAG\|GTAAACCATC...ACAACTTTAAAC/ACAACTTTAAAC...TTTAG\|GAG | 1 | 1 | 88.295 |
| 122606151 | GT-AG | 0 | 0.001122189308258 | 460 | rna-XM_029539287.1 22607847 | 33 | 47429235 | 47429694 | Mus pahari 10093 | AAG\|GTATGATTTT...ATTTTTTTAATG/ATTTTTTTAATG...TATAG\|GGT | 0 | 1 | 89.127 |
| 122606152 | GT-AG | 0 | 1.000000099473604e-05 | 2657 | rna-XM_029539287.1 22607847 | 34 | 47429740 | 47432396 | Mus pahari 10093 | GAG\|GTAAGGTTCT...AGATTTTTGACA/AGATTTTTGACA...GCTAG\|CAG | 0 | 1 | 89.655 |
| 122606153 | GT-AG | 0 | 4.944678866625298e-05 | 773 | rna-XM_029539287.1 22607847 | 35 | 47432448 | 47433220 | Mus pahari 10093 | CAG\|GTACATGTGC...TTCTCCTAGACC/CTAGACCTGAGT...GACAG\|GGT | 0 | 1 | 90.253 |
| 122606154 | GT-AG | 0 | 0.1662339872804001 | 1183 | rna-XM_029539287.1 22607847 | 36 | 47433671 | 47434853 | Mus pahari 10093 | CAG\|GTACCCAAGA...AGTTCCTTAGAA/TCTGGTTTAAAT...TGCAG\|GTT | 0 | 1 | 95.531 |
| 122606155 | GT-AG | 0 | 1.000000099473604e-05 | 1033 | rna-XM_029539287.1 22607847 | 37 | 47435185 | 47436217 | Mus pahari 10093 | GTG\|GTGAGTTGTC...TCACTCTTGGTT/CAGGTTCTCACT...TGCAG\|ACG | 1 | 1 | 99.414 |
| 122621580 | GT-AG | 0 | 1.000000099473604e-05 | 79309 | rna-XM_029539287.1 22607847 | 1 | 47150798 | 47230106 | Mus pahari 10093 | ACG\|GTGAGAGCGC...TTATTCTCATCT/TTTATTCTCATC...TTCAG\|TGT |  | 0 | 2.076 |
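In the rows above, length appears to equal end - start + 1, which would mean start and end are inclusive coordinates. That reading is an inference from the values shown, not something the page states. A small check over three rows copied from the table, with `inclusive_length` as a hypothetical helper name:

```python
# (start, end, length) copied from the rows with ids
# 122606120, 122606121 and 122606143.
rows = [
    (47230151, 47258204, 28054),
    (47258353, 47267446, 9094),
    (47416997, 47417108, 112),
]


def inclusive_length(start: int, end: int) -> int:
    """Length of the closed interval [start, end] (assumed convention)."""
    return end - start + 1


# Collect any rows where the stored length disagrees with the coordinates.
mismatches = [r for r in rows if inclusive_length(r[0], r[1]) != r[2]]
print(mismatches)
```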
CREATE TABLE "introns" (
    "id" INTEGER,
    "dinucleotide_pair" TEXT,
    "is_minor" INTEGER,
    "score" REAL,
    "length" INTEGER,
    "transcript_id" INTEGER,
    "ordinal_index" INTEGER,
    "start" INTEGER,
    "end" INTEGER,
    "taxonomy_id" INTEGER,
    "scored_motifs" TEXT,
    "phase" INTEGER,
    "in_cds" INTEGER,
    "relative_position" REAL,
    PRIMARY KEY ([id]),
    FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
    FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
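To show how the transcript_id index supports the per-transcript filter used on this page, here is a minimal sketch using Python's standard `sqlite3` module against an in-memory database. It assumes a trimmed copy of the schema (four columns only) and a handful of rows copied from the table, plus one made-up row for a different transcript. It does not connect to the real database.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Trimmed copy of the schema: only the columns this example needs.
conn.executescript("""
CREATE TABLE introns (
    id INTEGER PRIMARY KEY,
    transcript_id INTEGER,
    ordinal_index INTEGER,
    length INTEGER
);
CREATE INDEX idx_introns_transcript_id ON introns (transcript_id);
""")

conn.executemany(
    "INSERT INTO introns VALUES (?, ?, ?, ?)",
    [
        (122606120, 22607847, 2, 28054),
        (122606121, 22607847, 3, 9094),
        (122621580, 22607847, 1, 79309),
        (1, 99, 1, 100),  # made-up row belonging to another transcript
    ],
)

query = (
    "SELECT id, ordinal_index, length FROM introns "
    "WHERE transcript_id = ? ORDER BY ordinal_index"
)
rows = conn.execute(query, (22607847,)).fetchall()

# Each plan row is (id, parent, notused, detail); the detail text names
# the index when SQLite uses one. Exact wording varies by SQLite version.
plan = conn.execute("EXPLAIN QUERY PLAN " + query, (22607847,)).fetchall()
uses_index = any("idx_introns_transcript_id" in step[3] for step in plan)

print(rows)
print(uses_index)
```

Ordering by ordinal_index rather than id puts the first intron (id 122621580) at the top, which the default id ordering of the page does not.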