introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron (e.g., GT-AG)
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, estimated probability, as a percentage (0-100), that the intron is minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
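The rows listed on this page suggest that start and end are inclusive coordinates, so that length equals end - start + 1. The source does not state this convention, so the short check below is only a sketch based on three rows copied from the table; the helper name `intron_length` is hypothetical.

```python
# (start, end, length) copied from three rows of the table on this page.
rows = [
    (67287503, 67405954, 118452),
    (67406243, 67409565, 3323),
    (67067270, 67206333, 139064),
]


def intron_length(start: int, end: int) -> int:
    """Length in bp, ASSUMING start and end are both inclusive."""
    return end - start + 1


# Each listed length matches the inclusive-coordinate formula.
for start, end, length in rows:
    assert intron_length(start, end) == length
```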
33 rows where transcript_id = 22607846
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 122606089 | GT-AG | 0 | 1.000000099473604e-05 | 118452 | rna-XM_029539617.1 22607846 | 3 | 67287503 | 67405954 | Mus pahari 10093 | CAG\|GTCAGTCCCT...CTTTCCTTGATC/CTTTCCTTGATC...TGCAG\|ACC | 1 | 1 | 3.799 |
| 122606090 | GT-AG | 0 | 1.000000099473604e-05 | 3323 | rna-XM_029539617.1 22607846 | 4 | 67406243 | 67409565 | Mus pahari 10093 | CAG\|GTGAGTGAGT...ATTATCTTGTTT/CCTATGATTATC...CTCAG\|GCA | 1 | 1 | 7.026 |
| 122606091 | GT-AG | 0 | 1.000000099473604e-05 | 94540 | rna-XM_029539617.1 22607846 | 5 | 67409836 | 67504375 | Mus pahari 10093 | CCG\|GTGAGTGGCT...TTTTCATTATTT/TCGATTTTCATT...CGCAG\|GTG | 1 | 1 | 10.052 |
| 122606092 | GT-AG | 0 | 1.000000099473604e-05 | 51368 | rna-XM_029539617.1 22607846 | 6 | 67504586 | 67555953 | Mus pahari 10093 | ACG\|GTAAGTGCTT...CTGACCATGATT/CGCCCACTGACC...CCCAG\|ATC | 1 | 1 | 12.405 |
| 122606093 | GT-AG | 0 | 1.000000099473604e-05 | 10697 | rna-XM_029539617.1 22607846 | 7 | 67556210 | 67566906 | Mus pahari 10093 | CAG\|GTGGGCCCCA...ATTGCTTCATTC/CATTGCTTCATT...CGTAG\|AAA | 2 | 1 | 15.273 |
| 122606094 | GT-AG | 0 | 1.000000099473604e-05 | 1214 | rna-XM_029539617.1 22607846 | 8 | 67567006 | 67568219 | Mus pahari 10093 | CGG\|GTAGGTGGCA...CAGACCCTGACA/CAGACCCTGACA...CACAG\|GCA | 2 | 1 | 16.383 |
| 122606095 | GT-AG | 0 | 1.8056202462737305e-05 | 23403 | rna-XM_029539617.1 22607846 | 9 | 67568456 | 67591858 | Mus pahari 10093 | TGG\|GTAAGCGTTT...TTTTCCATTGCT/ACTTTTTCCATT...TACAG\|CCA | 1 | 1 | 19.027 |
| 122606096 | GT-AG | 0 | 1.000000099473604e-05 | 6617 | rna-XM_029539617.1 22607846 | 10 | 67592030 | 67598646 | Mus pahari 10093 | AAG\|GTAGGGTTGC...ACGTTCTTGTCT/GTGTGGCTAACC...GGTAG\|GAA | 1 | 1 | 20.944 |
| 122606097 | GT-AG | 0 | 3.927285004121855e-05 | 1445 | rna-XM_029539617.1 22607846 | 11 | 67598862 | 67600306 | Mus pahari 10093 | CAG\|GTAACTGGAC...CTAACTTTACTC/TCGTTGCTAACT...TTTAG\|TTT | 0 | 1 | 23.353 |
| 122606098 | GT-AG | 0 | 1.000000099473604e-05 | 36053 | rna-XM_029539617.1 22607846 | 12 | 67600518 | 67636570 | Mus pahari 10093 | TTG\|GTAAGCAGCT...CATCTCTTCTCT/TCCATTTTCAGC...TCCAG\|AAT | 1 | 1 | 25.717 |
| 122606099 | GT-AG | 0 | 1.000000099473604e-05 | 1904 | rna-XM_029539617.1 22607846 | 13 | 67636673 | 67638576 | Mus pahari 10093 | GAG\|GTAAGCATTC...AGGCTTTTGGTA/AAGCCATTCACC...TTCAG\|CAT | 1 | 1 | 26.86 |
| 122606100 | GT-AG | 0 | 1.000000099473604e-05 | 3625 | rna-XM_029539617.1 22607846 | 14 | 67638772 | 67642396 | Mus pahari 10093 | AAG\|GTGAGAGTCA...TCTTCCTTCCCT/CCAATACTCACG...CACAG\|TGG | 1 | 1 | 29.045 |
| 122606101 | GT-AG | 0 | 1.000000099473604e-05 | 20998 | rna-XM_029539617.1 22607846 | 15 | 67642598 | 67663595 | Mus pahari 10093 | TTG\|GTAAGTACCA...CGCCCCTTGGCC/CTCTCTCCCACC...CGCAG\|AGA | 1 | 1 | 31.298 |
| 122606102 | GT-AG | 0 | 0.0002356947443782 | 7411 | rna-XM_029539617.1 22607846 | 16 | 67663782 | 67671192 | Mus pahari 10093 | TCG\|GTATGGCGGG...CCTACCTTAAAT/CTGTCTTTTATC...CCAAG\|CTC | 1 | 1 | 33.382 |
| 122606103 | GT-AG | 0 | 0.0022328742296724 | 6753 | rna-XM_029539617.1 22607846 | 17 | 67671220 | 67677972 | Mus pahari 10093 | AAG\|GTATGGTCTT...CAGTCCTTGACT/CAGTCCTTGACT...TGCAG\|AGG | 1 | 1 | 33.684 |
| 122606104 | GT-AG | 0 | 1.000000099473604e-05 | 11162 | rna-XM_029539617.1 22607846 | 18 | 67678120 | 67689281 | Mus pahari 10093 | GAG\|GTAGGGCTCT...ATAGCTGTATAA/ACATGGCTGATT...TGCAG\|ATG | 1 | 1 | 35.332 |
| 122606105 | GT-AG | 0 | 1.000000099473604e-05 | 14146 | rna-XM_029539617.1 22607846 | 19 | 67689499 | 67703644 | Mus pahari 10093 | AGG\|GTAAGTCCAT...ATGTTCTTCTCT/AGGTATTTCATG...CACAG\|GCA | 2 | 1 | 37.763 |
| 122606106 | GT-AG | 0 | 1.000000099473604e-05 | 5354 | rna-XM_029539617.1 22607846 | 20 | 67703765 | 67709118 | Mus pahari 10093 | CAG\|GTAGGAAACC...TTCATCTTATTT/CAGGATCTCATT...CACAG\|CTT | 2 | 1 | 39.108 |
| 122606107 | GT-AG | 0 | 0.0139060446158985 | 2444 | rna-XM_029539617.1 22607846 | 21 | 67709381 | 67711824 | Mus pahari 10093 | CAG\|GTACCCAGGC...ATATCCCTACTA/CCTACTATGATC...TGTAG\|GCT | 0 | 1 | 42.044 |
| 122606108 | GT-AG | 0 | 1.000000099473604e-05 | 2019 | rna-XM_029539617.1 22607846 | 22 | 67712093 | 67714111 | Mus pahari 10093 | TTG\|GTAAGTGCTC...TGCACTTTGCTC/ACTTTGCTCACT...TGTAG\|TTT | 1 | 1 | 45.047 |
| 122606109 | GT-AG | 0 | 1.000000099473604e-05 | 4313 | rna-XM_029539617.1 22607846 | 23 | 67714256 | 67718568 | Mus pahari 10093 | GTG\|GTGAGTGGGA...CCTCCCTTATGT/CCCTCCCTTATG...CCCAG\|GCA | 1 | 1 | 46.661 |
| 122606110 | GT-AG | 0 | 7.028993793840575e-05 | 1958 | rna-XM_029539617.1 22607846 | 24 | 67718819 | 67720776 | Mus pahari 10093 | GAG\|GTATGTAAGG...CATTCTTTCTTC/TAACACCTCATT...GTCAG\|AAA | 2 | 1 | 49.462 |
| 122606111 | GT-AG | 0 | 1.000000099473604e-05 | 8774 | rna-XM_029539617.1 22607846 | 25 | 67720798 | 67729571 | Mus pahari 10093 | TAG\|GTAAGCCAAG...GCATCTCTAGTT/AAAGTGCTCACC...TGCAG\|TCA | 2 | 1 | 49.697 |
| 122606112 | GT-AG | 0 | 1.000000099473604e-05 | 4235 | rna-XM_029539617.1 22607846 | 26 | 67729805 | 67734039 | Mus pahari 10093 | GGG\|GTAAGGCTTT...TTTGCCTTCTCC/CCAATGCTGAAT...CCCAG\|GAA | 1 | 1 | 52.308 |
| 122606113 | GT-AG | 0 | 1.000000099473604e-05 | 5752 | rna-XM_029539617.1 22607846 | 27 | 67734195 | 67739946 | Mus pahari 10093 | CAG\|GTGAGAAGAC...GGAATCTTGGCA/CTTGGCATCATT...GTCAG\|GTT | 0 | 1 | 54.045 |
| 122606114 | GT-AG | 0 | 4.623100746790962e-05 | 11151 | rna-XM_029539617.1 22607846 | 28 | 67740825 | 67751975 | Mus pahari 10093 | TGA\|GTAAGTACCA...ACCCCTTTGATT/CCGTCTCTAACA...TCTAG\|GTA | 2 | 1 | 63.884 |
| 122606115 | GT-AG | 0 | 1.000000099473604e-05 | 3069 | rna-XM_029539617.1 22607846 | 29 | 67752149 | 67755217 | Mus pahari 10093 | AAG\|GTAAGTCAGC...CTCCTCTTAAGC/CTCCTCTTAAGC...TTCAG\|ACC | 1 | 1 | 65.823 |
| 122606116 | GT-AG | 0 | 1.000000099473604e-05 | 3926 | rna-XM_029539617.1 22607846 | 30 | 67755454 | 67759379 | Mus pahari 10093 | CGG\|GTAAGTGGAA...TCTTTCTTATCC/CAGATTCTCACT...CCCAG\|GTT | 0 | 1 | 68.467 |
| 122606117 | GC-AG | 0 | 1.000000099473604e-05 | 1533 | rna-XM_029539617.1 22607846 | 31 | 67759677 | 67761209 | Mus pahari 10093 | AAG\|GCAGGTGTCT...ACCTCCTTCATC/ACCTCCTTCATC...TCCAG\|TCC | 0 | 1 | 71.795 |
| 122606118 | GT-AG | 0 | 1.000000099473604e-05 | 6370 | rna-XM_029539617.1 22607846 | 32 | 67762825 | 67769194 | Mus pahari 10093 | CAG\|GTGAGGATTG...GCTCACTTACTC/CATTGGCTCACT...CCCAG\|ATG | 1 | 1 | 89.892 |
| 122606119 | GT-AG | 0 | 1.000000099473604e-05 | 2806 | rna-XM_029539617.1 22607846 | 33 | 67769338 | 67772143 | Mus pahari 10093 | AAG\|GTAATGCTGC...TTTGTCTTACAC/CTTTGTCTTACA...ATCAG\|TCT | 0 | 1 | 91.495 |
| 122621578 | GT-AG | 0 | 1.000000099473604e-05 | 139064 | rna-XM_029539617.1 22607846 | 1 | 67067270 | 67206333 | Mus pahari 10093 | ACG\|GTAAGGCGGC...GTCTTCTTGTTT/TCTGGGCTCATT...TTCAG\|ATG |  | 0 | 3.059 |
| 122621579 | GT-AG | 0 | 1.4875578528013934e-05 | 81014 | rna-XM_029539617.1 22607846 | 2 | 67206390 | 67287403 | Mus pahari 10093 | TAG\|GTAAGTTATC...TCATGCTTAATC/TCATGCTTAATC...AACAG\|AAC |  | 0 | 3.687 |
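The listing above is sorted by id, which places introns 1 and 2 (ids 122621578 and 122621579) after intron 33. A minimal sketch of retrieving the same transcript's introns in transcript order follows. It uses Python's built-in sqlite3 module with an in-memory table holding only four of the rows and a subset of the columns; that cut-down table is an assumption made for illustration, not the real database file.

```python
import sqlite3

# In-memory stand-in for the introns table (subset of columns only).
conn = sqlite3.connect(":memory:")
conn.execute(
    'CREATE TABLE introns ("id" INTEGER PRIMARY KEY, "transcript_id" INTEGER, '
    '"ordinal_index" INTEGER, "start" INTEGER, "end" INTEGER)'
)

# Four rows copied from the table on this page.
conn.executemany(
    "INSERT INTO introns VALUES (?, ?, ?, ?, ?)",
    [
        (122606089, 22607846, 3, 67287503, 67405954),
        (122606090, 22607846, 4, 67406243, 67409565),
        (122621578, 22607846, 1, 67067270, 67206333),
        (122621579, 22607846, 2, 67206390, 67287403),
    ],
)

# Same filter as the page, but ordered by position within the transcript.
ordered = conn.execute(
    'SELECT "ordinal_index", "id", "start", "end" FROM introns '
    'WHERE "transcript_id" = ? ORDER BY "ordinal_index"',
    (22607846,),
).fetchall()

ordinals = [row[0] for row in ordered]
first_id = ordered[0][1]
```

Here `ordinals` comes back as 1 through 4 and `first_id` is 122621578, even though that row has one of the highest ids in the listing.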
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL
,PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
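Because transcript_id is indexed, a filter such as `transcript_id = 22607846` can be answered with an index lookup rather than a full table scan. The sketch below, using Python's built-in sqlite3 module, checks this with `EXPLAIN QUERY PLAN` on a cut-down copy of the schema (three columns and one index, with the foreign keys omitted); the exact wording of the plan text varies between SQLite versions, so only the index name is inspected.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Cut-down copy of the schema above: enough columns to exercise one index.
conn.executescript(
    """
    CREATE TABLE "introns" (
       "id" INTEGER,
       "transcript_id" INTEGER,
       "ordinal_index" INTEGER,
       PRIMARY KEY ([id])
    );
    CREATE INDEX [idx_introns_transcript_id]
        ON [introns] ([transcript_id]);
    """
)

# Ask SQLite how it would execute the page's filter.
plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT ordinal_index FROM introns WHERE transcript_id = ?",
    (22607846,),
).fetchall()

# The last column of each plan row is the human-readable detail string.
plan_text = " ".join(str(row[-1]) for row in plan)
uses_index = "idx_introns_transcript_id" in plan_text
print(plan_text)
```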