introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, probability (0-100%) that the intron is minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
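As a rough illustration of how these columns might be queried, the sketch below builds a small in-memory SQLite table with a subset of the columns and selects the introns of one transcript in transcript order. The helper name `introns_for_transcript` and the final row (transcript 42) are hypothetical; the other three rows are copied from this page's listing.

```python
import sqlite3

# In-memory stand-in for the real database; only a subset of columns is kept.
conn = sqlite3.connect(":memory:")
conn.execute(
    """
    CREATE TABLE introns (
        id INTEGER PRIMARY KEY,
        dinucleotide_pair TEXT,
        is_minor INTEGER,
        length INTEGER,
        transcript_id INTEGER,
        ordinal_index INTEGER
    )
    """
)
conn.executemany(
    "INSERT INTO introns VALUES (?, ?, ?, ?, ?, ?)",
    [
        (101757079, "GT-AG", 0, 35217, 19079944, 1),
        (101757080, "GT-AG", 0, 4458, 19079944, 2),
        (101757081, "GT-AG", 0, 6886, 19079944, 3),
        (1, "GT-AG", 0, 100, 42, 1),  # hypothetical row, not from the database
    ],
)


def introns_for_transcript(conn, transcript_id):
    """Return (id, ordinal_index, length) for one transcript, in intron order."""
    cur = conn.execute(
        "SELECT id, ordinal_index, length FROM introns "
        "WHERE transcript_id = ? ORDER BY ordinal_index",
        (transcript_id,),
    )
    return cur.fetchall()


rows = introns_for_transcript(conn, 19079944)
```

Passing the transcript identifier as a bound parameter (`?`) rather than formatting it into the SQL string keeps the query safe to reuse with arbitrary input.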
21 rows where transcript_id = 19079944
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 101757079 | GT-AG | 0 | 1.000000099473604e-05 | 35217 | rna-XM_042872170.1 19079944 | 1 | 11290918 | 11326134 | Lagopus leucura 30410 | AAG\|GTGCGCGGCG...TTTTTCTTCTTT/AGATTACTGACT...TTTAG\|ACA | 0 | 1 | 3.93 |
| 101757080 | GT-AG | 0 | 1.000000099473604e-05 | 4458 | rna-XM_042872170.1 19079944 | 2 | 11326250 | 11330707 | Lagopus leucura 30410 | CAG\|GTGAGAGAGG...TTTCTCTTTTCT/TATTTTCTAATA...CTTAG\|GTG | 1 | 1 | 8.115 |
| 101757081 | GT-AG | 0 | 1.000000099473604e-05 | 6886 | rna-XM_042872170.1 19079944 | 3 | 11330828 | 11337713 | Lagopus leucura 30410 | AAG\|GTAAAGAAAC...TATTTCTTAATT/TATTTCTTAATT...CCTAG\|AAA | 1 | 1 | 12.482 |
| 101757082 | GT-AG | 0 | 1.000000099473604e-05 | 640 | rna-XM_042872170.1 19079944 | 4 | 11337801 | 11338440 | Lagopus leucura 30410 | AAG\|GTAAATCTCA...CATTTTTTCTTT/AATGGATTGATT...ATTAG\|AGA | 1 | 1 | 15.648 |
| 101757083 | GT-AG | 0 | 1.000000099473604e-05 | 717 | rna-XM_042872170.1 19079944 | 5 | 11338529 | 11339245 | Lagopus leucura 30410 | CAG\|GTGAGAACAG...TGAGCCTTCTCA/AGCCTTCTCATT...CACAG\|TTG | 2 | 1 | 18.85 |
| 101757084 | GT-AG | 0 | 1.000000099473604e-05 | 5136 | rna-XM_042872170.1 19079944 | 6 | 11339325 | 11344460 | Lagopus leucura 30410 | AAG\|GTACAGCTCA...TCTGTTTTTATT/TCTGTTTTTATT...CTCAG\|GAT | 0 | 1 | 21.725 |
| 101757085 | GT-AG | 0 | 1.000000099473604e-05 | 1615 | rna-XM_042872170.1 19079944 | 7 | 11344543 | 11346157 | Lagopus leucura 30410 | AAG\|GTAAAGTACA...ATGCTCTTAACG/CATCAGTTGATT...GACAG\|ACA | 1 | 1 | 24.709 |
| 101757086 | GT-AG | 0 | 0.0035595765132257 | 6500 | rna-XM_042872170.1 19079944 | 8 | 11346244 | 11352743 | Lagopus leucura 30410 | AAG\|GTATCGCAGG...TTTTTCTTCTCT/AACTTTTTCACA...GGCAG\|GCG | 0 | 1 | 27.838 |
| 101757087 | GT-AG | 0 | 0.0001466901658311 | 1075 | rna-XM_042872170.1 19079944 | 9 | 11352837 | 11353911 | Lagopus leucura 30410 | GAT\|GTAGGTGTCT...GTAGTTTTAACA/CTTCTTGTCATT...TTCAG\|CTG | 0 | 1 | 31.223 |
| 101757088 | GT-AG | 0 | 1.000000099473604e-05 | 530 | rna-XM_042872170.1 19079944 | 10 | 11354143 | 11354672 | Lagopus leucura 30410 | TCT\|GTGAGTGGGT...TCTACCCTAACT/TAACTTCTCACC...TCTAG\|GAT | 0 | 1 | 39.629 |
| 101757089 | GT-AG | 0 | 1.000000099473604e-05 | 554 | rna-XM_042872170.1 19079944 | 11 | 11354821 | 11355374 | Lagopus leucura 30410 | ATG\|GTAAATGCCA...CAGTCCTTCATT/CAGTCCTTCATT...CCCAG\|GTA | 1 | 1 | 45.015 |
| 101757090 | GT-AG | 0 | 1.2609557733691685e-05 | 2334 | rna-XM_042872170.1 19079944 | 12 | 11355526 | 11357859 | Lagopus leucura 30410 | GAA\|GTACGGCACC...AGGCCTTTATTT/TGTTCTTTTACT...CATAG\|TGA | 2 | 1 | 50.509 |
| 101757091 | GT-AG | 0 | 1.000000099473604e-05 | 467 | rna-XM_042872170.1 19079944 | 13 | 11357969 | 11358435 | Lagopus leucura 30410 | GAG\|GTGAGTCATA...TGATTTTTATTC/GTGATTTTTATT...CTTAG\|AGG | 0 | 1 | 54.476 |
| 101757092 | GT-AG | 0 | 1.000000099473604e-05 | 2254 | rna-XM_042872170.1 19079944 | 14 | 11358577 | 11360830 | Lagopus leucura 30410 | CAG\|GTACAGAGCA...CTCTTTTTGCTG/CAGTGACTAATG...CACAG\|GGA | 0 | 1 | 59.607 |
| 101757093 | GT-AG | 0 | 1.000000099473604e-05 | 861 | rna-XM_042872170.1 19079944 | 15 | 11361014 | 11361874 | Lagopus leucura 30410 | CAT\|GTGAGTTCTT...GGTTTCATATAT/TTTGGTTTCATA...TCAAG\|GTG | 0 | 1 | 66.266 |
| 101757094 | GT-AG | 0 | 1.851360047183088e-05 | 875 | rna-XM_042872170.1 19079944 | 16 | 11362010 | 11362884 | Lagopus leucura 30410 | GAG\|GTATGGCTTT...TTCCTTCTAACA/TTCCTTCTAACA...TGCAG\|GAG | 0 | 1 | 71.179 |
| 101757095 | GT-AG | 0 | 1.176867949483083e-05 | 939 | rna-XM_042872170.1 19079944 | 17 | 11363065 | 11364003 | Lagopus leucura 30410 | GAG\|GTAAATGTTT...ATTGTCTTATTC/AATTGTCTTATT...TTCAG\|CAT | 0 | 1 | 77.729 |
| 101757096 | GT-AG | 0 | 1.000000099473604e-05 | 1950 | rna-XM_042872170.1 19079944 | 18 | 11364151 | 11366100 | Lagopus leucura 30410 | AGG\|GTAAGTCAGT...CATGTCCTGCTG/CGTCACCCCACT...ACCAG\|GAC | 0 | 1 | 83.079 |
| 101757097 | GT-AG | 0 | 1.000000099473604e-05 | 539 | rna-XM_042872170.1 19079944 | 19 | 11366182 | 11366720 | Lagopus leucura 30410 | CAG\|GTACTGAATG...TCTTCCTTTTCT/CTTTGCATCACT...TGTAG\|AAG | 0 | 1 | 86.026 |
| 101757098 | GT-AG | 0 | 1.000000099473604e-05 | 1466 | rna-XM_042872170.1 19079944 | 20 | 11366787 | 11368252 | Lagopus leucura 30410 | ATG\|GTAATGTAGA...ACTCTCTTGTTC/TCCAGGCTTACA...TCCAG\|GGA | 0 | 1 | 88.428 |
| 101757099 | GT-AG | 0 | 1.000000099473604e-05 | 1883 | rna-XM_042872170.1 19079944 | 21 | 11368412 | 11370294 | Lagopus leucura 30410 | AAG\|GTGAGGCTCT...TCTGCCTTATCC/GTCTGCCTTATC...TGCAG\|AAC | 0 | 1 | 94.214 |
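The rows above appear to use inclusive coordinates, so that `length` equals `end - start + 1`. This is an observation from the listed values, not something the column descriptions state. The sketch below checks it against the first four rows and, on the assumption that the sequence between two consecutive introns of one transcript is a single exon, derives the sizes of those gaps. The function names are hypothetical.

```python
# (start, end, length) copied from the first four rows of the table above.
ROWS = [
    (11290918, 11326134, 35217),
    (11326250, 11330707, 4458),
    (11330828, 11337713, 6886),
    (11337801, 11338440, 640),
]


def inclusive_length(start, end):
    """Length of an interval whose start and end positions are both included."""
    return end - start + 1


def gaps_between(rows):
    """Number of bases strictly between each intron and the next one."""
    return [
        next_start - prev_end - 1
        for (_, prev_end, _), (next_start, _, _) in zip(rows, rows[1:])
    ]


# Rows whose recorded length disagrees with the inclusive-coordinate formula.
mismatches = [r for r in ROWS if inclusive_length(r[0], r[1]) != r[2]]
gaps = gaps_between(ROWS)
```

For these four rows `mismatches` is empty and `gaps` is `[115, 120, 87]`.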
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY ([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY ([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
    ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
    ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
    ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
    ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
    ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
    ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
    ON [introns] ([in_cds]);
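One way to see what these indexes buy is to ask SQLite for its query plan. The sketch below is a minimal illustration on an empty in-memory table that keeps only three columns and one of the indexes above. The helper `query_plan` is hypothetical, and the exact wording of the plan text varies between SQLite versions, so the code only inspects it for the index name.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Reduced copy of the schema: three columns and one index from the listing.
conn.executescript(
    """
    CREATE TABLE introns (
        id INTEGER PRIMARY KEY,
        transcript_id INTEGER,
        length INTEGER
    );
    CREATE INDEX idx_introns_transcript_id ON introns (transcript_id);
    """
)


def query_plan(conn, sql, params=()):
    """Return the human-readable detail column of EXPLAIN QUERY PLAN."""
    cur = conn.execute("EXPLAIN QUERY PLAN " + sql, params)
    return [row[-1] for row in cur]  # the detail text is the last column


# Filter on an indexed column: the plan should mention the index by name.
indexed = query_plan(
    conn, "SELECT * FROM introns WHERE transcript_id = ?", (19079944,)
)
# Filter on length, which has no index in the schema: expect a full scan.
unindexed = query_plan(conn, "SELECT * FROM introns WHERE length > ?", (1000,))

uses_index = any("idx_introns_transcript_id" in detail for detail in indexed)
```

Because `length`, `start`, `end`, and `relative_position` are not indexed in the schema above, filters on those columns alone would likely fall back to scanning the whole table.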