introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
24 rows where transcript_id = 19079851
This data as json, CSV (advanced)
Suggested facets: dinucleotide_pair, score, phase, in_cds
| id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position | 
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 101754903 | GT-AG | 0 | 1.000000099473604e-05 | 4997 | rna-XM_042865544.1 19079851 | 2 | 8071485 | 8076481 | Lagopus leucura 30410 | GAG|GTGGGTTAGC...TTTTCCTTCCTG/GGAGATTTTATG...CCCAG|TCC | 2 | 1 | 11.717 | 
| 101754904 | GT-AG | 0 | 1.000000099473604e-05 | 305 | rna-XM_042865544.1 19079851 | 3 | 8077416 | 8077720 | Lagopus leucura 30410 | AAG|GTAAAACCAA...GGCTCTTTATGT/ATATTATTCAAG...CGTAG|GAA | 0 | 1 | 21.266 | 
| 101754905 | GT-AG | 0 | 0.0001138711295947 | 137 | rna-XM_042865544.1 19079851 | 4 | 8077809 | 8077945 | Lagopus leucura 30410 | CAG|GTAATTTTGG...TACTTTTTATAT/ATACTTTTTATA...AGAAG|ATT | 1 | 1 | 22.165 | 
| 101754906 | GT-AG | 0 | 1.000000099473604e-05 | 4489 | rna-XM_042865544.1 19079851 | 5 | 8078042 | 8082530 | Lagopus leucura 30410 | AAG|GTAAGTGTGC...TTCTCATTAACA/TTTTTTCTCATT...TGCAG|TGG | 1 | 1 | 23.147 | 
| 101754907 | GT-AG | 0 | 1.000000099473604e-05 | 757 | rna-XM_042865544.1 19079851 | 6 | 8082660 | 8083416 | Lagopus leucura 30410 | CAG|GTGAGTAGAA...TCAGTCTTGATT/TCTTGATTTATA...TCCAG|ATA | 1 | 1 | 24.466 | 
| 101754908 | GT-AG | 0 | 1.000000099473604e-05 | 645 | rna-XM_042865544.1 19079851 | 7 | 8083516 | 8084160 | Lagopus leucura 30410 | AAG|GTAGAAAAAG...ATGCTTGTAATT/ATGCTTGTAATT...TTTAG|GTG | 1 | 1 | 25.478 | 
| 101754909 | GT-AG | 0 | 1.000000099473604e-05 | 650 | rna-XM_042865544.1 19079851 | 8 | 8084811 | 8085460 | Lagopus leucura 30410 | CAG|GTAGGTAGCA...GTTTACTTACTA/TGTTTACTTACT...TTCAG|GCT | 0 | 1 | 32.124 | 
| 101754910 | GT-AG | 0 | 1.000000099473604e-05 | 969 | rna-XM_042865544.1 19079851 | 9 | 8085819 | 8086787 | Lagopus leucura 30410 | ACA|GTGAGTATTG...GGGTTTTTAAAA/AAATATATCACT...CCCAG|CTA | 1 | 1 | 35.784 | 
| 101754911 | GT-AG | 0 | 1.000000099473604e-05 | 973 | rna-XM_042865544.1 19079851 | 10 | 8087012 | 8087984 | Lagopus leucura 30410 | GTG|GTAAGAAGTG...GTATTTTTTTTG/GTTTTATAGATC...TGTAG|ATT | 0 | 1 | 38.074 | 
| 101754912 | GT-AG | 0 | 1.000000099473604e-05 | 401 | rna-XM_042865544.1 19079851 | 11 | 8088126 | 8088526 | Lagopus leucura 30410 | AAG|GTAAGAATTT...CTTTTTTTCTTC/ATTTTGTACACT...CACAG|ATG | 0 | 1 | 39.515 | 
| 101754913 | GT-AG | 0 | 1.000000099473604e-05 | 1349 | rna-XM_042865544.1 19079851 | 12 | 8088660 | 8090008 | Lagopus leucura 30410 | GAG|GTAAGAGCTG...CCTATTTCAATT/ACCTATTTCAAT...CTCAG|AGA | 1 | 1 | 40.875 | 
| 101754914 | GT-AG | 0 | 1.000000099473604e-05 | 1444 | rna-XM_042865544.1 19079851 | 13 | 8090530 | 8091973 | Lagopus leucura 30410 | CAG|GTGGGTCCCT...TTTCCCTTTTCC/ATTCTTTTCACA...TTCAG|GTT | 0 | 1 | 46.202 | 
| 101754915 | GT-AG | 0 | 1.5345804828134035e-05 | 1694 | rna-XM_042865544.1 19079851 | 14 | 8092260 | 8093953 | Lagopus leucura 30410 | TAG|GTAGGTTGGA...AAAATTTTGACC/CAAGTTTTTATT...TGCAG|CTG | 1 | 1 | 49.126 | 
| 101754916 | GT-AG | 0 | 1.000000099473604e-05 | 520 | rna-XM_042865544.1 19079851 | 15 | 8094584 | 8095103 | Lagopus leucura 30410 | GAG|GTAAGACTTG...GTTTTCTTAACC/GTTTTCTTAACC...TTTAG|AAG | 1 | 1 | 55.567 | 
| 101754917 | GT-AG | 0 | 1.000000099473604e-05 | 922 | rna-XM_042865544.1 19079851 | 16 | 8095425 | 8096346 | Lagopus leucura 30410 | CAG|GTACAAAGTC...ATTTTCTTTCCT/TATAAGCTGATA...CACAG|GAT | 1 | 1 | 58.849 | 
| 101754918 | GT-TG | 0 | 2.3039397035894637e-05 | 7889 | rna-XM_042865544.1 19079851 | 17 | 8098035 | 8105923 | Lagopus leucura 30410 | GTG|GTAAGTATTT...TCATTTTTAAAA/TAGATACTCATT...AACTG|TTC | 0 | 1 | 76.107 | 
| 101754919 | GT-AG | 0 | 0.0001103422052789 | 1326 | rna-XM_042865544.1 19079851 | 18 | 8106055 | 8107380 | Lagopus leucura 30410 | GAA|GTAAGCATCC...AACTTTCTACCT/CATGAGATGAGT...TCCAG|ATA | 2 | 1 | 77.446 | 
| 101754920 | GT-AG | 0 | 0.0002111913157154 | 1584 | rna-XM_042865544.1 19079851 | 19 | 8107433 | 8109016 | Lagopus leucura 30410 | AAG|GTATATGCTC...ATTTCCTTTTTT/CTTTTTTTCATT...TCCAG|CAG | 0 | 1 | 77.978 | 
| 101754921 | GT-AG | 0 | 1.000000099473604e-05 | 1125 | rna-XM_042865544.1 19079851 | 20 | 8109203 | 8110327 | Lagopus leucura 30410 | CAA|GTGAGTATTT...GTGTTCTTCACT/GTGTTCTTCACT...TCTAG|GCC | 0 | 1 | 79.879 | 
| 101754922 | GT-AG | 0 | 1.000000099473604e-05 | 1102 | rna-XM_042865544.1 19079851 | 21 | 8110440 | 8111541 | Lagopus leucura 30410 | CAG|GTGAGGATGT...GAGTCTTTGGAA/CATAGATTAACT...TATAG|GGA | 1 | 1 | 81.024 | 
| 101754923 | GT-AG | 0 | 1.000000099473604e-05 | 324 | rna-XM_042865544.1 19079851 | 22 | 8111776 | 8112099 | Lagopus leucura 30410 | CAG|GTGGGTTAAA...TTGCCATTACAG/AAATAGTTGATC...TACAG|AGA | 1 | 1 | 83.417 | 
| 101754924 | GT-AG | 0 | 1.000000099473604e-05 | 437 | rna-XM_042865544.1 19079851 | 23 | 8112277 | 8112713 | Lagopus leucura 30410 | CAG|GTACGGTCAT...TTCCCCTTTTTT/TTTCCACTGAAT...GTAAG|GTA | 1 | 1 | 85.226 | 
| 101754925 | GT-AG | 0 | 0.0001924690182361 | 684 | rna-XM_042865544.1 19079851 | 24 | 8112867 | 8113550 | Lagopus leucura 30410 | CAG|GTAGGCTCTT...TTGCCGTTAGTG/CTTGCCGTTAGT...TGCAG|AAA | 1 | 1 | 86.791 | 
| 101760709 | GT-AG | 0 | 0.0042532179663065 | 3029 | rna-XM_042865544.1 19079851 | 1 | 8067317 | 8070345 | Lagopus leucura 30410 | ACG|GTAACTACCG...AGAACTTTAACT/CTTTAACTGATT...TCCAG|AAT | 0 | 0.869 | 
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" (
"id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL
  ,PRIMARY KEY ([id]),
   FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
   FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
    ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
    ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
    ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
    ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
    ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
    ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
    ON [introns] ([in_cds]);