introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
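The relationship between start, end, and length can be checked directly against the rows listed for transcript_id = 24436975. The sketch below assumes 1-based inclusive coordinates, which is what those rows suggest rather than something the column descriptions state; the helper name intron_length is hypothetical.

```python
# (start, end, length) copied from the first three rows
# listed for transcript_id = 24436975.
rows = [
    (100319020, 100319300, 281),
    (100316589, 100318955, 2367),
    (100315293, 100316446, 1154),
]


def intron_length(start: int, end: int) -> int:
    """Length in base pairs, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


# True if every sampled row satisfies length == end - start + 1.
all_consistent = all(intron_length(s, e) == n for s, e, n in rows)
print(all_consistent)
```

If the check fails for other transcripts, the inclusive-coordinate assumption would need revisiting.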
27 rows where transcript_id = 24436975
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 133513156 | GT-AG | 0 | 0.000162001600065 | 281 | rna-XM_029784578.2 24436975 | 1 | 100319020 | 100319300 | Octopus sinensis 2607531 | TAG\|GTTTGTTATT...TTTATTTTATTT/TTTTATTTTATT...TTCAG\|TTG | 2 | 1 | 2.213 |
| 133513157 | GT-AG | 0 | 1.000000099473604e-05 | 2367 | rna-XM_029784578.2 24436975 | 2 | 100316589 | 100318955 | Octopus sinensis 2607531 | AAG\|GTAGGTTGTG...AGCCACTTGAAA/AATTCTGTTATA...TTCAG\|GGT | 0 | 1 | 3.92 |
| 133513158 | GT-AG | 0 | 1.000000099473604e-05 | 1154 | rna-XM_029784578.2 24436975 | 3 | 100315293 | 100316446 | Octopus sinensis 2607531 | AAG\|GTAAGTCATA...TATATTTTACTC/ATTTTACTCAAT...TTCAG\|CTC | 1 | 1 | 7.707 |
| 133513159 | GT-AG | 0 | 1.000000099473604e-05 | 13797 | rna-XM_029784578.2 24436975 | 4 | 100301446 | 100315242 | Octopus sinensis 2607531 | AAG\|GTAATTTAAT...TATTTTTTTGAT/AAAGTATTTATG...TTTAG\|GAT | 0 | 1 | 9.04 |
| 133513160 | GT-AG | 0 | 1.000000099473604e-05 | 83280 | rna-XM_029784578.2 24436975 | 5 | 100218107 | 100301386 | Octopus sinensis 2607531 | TTG\|GTGAGTAATT...AAATTCTTACAT/TTTTTGTTTACT...TACAG\|GGC | 2 | 1 | 10.613 |
| 133513161 | GT-AG | 0 | 1.000000099473604e-05 | 8152 | rna-XM_029784578.2 24436975 | 6 | 100209834 | 100217985 | Octopus sinensis 2607531 | AAG\|GTAATTATTG...ATGTTATTAAAT/ATGTTATTAAAT...GACAG\|CAT | 0 | 1 | 13.84 |
| 133513162 | GT-AG | 0 | 1.727330931339274e-05 | 1645 | rna-XM_029784578.2 24436975 | 7 | 100208134 | 100209778 | Octopus sinensis 2607531 | AAA\|GTAAGTTAAT...GCTGCTTTATTA/ATTATATTTACT...TCCAG\|TAC | 1 | 1 | 15.307 |
| 133513163 | GT-AG | 0 | 1.000000099473604e-05 | 3734 | rna-XM_029784578.2 24436975 | 8 | 100204365 | 100208098 | Octopus sinensis 2607531 | CAG\|GTTAGTTTTA...TAATTTTTAAAT/TAAATTTTTATT...TTTAG\|GGC | 0 | 1 | 16.24 |
| 133513164 | GT-AG | 0 | 1.3329720727662796e-05 | 1719 | rna-XM_029784578.2 24436975 | 9 | 100202552 | 100204270 | Octopus sinensis 2607531 | TAG\|GTAAAATATC...TATTTCTTAACT/ATAATTTTCACT...TCTAG\|GCT | 1 | 1 | 18.747 |
| 133513165 | GT-AG | 0 | 1.000000099473604e-05 | 824 | rna-XM_029784578.2 24436975 | 10 | 100201601 | 100202424 | Octopus sinensis 2607531 | CAG\|GTTGGTATTT...CCAACCTTATTT/CCACATTTCATT...TTCAG\|GTG | 2 | 1 | 22.133 |
| 133513166 | GT-AG | 0 | 1.000000099473604e-05 | 1645 | rna-XM_029784578.2 24436975 | 11 | 100199857 | 100201501 | Octopus sinensis 2607531 | AGG\|GTAAGTAATT...TTTTCTTTCTCT/TTATTTGTTACT...TTTAG\|AGT | 2 | 1 | 24.773 |
| 133513167 | GT-AG | 0 | 0.0194190457137234 | 666 | rna-XM_029784578.2 24436975 | 12 | 100199018 | 100199683 | Octopus sinensis 2607531 | AAG\|GTATTTTTGC...TATTTCTGAATT/ATCGTTTTCATC...TGCAG\|CTT | 1 | 1 | 29.387 |
| 133513168 | GT-AG | 0 | 1.000000099473604e-05 | 2453 | rna-XM_029784578.2 24436975 | 13 | 100196480 | 100198932 | Octopus sinensis 2607531 | CAG\|GTAGGGGAAC...CCATTTTTAACT/CCATTTTTAACT...TCCAG\|ACC | 2 | 1 | 31.653 |
| 133513169 | GT-AG | 0 | 1.000000099473604e-05 | 3724 | rna-XM_029784578.2 24436975 | 14 | 100192659 | 100196382 | Octopus sinensis 2607531 | CAG\|GTTGGTAATT...TGAATTTTATTG/TTTATTTTCATT...CACAG\|CAA | 0 | 1 | 34.24 |
| 133513170 | GT-AG | 0 | 3.770385453231293e-05 | 3856 | rna-XM_029784578.2 24436975 | 15 | 100188641 | 100192496 | Octopus sinensis 2607531 | AGG\|GTAAATTATC...TTCATCTTCATC/TTCATCTTCATC...AACAG\|ATC | 0 | 1 | 38.56 |
| 133513171 | GT-AG | 0 | 1.000000099473604e-05 | 2394 | rna-XM_029784578.2 24436975 | 16 | 100185709 | 100188102 | Octopus sinensis 2607531 | TAG\|GTAAGTAAAT...TTTGCTTTATTT/GTTTGCTTTATT...TATAG\|GTA | 1 | 1 | 52.907 |
| 133513172 | GT-AG | 0 | 1.000000099473604e-05 | 994 | rna-XM_029784578.2 24436975 | 17 | 100184602 | 100185595 | Octopus sinensis 2607531 | TAT\|GTAAGGCTTC...TTAGCCTTATAT/TTTTGTTTCATT...TATAG\|TTG | 0 | 1 | 55.92 |
| 133513173 | GT-AG | 0 | 1.000000099473604e-05 | 1321 | rna-XM_029784578.2 24436975 | 18 | 100183181 | 100184501 | Octopus sinensis 2607531 | TAG\|GTAAGTCAAA...TTTGCTGTAATA/TAAGGATTTATA...TTTAG\|ATC | 1 | 1 | 58.587 |
| 133513174 | GT-AG | 0 | 1.000000099473604e-05 | 444 | rna-XM_029784578.2 24436975 | 19 | 100182666 | 100183109 | Octopus sinensis 2607531 | CAA\|GTAAGAGATT...TTTCTCTTATAT/TTTTCTCTTATA...CTTAG\|CGT | 0 | 1 | 60.48 |
| 133513175 | GT-AG | 0 | 1.000000099473604e-05 | 2973 | rna-XM_029784578.2 24436975 | 20 | 100179561 | 100182533 | Octopus sinensis 2607531 | AAG\|GTAAGATTTA...TATCCCTTCTGT/AAAAAATTGATT...TCCAG\|GAA | 0 | 1 | 64.0 |
| 133513176 | GT-AG | 0 | 1.000000099473604e-05 | 907 | rna-XM_029784578.2 24436975 | 21 | 100178580 | 100179486 | Octopus sinensis 2607531 | ACC\|GTAAGTGAAA...TTTTTCTTCTTT/TAAAATTTAAAA...AACAG\|AAA | 2 | 1 | 65.973 |
| 133513177 | GT-AG | 0 | 0.0003043385094258 | 1087 | rna-XM_029784578.2 24436975 | 22 | 100177390 | 100178476 | Octopus sinensis 2607531 | AGG\|GTATGTAATT...ATTTTCTCAATT/AATTTTCTCAAT...TGTAG\|GTT | 0 | 1 | 68.72 |
| 133513178 | GT-AG | 0 | 0.0008837749283875 | 1822 | rna-XM_029784578.2 24436975 | 23 | 100175468 | 100177289 | Octopus sinensis 2607531 | CAG\|GTAAGCTTCA...TTTTTCTTATTT/TTTTTTCTTATT...CTAAG\|GTC | 1 | 1 | 71.387 |
| 133513179 | GT-AG | 0 | 9.446336817624206e-05 | 258 | rna-XM_029784578.2 24436975 | 24 | 100175095 | 100175352 | Octopus sinensis 2607531 | AAA\|GTAAGTTGGT...TACTCTTTATCA/ACTTTTTTTATA...TATAG\|AAA | 2 | 1 | 74.453 |
| 133513180 | GT-AG | 0 | 1.32745622907142e-05 | 7527 | rna-XM_029784578.2 24436975 | 25 | 100167423 | 100174949 | Octopus sinensis 2607531 | ATT\|GTAAGTATCA...TTATCAATAATT/ATGGTTATCAAT...TCCAG\|GGC | 0 | 1 | 78.32 |
| 133513181 | GT-AG | 0 | 1.2989324504571408e-05 | 2495 | rna-XM_029784578.2 24436975 | 26 | 100164743 | 100167237 | Octopus sinensis 2607531 | CAA\|GTAAGTCTTA...GTGCTTTTATAT/GTTATATTTATA...TATAG\|GTC | 2 | 1 | 83.253 |
| 133513182 | GT-AG | 0 | 2.4030479652076706e-05 | 1674 | rna-XM_029784578.2 24436975 | 27 | 100162942 | 100164615 | Octopus sinensis 2607531 | GAG\|GTGTGTATTG...TTTTTTTTATTT/TTTTTTTTTATT...TGTAG\|GAA | 0 | 1 | 86.64 |
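A filter such as transcript_id = 24436975 can be reproduced against a local SQLite copy of the table. The sketch below is a minimal illustration only: it builds a reduced in-memory introns table with a subset of the columns, loads three rows from the listing plus one made-up row for another transcript, and uses a hypothetical helper introns_for_transcript. The column names start, end, and length are quoted because END is an SQL keyword.

```python
import sqlite3


def introns_for_transcript(conn, transcript_id):
    """Return the introns of one transcript, ordered by ordinal position."""
    sql = (
        'SELECT "id", "ordinal_index", "start", "end", "length", "phase" '
        'FROM "introns" WHERE "transcript_id" = ? '
        'ORDER BY "ordinal_index"'
    )
    return conn.execute(sql, (transcript_id,)).fetchall()


# Reduced, in-memory stand-in for the real table (assumption: subset of columns).
conn = sqlite3.connect(":memory:")
conn.execute(
    'CREATE TABLE "introns" ('
    '"id" INTEGER PRIMARY KEY, "transcript_id" INTEGER, '
    '"ordinal_index" INTEGER, "start" INTEGER, "end" INTEGER, '
    '"length" INTEGER, "phase" INTEGER)'
)
conn.executemany(
    'INSERT INTO "introns" VALUES (?, ?, ?, ?, ?, ?, ?)',
    [
        (133513156, 24436975, 1, 100319020, 100319300, 281, 2),
        (133513157, 24436975, 2, 100316589, 100318955, 2367, 0),
        (133513158, 24436975, 3, 100315293, 100316446, 1154, 1),
        (1, 99, 1, 10, 20, 11, 0),  # hypothetical row for a different transcript
    ],
)

rows = introns_for_transcript(conn, 24436975)
print([r[1] for r in rows])  # ordinal_index values of the matching rows
```

Passing the transcript id as a bound parameter rather than formatting it into the SQL string keeps the query safe to reuse with arbitrary input.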
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
    ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
    ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
    ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
    ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
    ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
    ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
    ON [introns] ([in_cds]);
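To see how one of these indexes affects a lookup, SQLite's EXPLAIN QUERY PLAN can be run against a scratch database. The sketch below is an illustration under assumptions: it recreates only a reduced introns table and the idx_introns_transcript_id index in memory, so the plan reported for the full database may differ.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Reduced stand-in schema (assumption: only the columns needed for the demo).
conn.executescript(
    """
    CREATE TABLE "introns" (
      "id" INTEGER PRIMARY KEY,
      "transcript_id" INTEGER,
      "phase" INTEGER
    );
    CREATE INDEX [idx_introns_transcript_id]
        ON [introns] ([transcript_id]);
    """
)

# Each plan row is (id, parent, notused, detail); the last field is the text.
plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT * FROM introns WHERE transcript_id = ?",
    (24436975,),
).fetchall()
plan_text = " ".join(row[3] for row in plan)
print(plan_text)
```

The exact wording of the plan varies between SQLite versions, so it is safer to look for the index name in the output than to compare the full string.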