introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
27 rows where transcript_id = 24436973
This data as json, CSV (advanced)
Suggested facets: score, phase
| id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position | 
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 133513111 | GT-AG | 0 | 0.0009778549923812 | 15070 | rna-XM_029780102.2 24436973 | 1 | 100803337 | 100818406 | Octopus sinensis 2607531 | CAG|GTAACTTATT...CTAAATTTGATT/CTAAATTTGATT...TTCAG|TTT | 1 | 1 | 1.293 | 
| 133513112 | GT-AG | 0 | 1.000000099473604e-05 | 8705 | rna-XM_029780102.2 24436973 | 2 | 100818505 | 100827209 | Octopus sinensis 2607531 | TTG|GTAAGTAGAG...ATATTCTTAGAT/ATCCATTTCACT...TTCAG|TTC | 0 | 1 | 3.88 | 
| 133513113 | GT-AG | 0 | 1.000000099473604e-05 | 2324 | rna-XM_029780102.2 24436973 | 3 | 100827356 | 100829679 | Octopus sinensis 2607531 | CAG|GTAAGTAAAT...ATTTCCATACAA/ATATTACTGATA...GGCAG|AAC | 2 | 1 | 7.733 | 
| 133513114 | GT-AG | 0 | 1.000000099473604e-05 | 4712 | rna-XM_029780102.2 24436973 | 4 | 100829757 | 100834468 | Octopus sinensis 2607531 | TTG|GTAAGGTTAT...CATGCATTGAAA/CATGCATTGAAA...TTTAG|GAC | 1 | 1 | 9.765 | 
| 133513115 | GT-AG | 0 | 0.0001178427342832 | 374 | rna-XM_029780102.2 24436973 | 5 | 100834641 | 100835014 | Octopus sinensis 2607531 | GGA|GTAAGTATTT...TATTTCTTCACA/ATATTTTTCAAT...CCTAG|TGC | 2 | 1 | 14.305 | 
| 133513116 | GT-AG | 0 | 4.971752285889158e-05 | 1335 | rna-XM_029780102.2 24436973 | 6 | 100835187 | 100836521 | Octopus sinensis 2607531 | AGA|GTAAGTATAA...ATTATTTTAACT/ATTATTTTAACT...ATTAG|GTT | 0 | 1 | 18.844 | 
| 133513117 | GT-AG | 0 | 0.0002131252462879 | 3786 | rna-XM_029780102.2 24436973 | 7 | 100836647 | 100840432 | Octopus sinensis 2607531 | GCG|GTAAGTTGTT...GAATTTTTAACA/ATATATTTAATT...TCCAG|ATA | 2 | 1 | 22.143 | 
| 133513118 | GT-AG | 0 | 1.000000099473604e-05 | 16910 | rna-XM_029780102.2 24436973 | 8 | 100840605 | 100857514 | Octopus sinensis 2607531 | ATT|GTAAGTAATA...TTTCTATTGATG/TTATGATTTACT...TTCAG|GCT | 0 | 1 | 26.683 | 
| 133513119 | GT-AG | 0 | 1.607093333558932e-05 | 3137 | rna-XM_029780102.2 24436973 | 9 | 100857629 | 100860765 | Octopus sinensis 2607531 | ACG|GTAAGCAGAA...TTGCTCTTAAAC/CTTAAACTAATC...TTCAG|GAA | 0 | 1 | 29.691 | 
| 133513120 | GT-AG | 0 | 0.0001677263531609 | 3503 | rna-XM_029780102.2 24436973 | 10 | 100860877 | 100864379 | Octopus sinensis 2607531 | CCC|GTAAGTATGC...ATATCTTTATCT/CTTTATCTTACT...TTTAG|GTT | 0 | 1 | 32.621 | 
| 133513121 | GT-AG | 0 | 1.000000099473604e-05 | 5510 | rna-XM_029780102.2 24436973 | 11 | 100864506 | 100870015 | Octopus sinensis 2607531 | GAG|GTAAGTTAAC...AACCGTTTAATA/CGTTTAATAATA...TTCAG|GTC | 0 | 1 | 35.946 | 
| 133513122 | GT-AG | 0 | 0.0002903070757868 | 2235 | rna-XM_029780102.2 24436973 | 12 | 100870220 | 100872454 | Octopus sinensis 2607531 | GGG|GTAAACAATT...AATTTTTTAATT/AATTTTTTAATT...TCTAG|AAA | 0 | 1 | 41.33 | 
| 133513123 | GT-AG | 0 | 1.547375462946422e-05 | 149 | rna-XM_029780102.2 24436973 | 13 | 100872626 | 100872774 | Octopus sinensis 2607531 | AAG|GTACGTGTTG...TTTCTCTAAATA/CTTTCTCTAAAT...TACAG|GTC | 0 | 1 | 45.843 | 
| 133513124 | GT-AG | 0 | 1.000000099473604e-05 | 13292 | rna-XM_029780102.2 24436973 | 14 | 100872937 | 100886228 | Octopus sinensis 2607531 | CAG|GTAAATTATT...AATAATTTATTC/GAATAATTTATT...TTCAG|AGA | 0 | 1 | 50.119 | 
| 133513125 | GT-AG | 0 | 1.000000099473604e-05 | 10443 | rna-XM_029780102.2 24436973 | 15 | 100886269 | 100896711 | Octopus sinensis 2607531 | ACA|GTAAGTAATA...ACAAATTTAATT/ACAAATTTAATT...TACAG|CAA | 1 | 1 | 51.174 | 
| 133513126 | GT-AG | 0 | 1.000000099473604e-05 | 3189 | rna-XM_029780102.2 24436973 | 16 | 100896813 | 100900001 | Octopus sinensis 2607531 | GAG|GTAGGGCATA...TCATCTTTGAAG/TGAAGTTTAACA...TCCAG|GAA | 0 | 1 | 53.84 | 
| 133513127 | GT-AG | 0 | 1.719848453009824e-05 | 7525 | rna-XM_029780102.2 24436973 | 17 | 100900149 | 100907673 | Octopus sinensis 2607531 | GGC|GTAAGTAAAA...AACTTTTTATCT/TTTATCTTTATA...TTCAG|GCT | 0 | 1 | 57.72 | 
| 133513128 | GT-AG | 0 | 3.2112593293802616e-05 | 2937 | rna-XM_029780102.2 24436973 | 18 | 100907764 | 100910700 | Octopus sinensis 2607531 | CAA|GTAAGTTATT...TTCTCCTAAAAA/TTGATAATTATT...TTTAG|TAT | 0 | 1 | 60.095 | 
| 133513129 | GT-AG | 0 | 1.000000099473604e-05 | 1704 | rna-XM_029780102.2 24436973 | 19 | 100910779 | 100912482 | Octopus sinensis 2607531 | CAG|GTGAGCAAAT...AACTCTGTATCT/TAAAAACTAACT...TAAAG|GAA | 0 | 1 | 62.154 | 
| 133513130 | GT-AG | 0 | 1.000000099473604e-05 | 1987 | rna-XM_029780102.2 24436973 | 20 | 100912567 | 100914553 | Octopus sinensis 2607531 | GGG|GTAAGTCATT...TCTATCTTATTT/TTCTATCTTATT...ATCAG|GCC | 0 | 1 | 64.371 | 
| 133513131 | GT-AG | 0 | 1.183497788837839e-05 | 8955 | rna-XM_029780102.2 24436973 | 21 | 100914758 | 100923712 | Octopus sinensis 2607531 | AAT|GTAAGTAATA...TTACTATTAATT/TTACTATTAATT...TCCAG|CTA | 0 | 1 | 69.755 | 
| 133513132 | GT-AG | 0 | 1.000000099473604e-05 | 3595 | rna-XM_029780102.2 24436973 | 22 | 100923814 | 100927408 | Octopus sinensis 2607531 | TAA|GTGAGTAATT...ATGTTGTTGAAA/ATGTTGTTGAAA...TTCAG|GAA | 2 | 1 | 72.42 | 
| 133513133 | GT-AG | 0 | 0.000200227895636 | 1612 | rna-XM_029780102.2 24436973 | 23 | 100927550 | 100929161 | Octopus sinensis 2607531 | GAA|GTAAGCAAGA...AGTTTCTTGATT/AGTTTCTTGATT...TTCAG|AGT | 2 | 1 | 76.141 | 
| 133513134 | GT-AG | 0 | 1.000000099473604e-05 | 2921 | rna-XM_029780102.2 24436973 | 24 | 100929319 | 100932239 | Octopus sinensis 2607531 | TGC|GTGAGTGTTA...TGTTTTCTATTT/CTGTATGTGATT...TAAAG|GAA | 0 | 1 | 80.285 | 
| 133513135 | GT-AG | 0 | 0.0010250625592032 | 2640 | rna-XM_029780102.2 24436973 | 25 | 100932438 | 100935077 | Octopus sinensis 2607531 | GTG|GTATGTATAT...ATCGTTTTACTA/AATCGTTTTACT...TTCAG|TTT | 0 | 1 | 85.511 | 
| 133513136 | GT-AG | 0 | 1.000000099473604e-05 | 10198 | rna-XM_029780102.2 24436973 | 26 | 100935285 | 100945482 | Octopus sinensis 2607531 | GAG|GTAAGTGTAT...TTTTTTTTATAT/TTTTTTTTCACT...TTTAG|GGT | 0 | 1 | 90.974 | 
| 133513137 | GT-AG | 0 | 0.0003009043737823 | 3581 | rna-XM_029780102.2 24436973 | 27 | 100945630 | 100949210 | Octopus sinensis 2607531 | AAG|GTATTAATCT...ATTGTTTTAACT/TTTTAACTCATT...TGTAG|GTT | 0 | 1 | 94.854 | 
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" (
"id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL
  ,PRIMARY KEY ([id]),
   FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
   FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
    ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
    ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
    ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
    ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
    ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
    ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
    ON [introns] ([in_cds]);