introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
23 rows where transcript_id = 31460330
This data as json, CSV (advanced)
Suggested facets: score, phase
| id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position | 
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 175006401 | GT-AG | 0 | 1.000000099473604e-05 | 11799 | rna-XM_016138251.2 31460330 | 1 | 80159030 | 80170828 | Rousettus aegyptiacus 9407 | CGG|GTGGGTACCA...AAAACTTTACAT/GAAAACTTTACA...ATCAG|TCA | 2 | 1 | 2.314 | 
| 175006402 | GT-AG | 0 | 0.0003963621348844 | 1922 | rna-XM_016138251.2 31460330 | 2 | 80157005 | 80158926 | Rousettus aegyptiacus 9407 | CCG|GTAAGCTTTC...GTTTCTTTCATA/GAAGTTTTCATT...TTTAG|GAA | 0 | 1 | 3.506 | 
| 175006403 | GT-AG | 0 | 7.636269454890393e-05 | 706 | rna-XM_016138251.2 31460330 | 3 | 80156150 | 80156855 | Rousettus aegyptiacus 9407 | AAG|GTATTGCCCA...TTTTCCTTCCTT/AGTGGTATAATT...TTCAG|ACT | 2 | 1 | 5.23 | 
| 175006404 | GT-AG | 0 | 1.000000099473604e-05 | 2978 | rna-XM_016138251.2 31460330 | 4 | 80152997 | 80155974 | Rousettus aegyptiacus 9407 | AGG|GTAAGTAAGC...CTCTCTTTCCCT/CAGATGCACAGT...TGCAG|GGG | 0 | 1 | 7.254 | 
| 175006405 | GT-AG | 0 | 0.0001968412144093 | 968 | rna-XM_016138251.2 31460330 | 5 | 80151900 | 80152867 | Rousettus aegyptiacus 9407 | CTG|GTACTTTGAA...AAATCATTAGCG/TACAAACTCACA...TACAG|CGT | 0 | 1 | 8.747 | 
| 175006406 | GT-AG | 0 | 0.000101992722109 | 905 | rna-XM_016138251.2 31460330 | 6 | 80150944 | 80151848 | Rousettus aegyptiacus 9407 | CTG|GTAAGCTGAC...CTGCTTTTGAAT/TGTTATCTAATG...CTCAG|AAA | 0 | 1 | 9.337 | 
| 175006407 | GT-AG | 0 | 0.00054190037472 | 1123 | rna-XM_016138251.2 31460330 | 7 | 80149659 | 80150781 | Rousettus aegyptiacus 9407 | TCC|GTAAGCCACC...TTTTCTTTCATT/TTTTCTTTCATT...TGCAG|ATA | 0 | 1 | 11.211 | 
| 175006408 | GT-AG | 0 | 3.5579101067485394e-05 | 847 | rna-XM_016138251.2 31460330 | 8 | 80148707 | 80149553 | Rousettus aegyptiacus 9407 | GAG|GTAAGCTTTA...GATGTTTTCTTG/ATGAATTTCACC...TTCAG|GCA | 0 | 1 | 12.426 | 
| 175006409 | GT-AG | 0 | 9.76705196986091e-05 | 332 | rna-XM_016138251.2 31460330 | 9 | 80148279 | 80148610 | Rousettus aegyptiacus 9407 | CAG|GTTTTTATAC...TTTTCATTGACA/TTGCTTTTCATT...AACAG|TTT | 0 | 1 | 13.537 | 
| 175006410 | GT-AG | 0 | 0.2619442531970026 | 546 | rna-XM_016138251.2 31460330 | 10 | 80147607 | 80148152 | Rousettus aegyptiacus 9407 | GAG|GTATCATCTC...ATGATTTTATTT/TATGATTTTATT...TGCAG|AAA | 0 | 1 | 14.995 | 
| 175006411 | GT-AG | 0 | 1.000000099473604e-05 | 1154 | rna-XM_016138251.2 31460330 | 11 | 80146300 | 80147453 | Rousettus aegyptiacus 9407 | CAG|GTGTGTACTC...GTACTCATGAAC/TGTGTACTCATG...TACAG|AAA | 0 | 1 | 16.765 | 
| 175006412 | GT-AG | 0 | 2.6634010352112653e-05 | 807 | rna-XM_016138251.2 31460330 | 12 | 80145338 | 80146144 | Rousettus aegyptiacus 9407 | CAA|GTAAGTTGTT...TTCCCCTTTGTG/TAAAGTTTGAAA...CCCAG|GAT | 2 | 1 | 18.558 | 
| 175006413 | GT-AG | 0 | 5.4944818297422086e-05 | 761 | rna-XM_016138251.2 31460330 | 13 | 80144450 | 80145210 | Rousettus aegyptiacus 9407 | AAG|GTATGTCCCC...GGAACCATAAAT/TTTTTGGTCATG...TCTAG|CTG | 0 | 1 | 20.028 | 
| 175006414 | GT-AG | 0 | 1.000000099473604e-05 | 269 | rna-XM_016138251.2 31460330 | 14 | 80143979 | 80144247 | Rousettus aegyptiacus 9407 | CAG|GTTGGTTTGG...GTGGTCCTAACA/AATCCTCTGATT...TACAG|TAA | 1 | 1 | 22.365 | 
| 175006415 | GT-AG | 0 | 1.000000099473604e-05 | 659 | rna-XM_016138251.2 31460330 | 15 | 80143093 | 80143751 | Rousettus aegyptiacus 9407 | AAG|GTAGGTATCC...CTTTCCTTTTTT/TTAAGAATCACA...GACAG|AGT | 0 | 1 | 24.991 | 
| 175006416 | GT-AG | 0 | 1.000000099473604e-05 | 429 | rna-XM_016138251.2 31460330 | 16 | 80142497 | 80142925 | Rousettus aegyptiacus 9407 | CAG|GTAATCACGC...ATGTTCATAGTT/CATAGTTTGATG...TACAG|CTT | 2 | 1 | 26.924 | 
| 175006417 | GT-AG | 0 | 1.000000099473604e-05 | 539 | rna-XM_016138251.2 31460330 | 17 | 80141819 | 80142357 | Rousettus aegyptiacus 9407 | AAG|GTAATGTGGA...TCTTTTTTGTCT/TATAGCCTAAAG...GACAG|AAA | 0 | 1 | 28.532 | 
| 175006418 | GT-AG | 0 | 0.0038766753091959 | 1056 | rna-XM_016138251.2 31460330 | 18 | 80140569 | 80141624 | Rousettus aegyptiacus 9407 | CAG|GTATGCCAAC...AATGCTTTAACA/ATTTTTATAATT...CCTAG|ATT | 2 | 1 | 30.776 | 
| 175006419 | GT-AG | 0 | 1.9716506984246792e-05 | 484 | rna-XM_016138251.2 31460330 | 19 | 80139922 | 80140405 | Rousettus aegyptiacus 9407 | AAG|GTAAGCCAGG...TATACTTTAACA/AAATATTTTATT...TATAG|AAC | 0 | 1 | 32.662 | 
| 175006420 | GT-AG | 0 | 0.0664341611447009 | 408 | rna-XM_016138251.2 31460330 | 20 | 80139430 | 80139837 | Rousettus aegyptiacus 9407 | AAG|GTATGCTGAT...TTTTTCTTAATC/TTAATCTTCATT...TTCAG|GAT | 0 | 1 | 33.634 | 
| 175006421 | GT-AG | 0 | 0.0004801895058335 | 584 | rna-XM_016138251.2 31460330 | 21 | 80138738 | 80139321 | Rousettus aegyptiacus 9407 | GAG|GTATATACAT...ACTTTTTTGAAA/ACTTTTTTGAAA...TCAAG|GCT | 0 | 1 | 34.884 | 
| 175006422 | GT-AG | 0 | 1.7853525037277865e-05 | 779 | rna-XM_016138251.2 31460330 | 22 | 80137860 | 80138638 | Rousettus aegyptiacus 9407 | AAG|GTAATTTATT...AATATTTTATTT/ATATTTCTGACT...CACAG|CTG | 0 | 1 | 36.029 | 
| 175006423 | GT-AG | 0 | 0.0016448727743993 | 959 | rna-XM_016138251.2 31460330 | 23 | 80134606 | 80135564 | Rousettus aegyptiacus 9407 | GAG|GTATTCACAA...ATTTCTTTCTTT/TTCTTTTTCAAT...TCTAG|GCA | 0 | 1 | 62.582 | 
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" (
"id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL
  ,PRIMARY KEY ([id]),
   FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
   FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
    ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
    ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
    ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
    ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
    ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
    ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
    ON [introns] ([in_cds]);