introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
17 rows where transcript_id = 15198769
This data as json, CSV (advanced)
Suggested facets: score, length, scored_motifs, phase
| id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position | 
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 82238965 | GT-AG | 0 | 0.0143207724607759 | 2558 | rna-XM_031062952.1 15198769 | 1 | 17971907 | 17974464 | Geospiza fortis 48883 | AAT|GTATAATTCC...CTGTCTCTAATA/AATTTTGTTATT...GTCAG|AAT | 1 | 1 | 1.04 | 
| 82238966 | GT-AG | 0 | 1.000000099473604e-05 | 658 | rna-XM_031062952.1 15198769 | 2 | 17971086 | 17971743 | Geospiza fortis 48883 | AAG|GTAGAACTTT...TGCATTTTAAAA/TGCATTTTAAAA...CACAG|GTT | 2 | 1 | 7.824 | 
| 82238967 | GT-AG | 0 | 1.000000099473604e-05 | 1527 | rna-XM_031062952.1 15198769 | 3 | 17969455 | 17970981 | Geospiza fortis 48883 | GTG|GTAAGATATG...TCTCTCTTTCTT/TACATTTTCAGA...CACAG|CTT | 1 | 1 | 12.151 | 
| 82238968 | GT-AG | 0 | 1.000000099473604e-05 | 573 | rna-XM_031062952.1 15198769 | 4 | 17968722 | 17969294 | Geospiza fortis 48883 | TTG|GTAAATACAG...GTTTTTTTACCA/TGTTTTTTTACC...TTCAG|TAA | 2 | 1 | 18.81 | 
| 82238969 | GT-AG | 0 | 0.0020773317649581 | 2122 | rna-XM_031062952.1 15198769 | 5 | 17966490 | 17968611 | Geospiza fortis 48883 | TAG|GTAACTATTT...CTGTCTCTAATA/AATTTTGTTATT...GTCAG|AAT | 1 | 1 | 23.387 | 
| 82238970 | GT-AG | 0 | 1.000000099473604e-05 | 658 | rna-XM_031062952.1 15198769 | 6 | 17965669 | 17966326 | Geospiza fortis 48883 | AAG|GTAGAACTTT...TGCATTTTAAAA/TGCATTTTAAAA...CACAG|GTT | 2 | 1 | 30.171 | 
| 82238971 | GT-AG | 0 | 1.000000099473604e-05 | 1527 | rna-XM_031062952.1 15198769 | 7 | 17964038 | 17965564 | Geospiza fortis 48883 | GTG|GTAAGATATG...TCTCTCTTTCTT/TACATTTTCAGA...CACAG|CTT | 1 | 1 | 34.499 | 
| 82238972 | GT-AG | 0 | 1.000000099473604e-05 | 573 | rna-XM_031062952.1 15198769 | 8 | 17963305 | 17963877 | Geospiza fortis 48883 | TTG|GTAAATACAG...GTTTTTTTACCA/TGTTTTTTTACC...TTCAG|TAA | 2 | 1 | 41.157 | 
| 82238973 | GT-AG | 0 | 0.0074922641599759 | 2956 | rna-XM_031062952.1 15198769 | 9 | 17960239 | 17963194 | Geospiza fortis 48883 | TAG|GTAACTATTT...AGTTTTTTGAAT/AGTTTTTTGAAT...TACAG|ATT | 1 | 1 | 45.734 | 
| 82238974 | GT-AG | 0 | 1.000000099473604e-05 | 796 | rna-XM_031062952.1 15198769 | 10 | 17959283 | 17960078 | Geospiza fortis 48883 | AAG|GTGAATATGA...TATTTTTGGAAT/AAAATATTTAAT...TTTAG|AAA | 2 | 1 | 52.393 | 
| 82238975 | GT-AG | 0 | 1.000000099473604e-05 | 87 | rna-XM_031062952.1 15198769 | 11 | 17959086 | 17959172 | Geospiza fortis 48883 | CTG|GTAAAATACA...GATGCTTTGAAT/AACTTTCTGATG...TACAG|CCT | 1 | 1 | 56.97 | 
| 82238976 | GT-AG | 0 | 1.000000099473604e-05 | 129 | rna-XM_031062952.1 15198769 | 12 | 17958797 | 17958925 | Geospiza fortis 48883 | AGG|GTGAGATATA...TTTGCATTAATT/CATTAATTCATT...GGCAG|AAA | 2 | 1 | 63.629 | 
| 82238977 | GT-AG | 0 | 1.000000099473604e-05 | 771 | rna-XM_031062952.1 15198769 | 13 | 17957910 | 17958680 | Geospiza fortis 48883 | CTG|GTAAGTAAAA...TCAGCTTTAAAT/CTTTAAATTACT...AATAG|TGT | 1 | 1 | 68.456 | 
| 82238978 | GT-AG | 0 | 1.000000099473604e-05 | 376 | rna-XM_031062952.1 15198769 | 14 | 17957359 | 17957734 | Geospiza fortis 48883 | CAG|GTGAGCCCTA...TTATTCTTTTCC/TATTTAGTCACA...CTCAG|TCT | 2 | 1 | 75.739 | 
| 82238979 | GT-AG | 0 | 0.0001887247951206 | 306 | rna-XM_031062952.1 15198769 | 15 | 17956877 | 17957182 | Geospiza fortis 48883 | CTG|GTATGTAGCA...TTTCTCTTGTAC/GACAAACTAATC...TTTAG|ATC | 1 | 1 | 83.063 | 
| 82238980 | GT-AG | 0 | 1.000000099473604e-05 | 1271 | rna-XM_031062952.1 15198769 | 16 | 17955510 | 17956780 | Geospiza fortis 48883 | AAG|GTACAGAGAA...TAACCTTTATTT/ACATTTCTCATC...TATAG|GTC | 1 | 1 | 87.058 | 
| 82238981 | GT-AG | 0 | 1.4517346767268598e-05 | 1357 | rna-XM_031062952.1 15198769 | 17 | 17954013 | 17955369 | Geospiza fortis 48883 | AAG|GTAACGACAA...TCTTCCTTGCAT/CTTCCTTGCATG...TACAG|GGA | 0 | 1 | 92.884 | 
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" (
"id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL
  ,PRIMARY KEY ([id]),
   FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
   FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
    ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
    ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
    ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
    ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
    ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
    ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
    ON [introns] ([in_cds]);