introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
14 rows where transcript_id = 1552350
This data as json, CSV (advanced)
Suggested facets: score, length, phase
id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
8649354 | GT-AG | 0 | 1.000000099473604e-05 | 242 | rna-gnl|WGS:MCFG|BCR32DRAFT_mRNA240269 1552350 | 1 | 42545 | 42786 | Anaeromyces robustus 1754192 | ATG|GTAATAATAT...ATTTATTTATTT/TATTTATTTATT...TATAG|ATT | 1 | 1 | 0.677 |
8649355 | GT-AG | 0 | 1.2895970800064977e-05 | 100 | rna-gnl|WGS:MCFG|BCR32DRAFT_mRNA240269 1552350 | 2 | 42139 | 42238 | Anaeromyces robustus 1754192 | AAG|GTAGTGTTTT...ATTTATTTAATA/TAATTATTTATT...TTTAG|ATA | 1 | 1 | 11.574 |
8649356 | GT-AG | 0 | 0.0016638631047453 | 136 | rna-gnl|WGS:MCFG|BCR32DRAFT_mRNA240269 1552350 | 3 | 41771 | 41906 | Anaeromyces robustus 1754192 | TAA|GTAATTTTTT...TATATTTTAAAT/TCAATATTTATT...TATAG|GGA | 2 | 1 | 19.836 |
8649357 | GT-AG | 0 | 1.000000099473604e-05 | 135 | rna-gnl|WGS:MCFG|BCR32DRAFT_mRNA240269 1552350 | 4 | 41537 | 41671 | Anaeromyces robustus 1754192 | AAG|GTTAAATATT...TAAACCTTATTT/ACTTTTCTCACT...TAAAG|ATT | 2 | 1 | 23.362 |
8649358 | GT-AG | 0 | 0.0011874047482219 | 131 | rna-gnl|WGS:MCFG|BCR32DRAFT_mRNA240269 1552350 | 5 | 41305 | 41435 | Anaeromyces robustus 1754192 | AAC|GTATGTAAAT...TATACCTAAATA/TATATACTAATT...ATTAG|ATG | 1 | 1 | 26.959 |
8649359 | GT-AG | 0 | 1.3056363626834249 | 155 | rna-gnl|WGS:MCFG|BCR32DRAFT_mRNA240269 1552350 | 6 | 40647 | 40801 | Anaeromyces robustus 1754192 | AAA|GTATATTTAT...TCATTTTTAACT/TCATTTTTAACT...AAAAG|ATA | 0 | 1 | 44.872 |
8649360 | GT-AG | 0 | 1.000000099473604e-05 | 146 | rna-gnl|WGS:MCFG|BCR32DRAFT_mRNA240269 1552350 | 7 | 40398 | 40543 | Anaeromyces robustus 1754192 | ATT|GTAATAAAAA...AATTTTTTATTT/TAATTTTTTATT...AATAG|TAT | 1 | 1 | 48.54 |
8649361 | GT-AG | 0 | 7.545685654574506e-05 | 135 | rna-gnl|WGS:MCFG|BCR32DRAFT_mRNA240269 1552350 | 8 | 39966 | 40100 | Anaeromyces robustus 1754192 | ACA|GTAATTAAAT...ATTATTTTAACA/ATTATTTTAACA...TATAG|ATT | 1 | 1 | 59.117 |
8649362 | GT-AG | 0 | 1.000000099473604e-05 | 105 | rna-gnl|WGS:MCFG|BCR32DRAFT_mRNA240269 1552350 | 9 | 39749 | 39853 | Anaeromyces robustus 1754192 | AAA|GTAATAACAT...ATATTATTAACT/ATATTATTAACT...AATAG|GAG | 2 | 1 | 63.105 |
8649363 | GT-AG | 0 | 0.0002388137020482 | 162 | rna-gnl|WGS:MCFG|BCR32DRAFT_mRNA240269 1552350 | 10 | 39366 | 39527 | Anaeromyces robustus 1754192 | AAG|GTATTATATG...AAATTATTAAAT/TTTATATTTATA...ATTAG|ATT | 1 | 1 | 70.976 |
8649364 | GT-AG | 0 | 1.000000099473604e-05 | 251 | rna-gnl|WGS:MCFG|BCR32DRAFT_mRNA240269 1552350 | 11 | 38860 | 39110 | Anaeromyces robustus 1754192 | CAG|GTAATATTTT...AAATTATTATTA/TATATACTAAAT...TATAG|AAC | 1 | 1 | 80.057 |
8649365 | GT-AG | 0 | 0.0165110274647852 | 88 | rna-gnl|WGS:MCFG|BCR32DRAFT_mRNA240269 1552350 | 12 | 38598 | 38685 | Anaeromyces robustus 1754192 | CAA|GTATTATTAA...TTTTTTTTATAT/CTTTTTTTTATA...AATAG|ATA | 1 | 1 | 86.254 |
8649366 | GT-AG | 0 | 0.0006805016762049 | 393 | rna-gnl|WGS:MCFG|BCR32DRAFT_mRNA240269 1552350 | 13 | 38088 | 38480 | Anaeromyces robustus 1754192 | TAG|GTTTTTTACC...ATGTTTTTAAAT/ATGTTTTTAAAT...GGCAG|ATA | 1 | 1 | 90.42 |
8649367 | GT-AG | 0 | 0.0001673490356739 | 143 | rna-gnl|WGS:MCFG|BCR32DRAFT_mRNA240269 1552350 | 14 | 37858 | 38000 | Anaeromyces robustus 1754192 | ACA|GTAAGTCAAT...AATTCCTTAATA/TAATTTATCATT...TAAAG|AAA | 1 | 1 | 93.519 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" ( "id" INTEGER, "dinucleotide_pair" TEXT, "is_minor" INTEGER, "score" REAL, "length" INTEGER, "transcript_id" INTEGER, "ordinal_index" INTEGER, "start" INTEGER, "end" INTEGER, "taxonomy_id" INTEGER, "scored_motifs" TEXT, "phase" INTEGER, "in_cds" INTEGER, "relative_position" REAL ,PRIMARY KEY ([id]), FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]), FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id]) ); CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]); CREATE INDEX [idx_introns_taxonomy_id] ON [introns] ([taxonomy_id]); CREATE INDEX [idx_introns_phase] ON [introns] ([phase]); CREATE INDEX [idx_introns_is_minor] ON [introns] ([is_minor]); CREATE INDEX [idx_introns_dinucleotide_pair] ON [introns] ([dinucleotide_pair]); CREATE INDEX [idx_introns_score] ON [introns] ([score]); CREATE INDEX [idx_introns_in_cds] ON [introns] ([in_cds]);