introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
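In the rows listed for transcript 9114881, `length` is consistent with `end - start + 1`, which suggests that `start` and `end` are 1-based, inclusive coordinates. The following is a minimal Python sketch of that check, using three (start, end, length) triples copied from the table. The helper name `intron_length` is hypothetical and not part of the database, and the inclusive-coordinate convention is an assumption inferred from these rows only.

```python
def intron_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based, inclusive start and end coordinates."""
    return end - start + 1


# (start, end, length) copied from introns 49389607, 49389608 and 49389612.
rows = [
    (2064589, 2068454, 3866),
    (2064190, 2064532, 343),
    (2060675, 2060829, 155),
]

# Collect any row where the stored length disagrees with the assumption.
mismatches = [r for r in rows if intron_length(r[0], r[1]) != r[2]]
print(mismatches)  # an empty list means the assumption holds for these rows
```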
15 rows where transcript_id = 9114881
id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
49389607 | GT-AG | 0 | 1.000000099473604e-05 | 3866 | rna-gnl\|WGS:VYZG\|COLPIC_R00035_mrna 9114881 | 1 | 2064589 | 2068454 | Columbina picui 115618 | AAG\|GTAAGGGCTT...ATAATTTTAATC/ATAATTTTAATC...TGTAG\|GAT | 1 | 1 | 6.85 |
49389608 | GT-AG | 0 | 1.000000099473604e-05 | 343 | rna-gnl\|WGS:VYZG\|COLPIC_R00035_mrna 9114881 | 2 | 2064190 | 2064532 | Columbina picui 115618 | CAA\|GTCATGCTTC...ATGTTCTTAGGC/TTTTTTGTTATG...TTCAG\|AAG | 0 | 1 | 8.901 |
49389609 | GT-AG | 0 | 1.000000099473604e-05 | 1224 | rna-gnl\|WGS:VYZG\|COLPIC_R00035_mrna 9114881 | 3 | 2062875 | 2064098 | Columbina picui 115618 | CGG\|GTAATTGCTT...AAGATTTTAATT/AAGATTTTAATT...TCCAG\|AGC | 1 | 1 | 12.234 |
49389610 | GT-AG | 0 | 1.000000099473604e-05 | 655 | rna-gnl\|WGS:VYZG\|COLPIC_R00035_mrna 9114881 | 4 | 2062089 | 2062743 | Columbina picui 115618 | CAG\|GTGAGCGTGA...ATAATTTTAAAA/ATAATTTTAAAA...ATTAG\|GTC | 0 | 1 | 17.033 |
49389611 | GT-AG | 0 | 1.000000099473604e-05 | 988 | rna-gnl\|WGS:VYZG\|COLPIC_R00035_mrna 9114881 | 5 | 2060929 | 2061916 | Columbina picui 115618 | CAG\|GTAAAATGAG...GACATCTTTTTG/TATGTTGTCACA...TCCAG\|AGT | 1 | 1 | 23.333 |
49389612 | GT-AG | 0 | 1.000000099473604e-05 | 155 | rna-gnl\|WGS:VYZG\|COLPIC_R00035_mrna 9114881 | 6 | 2060675 | 2060829 | Columbina picui 115618 | CAG\|GTAAAAATTG...AGCCCTCTGATT/AGCCCTCTGATT...CTTAG\|GTC | 1 | 1 | 26.96 |
49389613 | GT-AG | 0 | 0.000104404657276 | 544 | rna-gnl\|WGS:VYZG\|COLPIC_R00035_mrna 9114881 | 7 | 2059846 | 2060389 | Columbina picui 115618 | TGA\|GTAAGTATGC...TTTCTTTTAAAT/TTTCTTTTAAAT...TGTAG\|CTG | 1 | 1 | 37.399 |
49389614 | GT-AG | 0 | 0.0021679885703909 | 1238 | rna-gnl\|WGS:VYZG\|COLPIC_R00035_mrna 9114881 | 8 | 2058449 | 2059686 | Columbina picui 115618 | GCA\|GTAAGTTTGG...AAACTTTTAACT/AAACTTTTAACT...CTCAG\|GTA | 1 | 1 | 43.223 |
49389615 | GT-AG | 0 | 1.000000099473604e-05 | 381 | rna-gnl\|WGS:VYZG\|COLPIC_R00035_mrna 9114881 | 9 | 2057457 | 2057837 | Columbina picui 115618 | AAA\|GTGAGTGTAT...GGATCTTTTGCT/CTGTGGTTGATC...CGTAG\|ATA | 0 | 1 | 65.604 |
49389616 | GT-AG | 0 | 1.000000099473604e-05 | 3481 | rna-gnl\|WGS:VYZG\|COLPIC_R00035_mrna 9114881 | 10 | 2053801 | 2057281 | Columbina picui 115618 | TAG\|GTAGGTGCAT...CTTCTCGTAACA/TTGGATCTGATC...AACAG\|GGT | 1 | 1 | 72.015 |
49389617 | GT-AG | 0 | 0.0041656417579893 | 868 | rna-gnl\|WGS:VYZG\|COLPIC_R00035_mrna 9114881 | 11 | 2052847 | 2053714 | Columbina picui 115618 | CAG\|GTATCAATAT...ATGGTCATATTA/TTCCTGTTCACA...CACAG\|CGT | 0 | 1 | 75.165 |
49389618 | GT-AG | 0 | 0.0002959158038294 | 935 | rna-gnl\|WGS:VYZG\|COLPIC_R00035_mrna 9114881 | 12 | 2051807 | 2052741 | Columbina picui 115618 | GAG\|GTAACTGCTT...TAGCTTTTGAAT/TAGCTTTTGAAT...CTTAG\|GCC | 0 | 1 | 79.011 |
49389619 | GT-AG | 0 | 0.0045132810975369 | 996 | rna-gnl\|WGS:VYZG\|COLPIC_R00035_mrna 9114881 | 13 | 2050725 | 2051720 | Columbina picui 115618 | CAA\|GTATGTATTG...CTTCCTTTGGCT/CTTTGGCTGAAT...TGCAG\|GTT | 2 | 1 | 82.161 |
49389620 | GT-AG | 0 | 1.000000099473604e-05 | 1992 | rna-gnl\|WGS:VYZG\|COLPIC_R00035_mrna 9114881 | 14 | 2048540 | 2050531 | Columbina picui 115618 | GAG\|GTTGGTATAA...TTTCCCATATTC/CTGTTTCCCATA...TTCAG\|AAA | 0 | 1 | 89.231 |
49389621 | GT-AG | 0 | 0.0007732259394872 | 751 | rna-gnl\|WGS:VYZG\|COLPIC_R00035_mrna 9114881 | 15 | 2047629 | 2048379 | Columbina picui 115618 | ATG\|GTAAGCTTCT...CTTTTCTTACTG/TCTTTTCTTACT...TTCAG\|GCT | 1 | 1 | 95.092 |
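In every row above, the `dinucleotide_pair` value matches the first two and last two bases of the intron portion of `scored_motifs`, that is, the text between the first and last `|`. The sketch below recovers the pair from a motif string under that assumption. The function name `terminal_dinucleotides` is hypothetical, and the layout (exon flank, `|`, intron sequence, `|`, exon flank) is inferred from the displayed rows rather than from a documented format.

```python
def terminal_dinucleotides(scored_motifs: str) -> str:
    """Return e.g. 'GT-AG' from a scored_motifs string.

    Assumes the layout seen in the displayed rows:
    exon flank, '|', intron sequence, '|', exon flank.
    """
    first_bar = scored_motifs.index("|")
    last_bar = scored_motifs.rindex("|")
    intron_part = scored_motifs[first_bar + 1:last_bar]
    # First two bases are the 5' terminus, last two the 3' terminus.
    return f"{intron_part[:2]}-{intron_part[-2:]}"


# scored_motifs value copied from intron 49389607.
example = "AAG|GTAAGGGCTT...ATAATTTTAATC/ATAATTTTAATC...TGTAG|GAT"
print(terminal_dinucleotides(example))  # GT-AG
```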
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id] ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase] ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor] ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair] ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score] ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds] ON [introns] ([in_cds]);
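To illustrate how the `transcript_id` index supports the filter used on this page, here is a small Python sketch using the standard-library `sqlite3` module. It builds an in-memory table with a reduced subset of the columns (an assumption made for brevity, not the full schema), loads three rows copied from the table, and runs the same `transcript_id = 9114881` filter. The query plan is printed without an expected value because its exact wording differs between SQLite versions.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Reduced, illustrative schema: only a few of the real columns are kept.
conn.executescript("""
CREATE TABLE introns (
    id INTEGER PRIMARY KEY,
    is_minor INTEGER,
    score REAL,
    transcript_id INTEGER,
    ordinal_index INTEGER,
    phase INTEGER
);
CREATE INDEX idx_introns_transcript_id ON introns (transcript_id);
""")

# Three rows copied from the result table (introns 1, 2 and 13).
conn.executemany(
    "INSERT INTO introns VALUES (?, ?, ?, ?, ?, ?)",
    [
        (49389607, 0, 1.000000099473604e-05, 9114881, 1, 1),
        (49389608, 0, 1.000000099473604e-05, 9114881, 2, 0),
        (49389619, 0, 0.0045132810975369, 9114881, 13, 2),
    ],
)

# Same filter as the page, ordered by position within the transcript.
rows = conn.execute(
    "SELECT id, ordinal_index, phase FROM introns "
    "WHERE transcript_id = ? ORDER BY ordinal_index",
    (9114881,),
).fetchall()
print(rows)

# Ask SQLite how it would execute the filter; the last field is the detail text.
plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT id FROM introns WHERE transcript_id = ?",
    (9114881,),
).fetchall()
print(plan)
```

Against the full database, the same `SELECT` would be expected to return the 15 rows shown above.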