introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
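The coordinate columns can be cross-checked against `length`. In the rows listed on this page, `length` equals `end - start + 1`, which is consistent with 1-based, inclusive coordinates; whether that holds for every row in the database is an assumption. A minimal Python sketch, using three (start, end, length) triples copied from the listing and a hypothetical helper `inclusive_length`:

```python
# (start, end, length) triples copied from rows of the introns listing on this page.
rows = [
    (14176767, 14177802, 1036),  # id 82238500
    (14177994, 14178504, 511),   # id 82238501
    (14170999, 14174888, 3890),  # id 82240375
]


def inclusive_length(start: int, end: int) -> int:
    """Length of a span whose start and end positions are both included."""
    return end - start + 1


# Collect any row whose stored length disagrees with the inclusive span length.
mismatches = [r for r in rows if inclusive_length(r[0], r[1]) != r[2]]
print(mismatches)
```

An empty list means the sampled rows agree with the inclusive-coordinate reading.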
26 rows where transcript_id = 15198739
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 82238500 | GT-AG | 0 | 0.0001823585588511 | 1036 | rna-XM_031066610.1 15198739 | 3 | 14176767 | 14177802 | Geospiza fortis 48883 | CAG\|GTATTCAGAG...GCATTGCTAACA/GCATTGCTAACA...CCTAG\|TGA | 2 | 1 | 19.499 |
| 82238501 | GT-AG | 0 | 1.000000099473604e-05 | 511 | rna-XM_031066610.1 15198739 | 4 | 14177994 | 14178504 | Geospiza fortis 48883 | AAG\|GTAATATACA...CTGCTTTTCTCT/TAATAGTTAATA...TGAAG\|ATT | 1 | 1 | 24.102 |
| 82238502 | GT-AG | 0 | 1.000000099473604e-05 | 1377 | rna-XM_031066610.1 15198739 | 5 | 14178673 | 14180049 | Geospiza fortis 48883 | CTG\|GTGCGTGTAA...CTTGCTTTAACT/CTTGCTTTAACT...TTCAG\|GTG | 1 | 1 | 28.151 |
| 82238503 | GT-AG | 0 | 1.000000099473604e-05 | 415 | rna-XM_031066610.1 15198739 | 6 | 14180192 | 14180606 | Geospiza fortis 48883 | TCG\|GTGAGCCAGA...AGTTCTTTCATG/CATGAATTCACA...TTCAG\|AAT | 2 | 1 | 31.574 |
| 82238504 | GT-AG | 0 | 0.0003089532468416 | 940 | rna-XM_031066610.1 15198739 | 7 | 14180692 | 14181631 | Geospiza fortis 48883 | CAG\|GTATATCATC...GGTCTATTAATG/GGTCTATTAATG...CAAAG\|GTG | 0 | 1 | 33.623 |
| 82238505 | GT-AG | 0 | 1.000000099473604e-05 | 1152 | rna-XM_031066610.1 15198739 | 8 | 14181761 | 14182912 | Geospiza fortis 48883 | CAG\|GTAATTAATT...AATATTTTAATA/AATATTTTAATA...TTCAG\|GTT | 0 | 1 | 36.732 |
| 82238506 | GT-AG | 0 | 0.0002266299418281 | 564 | rna-XM_031066610.1 15198739 | 9 | 14183129 | 14183692 | Geospiza fortis 48883 | AAG\|GTAATCACTA...TAACTCTTAACA/TATAGGTTAATT...GATAG\|AAA | 0 | 1 | 41.938 |
| 82238507 | GT-AG | 0 | 1.000000099473604e-05 | 973 | rna-XM_031066610.1 15198739 | 10 | 14183759 | 14184731 | Geospiza fortis 48883 | TCA\|GTAAGTAAAA...TAATTTTTATAA/CTAATTTTTATA...TTCAG\|GAG | 0 | 1 | 43.529 |
| 82238508 | GT-AG | 0 | 1.000000099473604e-05 | 905 | rna-XM_031066610.1 15198739 | 11 | 14184883 | 14185787 | Geospiza fortis 48883 | AAG\|GTAAATAATC...TCTTCTTTATTA/TTCTTCTTTATT...TCCAG\|AAC | 1 | 1 | 47.168 |
| 82238509 | GT-AG | 0 | 1.000000099473604e-05 | 493 | rna-XM_031066610.1 15198739 | 12 | 14185982 | 14186474 | Geospiza fortis 48883 | AAG\|GTAAAGTTCT...CAAATTTTAACA/TTATTGTTTACT...TTCAG\|AGC | 0 | 1 | 51.844 |
| 82238510 | GT-AG | 0 | 1.000000099473604e-05 | 507 | rna-XM_031066610.1 15198739 | 13 | 14186619 | 14187125 | Geospiza fortis 48883 | CAG\|GTAAGATGAA...CAAACTATAATC/TGTATGCTAACA...TAAAG\|TCT | 0 | 1 | 55.315 |
| 82238511 | GT-AG | 0 | 1.000000099473604e-05 | 2288 | rna-XM_031066610.1 15198739 | 14 | 14187270 | 14189557 | Geospiza fortis 48883 | AAG\|GTGAGCTTTA...GTCTTTTTACAT/TGTCTTTTTACA...CTCAG\|ATG | 0 | 1 | 58.785 |
| 82238512 | GT-AG | 0 | 1.6700506113762112e-05 | 161 | rna-XM_031066610.1 15198739 | 15 | 14189636 | 14189796 | Geospiza fortis 48883 | GGA\|GTAAGTGCTT...TTTATTTTAAAT/TGGTTATTTATT...TGCAG\|CTT | 0 | 1 | 60.665 |
| 82238513 | GT-AG | 0 | 3.871316114249828e-05 | 1752 | rna-XM_031066610.1 15198739 | 16 | 14189949 | 14191700 | Geospiza fortis 48883 | TCA\|GTAAGTCTGA...CTTCGTTTAACT/TGTATACTCATT...TTCAG\|GCT | 2 | 1 | 64.329 |
| 82238514 | GT-AG | 0 | 0.0003211774951212 | 70 | rna-XM_031066610.1 15198739 | 17 | 14191817 | 14191886 | Geospiza fortis 48883 | TAG\|GTAATCTATA...TTGTGCTTATTT/GTTGTGCTTATT...TTTAG\|ATA | 1 | 1 | 67.125 |
| 82238515 | GT-AG | 0 | 0.0003612407465579 | 459 | rna-XM_031066610.1 15198739 | 18 | 14192042 | 14192500 | Geospiza fortis 48883 | AGG\|GTATGTCCAG...AAACTTTTAATT/AAACTTTTAATT...CATAG\|GAA | 0 | 1 | 70.86 |
| 82238516 | GT-AG | 0 | 1.000000099473604e-05 | 980 | rna-XM_031066610.1 15198739 | 19 | 14192557 | 14193536 | Geospiza fortis 48883 | AGA\|GTAAGTAAAG...ACAGCTTTAATA/AATGTTTTCACT...GATAG\|CCA | 2 | 1 | 72.21 |
| 82238517 | GT-AG | 0 | 1.000000099473604e-05 | 1897 | rna-XM_031066610.1 15198739 | 20 | 14193706 | 14195602 | Geospiza fortis 48883 | CAG\|GTATGAAGAC...TTTGCCTTTATG/ATTTAATTTACT...TCTAG\|GCT | 0 | 1 | 76.283 |
| 82238518 | GT-AG | 0 | 1.000000099473604e-05 | 1726 | rna-XM_031066610.1 15198739 | 21 | 14195714 | 14197439 | Geospiza fortis 48883 | AAG\|GTAAAAATAT...AGAGCTTTACTG/ATGTTGATCATT...ATTAG\|GAG | 0 | 1 | 78.959 |
| 82238519 | GT-AG | 0 | 1.000000099473604e-05 | 889 | rna-XM_031066610.1 15198739 | 22 | 14197626 | 14198514 | Geospiza fortis 48883 | GGG\|GTAATAACGT...AAACTCTTAAAT/TTAAATTTAACA...TTTAG\|CAT | 0 | 1 | 83.442 |
| 82238520 | GT-AG | 0 | 1.000000099473604e-05 | 954 | rna-XM_031066610.1 15198739 | 23 | 14198725 | 14199678 | Geospiza fortis 48883 | AGG\|GTAAGCAGCC...AGTGTATTAATA/AGTGTATTAATA...CTTAG\|GTT | 0 | 1 | 88.503 |
| 82238521 | GT-AG | 0 | 0.0007724002798645 | 3295 | rna-XM_031066610.1 15198739 | 24 | 14199882 | 14203176 | Geospiza fortis 48883 | ACT\|GTAAGTTTTA...CTTCTCATGATT/TTGCTTCTCATG...TTTAG\|AAG | 2 | 1 | 93.396 |
| 82238522 | GT-AG | 0 | 1.000000099473604e-05 | 391 | rna-XM_031066610.1 15198739 | 25 | 14203282 | 14203672 | Geospiza fortis 48883 | ACG\|GTAAGATGCC...CTTACCGTGGTG/GTCTAGCTGAGG...TGCAG\|GGA | 2 | 1 | 95.927 |
| 82238523 | GT-AG | 0 | 1.000000099473604e-05 | 1705 | rna-XM_031066610.1 15198739 | 26 | 14203820 | 14205524 | Geospiza fortis 48883 | AAA\|GTAAGTGATT...TTACCTTTATTA/TTTATTATGACA...CACAG\|TAA | 2 | 1 | 99.47 |
| 82240375 | GT-AG | 0 | 1.000000099473604e-05 | 3890 | rna-XM_031066610.1 15198739 | 1 | 14170999 | 14174888 | Geospiza fortis 48883 | GTG\|GTAAGTGTGG...GTCCATTTATTT/AAAGTGCTAACA...TGCAG\|TTA | | 0 | 2.193 |
| 82240376 | GT-AG | 0 | 7.025256221665107e-05 | 1108 | rna-XM_031066610.1 15198739 | 2 | 14175494 | 14176601 | Geospiza fortis 48883 | CAG\|GTCTGTTGCT...TTTTCCTTCTCT/CCTTCTCTAATG...TGCAG\|GTG | | 0 | 16.775 |
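The listing is sorted by `id`, so the first two introns of the transcript (ordinal_index 1 and 2) appear at the bottom. The sketch below shows one way to retrieve the same filter in transcript order with Python's standard `sqlite3` module. It runs against an in-memory stand-in with a reduced column set, because the real database file name is not given here; the helper name `introns_in_transcript_order` is hypothetical. Following the column descriptions, `phase` is taken to be NULL for the two introns with `in_cds = 0`.

```python
import sqlite3

# In-memory stand-in for the real database (its file name is not given on this page).
conn = sqlite3.connect(":memory:")
conn.execute(
    """
    CREATE TABLE introns (
        id INTEGER PRIMARY KEY,
        transcript_id INTEGER,
        ordinal_index INTEGER,
        phase INTEGER,
        in_cds INTEGER
    )
    """
)
# A few rows copied from the listing; phase is assumed NULL for the non-CDS introns.
conn.executemany(
    "INSERT INTO introns VALUES (?, ?, ?, ?, ?)",
    [
        (82238500, 15198739, 3, 2, 1),
        (82238501, 15198739, 4, 1, 1),
        (82240375, 15198739, 1, None, 0),
        (82240376, 15198739, 2, None, 0),
    ],
)


def introns_in_transcript_order(conn, transcript_id):
    """Return (ordinal_index, id, phase, in_cds) tuples sorted by ordinal_index."""
    return conn.execute(
        "SELECT ordinal_index, id, phase, in_cds FROM introns "
        "WHERE transcript_id = ? ORDER BY ordinal_index",
        (transcript_id,),
    ).fetchall()


ordered = introns_in_transcript_order(conn, 15198739)
for row in ordered:
    print(row)
```

Passing the transcript id as a bound parameter (`?`) rather than formatting it into the SQL string keeps the query safe to reuse with other ids.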
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
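The index on `transcript_id` is what lets a filter such as `transcript_id = 15198739` avoid a full table scan. A minimal sketch of how this could be checked with SQLite's `EXPLAIN QUERY PLAN`, assuming an in-memory database that holds a reduced copy of the schema (three columns and one index only); the exact wording of the plan text varies between SQLite versions, so the code only looks for the index name.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Reduced copy of the schema above: only the columns needed for the demonstration.
conn.executescript(
    """
    CREATE TABLE introns (
        id INTEGER PRIMARY KEY,
        transcript_id INTEGER,
        score REAL
    );
    CREATE INDEX idx_introns_transcript_id ON introns (transcript_id);
    """
)

plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT * FROM introns WHERE transcript_id = ?",
    (15198739,),
).fetchall()

# The fourth column of each plan row is the human-readable detail text,
# which names the index when the planner chooses one.
details = [row[3] for row in plan]
uses_index = any("idx_introns_transcript_id" in d for d in details)
print(details, uses_index)
```

The same check can be repeated for the other indexed columns (`taxonomy_id`, `phase`, `is_minor`, `dinucleotide_pair`, `score`, `in_cds`) by adding the matching column and index to the stand-in schema.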