introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
30 rows where transcript_id = 3395334
This data as json, CSV (advanced)
Suggested facets: score, phase
| id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 16698804 | GT-AG | 0 | 1.000000099473604e-05 | 1429 | rna-gnl|WGS:VXAK|AREINT_R06643_mrna 3395334 | 1 | 7820 | 9248 | Arenaria interpres 54971 | ATG|GTTAGTAGAG...CTACTCTTAACT/CTACTCTTAACT...TATAG|TGT | 0 | 1 | 3.313 |
| 16698805 | GT-AG | 0 | 0.0034090622162703 | 12052 | rna-gnl|WGS:VXAK|AREINT_R06643_mrna 3395334 | 2 | 9388 | 21439 | Arenaria interpres 54971 | ACT|GTAAGTTTCA...TTGGTCTTATTT/CTTATTTTCATA...GTCAG|GTG | 1 | 1 | 7.965 |
| 16698806 | GT-AG | 0 | 1.000000099473604e-05 | 376 | rna-gnl|WGS:VXAK|AREINT_R06643_mrna 3395334 | 3 | 21575 | 21950 | Arenaria interpres 54971 | ATG|GTGGGTGCTT...TTAGCCTTCACC/TATCTCCTAATT...CCTAG|GTA | 1 | 1 | 12.483 |
| 16698807 | GT-AG | 0 | 1.000000099473604e-05 | 190 | rna-gnl|WGS:VXAK|AREINT_R06643_mrna 3395334 | 4 | 22135 | 22324 | Arenaria interpres 54971 | AAG|GTTATTTGCT...CTTTCCGTATTT/TCTCTGCTAAAG...CACAG|GCC | 2 | 1 | 18.641 |
| 16698808 | GT-AG | 0 | 1.2554480277225297e-05 | 1167 | rna-gnl|WGS:VXAK|AREINT_R06643_mrna 3395334 | 5 | 22354 | 23520 | Arenaria interpres 54971 | TAG|GTAAGTTACT...TGTCTTTTAACT/TGTCTTTTAACT...TTCAG|GAG | 1 | 1 | 19.612 |
| 16698809 | GT-AG | 0 | 1.000000099473604e-05 | 680 | rna-gnl|WGS:VXAK|AREINT_R06643_mrna 3395334 | 6 | 23570 | 24249 | Arenaria interpres 54971 | AAG|GTGAATTTAA...ATTTCTTTATTT/TTGTTTTTTATC...TGTAG|TTG | 2 | 1 | 21.252 |
| 16698810 | GT-AG | 0 | 0.0004473724865247 | 1823 | rna-gnl|WGS:VXAK|AREINT_R06643_mrna 3395334 | 7 | 24291 | 26113 | Arenaria interpres 54971 | CTG|GTATGTACAA...GTTTCCATAATT/CCATAATTGATT...TACAG|GAA | 1 | 1 | 22.624 |
| 16698811 | GT-AG | 0 | 1.000000099473604e-05 | 3886 | rna-gnl|WGS:VXAK|AREINT_R06643_mrna 3395334 | 8 | 26192 | 30077 | Arenaria interpres 54971 | CAG|GTAATACTAA...TCTTTCTTCCCA/CCCAATTTAAAA...AGCAG|GTC | 1 | 1 | 25.234 |
| 16698812 | GT-AG | 0 | 1.000000099473604e-05 | 1234 | rna-gnl|WGS:VXAK|AREINT_R06643_mrna 3395334 | 9 | 30135 | 31368 | Arenaria interpres 54971 | CAG|GTAGAGCTTT...GCTGCTTTGACA/GCTGCTTTGACA...CACAG|AGC | 1 | 1 | 27.142 |
| 16698813 | GT-AG | 0 | 1.000000099473604e-05 | 3426 | rna-gnl|WGS:VXAK|AREINT_R06643_mrna 3395334 | 11 | 31494 | 34919 | Arenaria interpres 54971 | AAG|GTGAGTTCTC...GTTGTCTGAACG/GTGATATTTATG...TGAAG|ATG | 1 | 1 | 31.258 |
| 16698814 | GT-AG | 0 | 1.000000099473604e-05 | 5760 | rna-gnl|WGS:VXAK|AREINT_R06643_mrna 3395334 | 12 | 35022 | 40781 | Arenaria interpres 54971 | TTG|GTAAGATACA...TCCTTTTTAATT/CTCTGTCTCATT...TGCAG|AGA | 1 | 1 | 34.672 |
| 16698815 | GT-AG | 0 | 0.000215044232836 | 2265 | rna-gnl|WGS:VXAK|AREINT_R06643_mrna 3395334 | 13 | 40848 | 43112 | Arenaria interpres 54971 | ATG|GTACGTGTTA...GATGTTTTAATT/GATGTTTTAATT...TAAAG|CTC | 1 | 1 | 36.881 |
| 16698816 | GT-AG | 0 | 1.000000099473604e-05 | 1410 | rna-gnl|WGS:VXAK|AREINT_R06643_mrna 3395334 | 14 | 43297 | 44706 | Arenaria interpres 54971 | AAA|GTAAGGAATT...TGTATCTTAATT/TGTATCTTAATT...TTCAG|GCA | 2 | 1 | 43.039 |
| 16698817 | GT-AG | 0 | 2.3410090195496333e-05 | 2592 | rna-gnl|WGS:VXAK|AREINT_R06643_mrna 3395334 | 15 | 44766 | 47357 | Arenaria interpres 54971 | GAG|GTAGGTCTTC...TTGTCCTTGTTC/TTCTGTGTCACT...TACAG|TGA | 1 | 1 | 45.013 |
| 16698818 | GT-AG | 0 | 1.000000099473604e-05 | 697 | rna-gnl|WGS:VXAK|AREINT_R06643_mrna 3395334 | 16 | 47430 | 48126 | Arenaria interpres 54971 | TTG|GTGAGTACAG...AGTTTCTGGATG/AATGGTTTGATG...ATCAG|CCG | 1 | 1 | 47.423 |
| 16698819 | GT-AG | 0 | 1.000000099473604e-05 | 348 | rna-gnl|WGS:VXAK|AREINT_R06643_mrna 3395334 | 17 | 48311 | 48658 | Arenaria interpres 54971 | AAG|GTAAGTGTTT...TAGTACTTAATT/TAGTACTTAATT...TGCAG|ACC | 2 | 1 | 53.581 |
| 16698820 | GT-AG | 0 | 1.000000099473604e-05 | 2130 | rna-gnl|WGS:VXAK|AREINT_R06643_mrna 3395334 | 18 | 48718 | 50847 | Arenaria interpres 54971 | CAG|GTAAGACAGG...ACCTCTTTATCT/AATACTCTGACA...TTCAG|TCA | 1 | 1 | 55.556 |
| 16698821 | GT-AG | 0 | 1.000000099473604e-05 | 492 | rna-gnl|WGS:VXAK|AREINT_R06643_mrna 3395334 | 19 | 50920 | 51411 | Arenaria interpres 54971 | TCG|GTAAGTTACA...ATATCAGTAAAT/TAAAATATCAGT...TTCAG|CCC | 1 | 1 | 57.965 |
| 16698822 | GT-AG | 0 | 0.0003763742222762 | 509 | rna-gnl|WGS:VXAK|AREINT_R06643_mrna 3395334 | 20 | 51596 | 52104 | Arenaria interpres 54971 | AAA|GTAGGTTTAT...TTTTTCTTACAT/TTTTTTCTTACA...TTTAG|GCC | 2 | 1 | 64.123 |
| 16698823 | GT-AG | 0 | 1.000000099473604e-05 | 696 | rna-gnl|WGS:VXAK|AREINT_R06643_mrna 3395334 | 21 | 52164 | 52859 | Arenaria interpres 54971 | AAG|GTAAGCAATT...GTTATATTGACT/GTTATATTGACT...TCCAG|CAA | 1 | 1 | 66.098 |
| 16698824 | GT-AG | 0 | 1.000000099473604e-05 | 471 | rna-gnl|WGS:VXAK|AREINT_R06643_mrna 3395334 | 22 | 52944 | 53414 | Arenaria interpres 54971 | AAG|GTATGGAAGA...ATGTCCATATGT/CCATATGTAATC...TCCAG|GTC | 1 | 1 | 68.909 |
| 16698825 | GT-AG | 0 | 9.22899964649176e-05 | 1900 | rna-gnl|WGS:VXAK|AREINT_R06643_mrna 3395334 | 23 | 53511 | 55410 | Arenaria interpres 54971 | TTG|GTAAATATTG...CGCGTCTTGATT/CTTGATTTAATT...GGCAG|AGG | 1 | 1 | 72.122 |
| 16698826 | GT-AG | 0 | 1.000000099473604e-05 | 831 | rna-gnl|WGS:VXAK|AREINT_R06643_mrna 3395334 | 24 | 55477 | 56307 | Arenaria interpres 54971 | GCG|GTAAGTTTAT...AGGAGCTTATTT/CAGGAGCTTATT...TTCAG|GGG | 1 | 1 | 74.331 |
| 16698827 | GT-AG | 0 | 1.000000099473604e-05 | 596 | rna-gnl|WGS:VXAK|AREINT_R06643_mrna 3395334 | 25 | 56492 | 57087 | Arenaria interpres 54971 | GAA|GTAAGACTTC...GTGGTCTTATTA/TGTGGTCTTATT...TGCAG|GCC | 2 | 1 | 80.489 |
| 16698828 | GT-AG | 0 | 1.000000099473604e-05 | 916 | rna-gnl|WGS:VXAK|AREINT_R06643_mrna 3395334 | 26 | 57141 | 58056 | Arenaria interpres 54971 | AAA|GTGAGTTTTG...GTTTGTTTAAAT/TTTAAATTCATT...TTTAG|GAA | 1 | 1 | 82.262 |
| 16698829 | GT-AG | 0 | 1.000000099473604e-05 | 770 | rna-gnl|WGS:VXAK|AREINT_R06643_mrna 3395334 | 27 | 58168 | 58937 | Arenaria interpres 54971 | CAG|GTGCGTACAC...ATCTGTTTAATT/ATCTGTTTAATT...TCCAG|ATG | 1 | 1 | 85.977 |
| 16698830 | GT-AG | 0 | 1.000000099473604e-05 | 760 | rna-gnl|WGS:VXAK|AREINT_R06643_mrna 3395334 | 28 | 59022 | 59781 | Arenaria interpres 54971 | TTG|GTAATTGCTT...GTAGACTTGAAA/TTCAGTGTCATC...CTTAG|GTG | 1 | 1 | 88.788 |
| 16698831 | GT-AG | 0 | 1.000000099473604e-05 | 1081 | rna-gnl|WGS:VXAK|AREINT_R06643_mrna 3395334 | 29 | 59966 | 61046 | Arenaria interpres 54971 | TCG|GTAGGTGGAT...ATTTCTTTATTT/CTTTATTTTATT...TACAG|ACC | 2 | 1 | 94.946 |
| 16698832 | GT-AG | 0 | 1.000000099473604e-05 | 1417 | rna-gnl|WGS:VXAK|AREINT_R06643_mrna 3395334 | 30 | 61076 | 62492 | Arenaria interpres 54971 | ATC|GTAAGTACTG...ACCGTTTTCATT/ACCGTTTTCATT...CTTAG|AGC | 1 | 1 | 95.917 |
| 16698833 | GT-AG | 0 | 1.000000099473604e-05 | 3888 | rna-gnl|WGS:VXAK|AREINT_R06643_mrna 3395334 | 31 | 62559 | 66446 | Arenaria interpres 54971 | CAG|GTGAGGCAGA...TTTTTCTTCATT/TTTTTCTTCATT...AACAG|ATA | 1 | 1 | 98.126 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL
,PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);