introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
36 rows where transcript_id = 3555680
This data as json, CSV (advanced)
Suggested facets: score, phase
| id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 17676362 | GT-AG | 0 | 1.000000099473604e-05 | 2263 | rna-XM_038335600.1 3555680 | 1 | 150223783 | 150226045 | Arvicola amphibius 1047088 | GGG|GTGAGTGGTC...AGGCTCTGAGTC/AAGGCTCTGAGT...CTCAG|CCA | 1 | 1 | 3.452 |
| 17676363 | GT-AG | 0 | 1.000000099473604e-05 | 79 | rna-XM_038335600.1 3555680 | 2 | 150226138 | 150226216 | Arvicola amphibius 1047088 | GAG|GTAAGTGAAT...ATGGTCTAACCC/TATGGTCTAACC...CTCAG|GTT | 0 | 1 | 5.437 |
| 17676364 | GT-AG | 0 | 1.000000099473604e-05 | 350 | rna-XM_038335600.1 3555680 | 3 | 150226301 | 150226650 | Arvicola amphibius 1047088 | GAG|GTCAGTTTGG...GCTTTCCAAACC/CGGGAGCTGATC...CTCAG|ACG | 0 | 1 | 7.249 |
| 17676365 | GT-AG | 0 | 1.000000099473604e-05 | 278 | rna-XM_038335600.1 3555680 | 4 | 150226747 | 150227024 | Arvicola amphibius 1047088 | CTG|GTGAGGATCA...TGACTCTTGGCT/CTGAGTCTGACT...TCCAG|TAC | 0 | 1 | 9.32 |
| 17676366 | GT-AG | 0 | 1.000000099473604e-05 | 384 | rna-XM_038335600.1 3555680 | 5 | 150227174 | 150227557 | Arvicola amphibius 1047088 | CAG|GTGGGGCCCC...AAGCCCTTCACC/ACGTGTCTAAAC...CCCAG|GGA | 2 | 1 | 12.535 |
| 17676367 | GT-AG | 0 | 1.000000099473604e-05 | 119 | rna-XM_038335600.1 3555680 | 6 | 150227652 | 150227770 | Arvicola amphibius 1047088 | ATG|GTAAGGAGCT...GGCGCCTGGCCC/TGTGGCCTCATG...CACAG|GTA | 0 | 1 | 14.563 |
| 17676368 | GT-AG | 0 | 1.000000099473604e-05 | 101 | rna-XM_038335600.1 3555680 | 7 | 150227972 | 150228072 | Arvicola amphibius 1047088 | GAG|GTTAGACTGT...TCTTCTGTGATC/TCCAGCCTGATC...TGCAG|GAC | 0 | 1 | 18.9 |
| 17676369 | GT-AG | 0 | 1.000000099473604e-05 | 381 | rna-XM_038335600.1 3555680 | 8 | 150228322 | 150228702 | Arvicola amphibius 1047088 | CCA|GTGAGTGACA...TTGTCTGTGACT/TTGTCTGTGACT...TGTAG|GAG | 0 | 1 | 24.272 |
| 17676370 | GT-AG | 0 | 1.000000099473604e-05 | 1024 | rna-XM_038335600.1 3555680 | 9 | 150228783 | 150229806 | Arvicola amphibius 1047088 | CAG|GTGAGTCTGG...TGGTCCATGAGG/CTTCTGCTAACC...CTCAG|CCC | 2 | 1 | 25.998 |
| 17676371 | GT-AG | 0 | 1.000000099473604e-05 | 98 | rna-XM_038335600.1 3555680 | 10 | 150229890 | 150229987 | Arvicola amphibius 1047088 | AAG|GTACAGGGGC...CAGCCCTTCTCC/TGGCTACTCACC...ACCAG|AGG | 1 | 1 | 27.789 |
| 17676372 | GT-AG | 0 | 0.3878423537906982 | 194 | rna-XM_038335600.1 3555680 | 11 | 150230069 | 150230262 | Arvicola amphibius 1047088 | CAG|GTATCTCTTT...CAAGTCTCAATG/GCAAGTCTCAAT...CACAG|AGA | 1 | 1 | 29.536 |
| 17676373 | GT-AG | 0 | 1.000000099473604e-05 | 87 | rna-XM_038335600.1 3555680 | 12 | 150230370 | 150230456 | Arvicola amphibius 1047088 | CAG|GTGAGCACTA...TTGACCTTGTCC/CCTGGCTTGACC...CTCAG|GAG | 0 | 1 | 31.845 |
| 17676374 | GT-AG | 0 | 1.000000099473604e-05 | 234 | rna-XM_038335600.1 3555680 | 13 | 150230637 | 150230870 | Arvicola amphibius 1047088 | GAG|GTTAGTGATG...ATTACCTTGCTG/GTTAGCATTACC...CACAG|CTG | 0 | 1 | 35.728 |
| 17676375 | GT-AG | 0 | 0.0001269712198716 | 120 | rna-XM_038335600.1 3555680 | 14 | 150230964 | 150231083 | Arvicola amphibius 1047088 | AAG|GTATAGAGGT...GGCTTCTTAAAC/TGGCTTCTTAAA...TCAAG|ACC | 0 | 1 | 37.735 |
| 17676376 | GT-AG | 0 | 1.000000099473604e-05 | 84 | rna-XM_038335600.1 3555680 | 15 | 150231206 | 150231289 | Arvicola amphibius 1047088 | GGG|GTGAGCACAT...GTCTCCTTGGCC/CTGGGCGTCAGC...CCCAG|GAT | 2 | 1 | 40.367 |
| 17676377 | GT-AG | 0 | 1.000000099473604e-05 | 226 | rna-XM_038335600.1 3555680 | 16 | 150231366 | 150231591 | Arvicola amphibius 1047088 | CGG|GTGAGCAGGA...CTTCCCTCTGCG/TGGGGACTGAGC...CCCAG|CTG | 0 | 1 | 42.006 |
| 17676378 | GT-AG | 0 | 1.000000099473604e-05 | 89 | rna-XM_038335600.1 3555680 | 17 | 150231699 | 150231787 | Arvicola amphibius 1047088 | CTG|GTGGGTACTG...CAGACCATGACA/CAGACCATGACA...TTCAG|GGT | 2 | 1 | 44.315 |
| 17676379 | GT-AG | 0 | 1.000000099473604e-05 | 328 | rna-XM_038335600.1 3555680 | 18 | 150231903 | 150232230 | Arvicola amphibius 1047088 | CTG|GTGAGCCCCA...ATGTCCTTGATT/ATGTCCTTGATT...ACCAG|GAC | 0 | 1 | 46.796 |
| 17676380 | GT-AG | 0 | 1.000000099473604e-05 | 75 | rna-XM_038335600.1 3555680 | 19 | 150232380 | 150232454 | Arvicola amphibius 1047088 | ACG|GTGAGGTCAT...TTGCCCTCCTCT/CCTATGGTGAGC...CCCAG|CCG | 2 | 1 | 50.011 |
| 17676381 | GT-AG | 0 | 1.000000099473604e-05 | 279 | rna-XM_038335600.1 3555680 | 20 | 150232544 | 150232822 | Arvicola amphibius 1047088 | GGG|GTAAGTGCCC...GGTTCCTAAGCC/TGGTTCCTAAGC...TACAG|ATG | 1 | 1 | 51.931 |
| 17676382 | GT-AG | 0 | 1.000000099473604e-05 | 85 | rna-XM_038335600.1 3555680 | 21 | 150232876 | 150232960 | Arvicola amphibius 1047088 | GAG|GTAAGAACCA...GTGGCCCTGACT/TCCTCGCTCATC...CCCAG|AAG | 0 | 1 | 53.074 |
| 17676383 | GT-AG | 0 | 1.000000099473604e-05 | 85 | rna-XM_038335600.1 3555680 | 22 | 150233060 | 150233144 | Arvicola amphibius 1047088 | GGG|GTGAGAACAG...ATTCCCTTCTCA/TCCCTTCTCACT...TTCAG|ACT | 0 | 1 | 55.21 |
| 17676384 | GT-AG | 0 | 1.000000099473604e-05 | 251 | rna-XM_038335600.1 3555680 | 23 | 150233205 | 150233455 | Arvicola amphibius 1047088 | AAG|GTCAGTGGCC...GAGGCCCTGGCA/TGGGTGCTCAGA...CACAG|CCC | 0 | 1 | 56.505 |
| 17676385 | GT-AG | 0 | 1.000000099473604e-05 | 250 | rna-XM_038335600.1 3555680 | 24 | 150233562 | 150233811 | Arvicola amphibius 1047088 | ACT|GTAAGAACCC...GCTGCCTGACCT/ACCTGCTTCATC...CACAG|CCT | 1 | 1 | 58.792 |
| 17676386 | GT-AG | 0 | 1.000000099473604e-05 | 81 | rna-XM_038335600.1 3555680 | 25 | 150233946 | 150234026 | Arvicola amphibius 1047088 | TCG|GTGAGCTGTG...CAGCCCTTCATG/CAGCCCTTCATG...ACTAG|GTG | 0 | 1 | 61.683 |
| 17676387 | GT-AG | 0 | 1.000000099473604e-05 | 94 | rna-XM_038335600.1 3555680 | 26 | 150234167 | 150234260 | Arvicola amphibius 1047088 | GAG|GTCAGTGCCA...GTATCCTTTGTT/GAGGGACTCATT...CCCAG|GGA | 2 | 1 | 64.703 |
| 17676388 | GT-AG | 0 | 1.000000099473604e-05 | 697 | rna-XM_038335600.1 3555680 | 27 | 150234343 | 150235039 | Arvicola amphibius 1047088 | AGG|GTGAGTGCTC...TTGGTCGTAGCC/ACCAGCCTGACT...TGCAG|GTG | 0 | 1 | 66.472 |
| 17676389 | GT-AG | 0 | 2.867467622906824e-05 | 1739 | rna-XM_038335600.1 3555680 | 28 | 150235257 | 150236995 | Arvicola amphibius 1047088 | TTG|GTAAATCTTG...TGACCTCTGACC/CAAGGTGTCACT...CTCAG|ACA | 1 | 1 | 71.154 |
| 17676390 | GT-AG | 0 | 0.0022537473467315 | 119 | rna-XM_038335600.1 3555680 | 29 | 150237059 | 150237177 | Arvicola amphibius 1047088 | ATG|GTACCACTGG...TGCTTCTCACCC/CTGCTTCTCACC...CACAG|ACA | 1 | 1 | 72.513 |
| 17676391 | GT-AG | 0 | 1.000000099473604e-05 | 854 | rna-XM_038335600.1 3555680 | 30 | 150237778 | 150238631 | Arvicola amphibius 1047088 | GGG|GTAAGGCCGG...CCTGGCTTGCCT/CTGTTTCTCACC...CGCAG|GTT | 1 | 1 | 85.458 |
| 17676392 | GT-AG | 0 | 1.000000099473604e-05 | 73 | rna-XM_038335600.1 3555680 | 31 | 150238730 | 150238802 | Arvicola amphibius 1047088 | AAG|GTGAAGGTCT...CGTGGCTTGACC/CGTGGCTTGACC...CTCAG|GTA | 0 | 1 | 87.573 |
| 17676393 | GT-AG | 0 | 1.000000099473604e-05 | 116 | rna-XM_038335600.1 3555680 | 32 | 150238888 | 150239003 | Arvicola amphibius 1047088 | CAG|GTGAGGAAGT...AGCTCCCTGATT/AGCTCCCTGATT...CACAG|AGA | 1 | 1 | 89.407 |
| 17676394 | GT-AG | 0 | 1.000000099473604e-05 | 100 | rna-XM_038335600.1 3555680 | 33 | 150239125 | 150239224 | Arvicola amphibius 1047088 | CAG|GTGCGTACGT...TGTTCTTTACAC/CTGTTCTTTACA...TGCAG|GGA | 2 | 1 | 92.017 |
| 17676395 | GT-AG | 0 | 1.000000099473604e-05 | 205 | rna-XM_038335600.1 3555680 | 34 | 150239331 | 150239535 | Arvicola amphibius 1047088 | AGG|GTGAGTCCTC...TCTCCTCTGACA/TCTCCTCTGACA...TCCAG|GAA | 0 | 1 | 94.304 |
| 17676396 | GT-AG | 0 | 1.000000099473604e-05 | 115 | rna-XM_038335600.1 3555680 | 35 | 150239654 | 150239768 | Arvicola amphibius 1047088 | CCA|GTAAGAGCAG...GAGCCCTTAAAT/TTAAATCTAACT...TCCAG|TAG | 1 | 1 | 96.85 |
| 17676397 | GT-AG | 0 | 1.000000099473604e-05 | 1954 | rna-XM_038335600.1 3555680 | 36 | 150239858 | 150241811 | Arvicola amphibius 1047088 | CAG|GTGAGCCAGT...TTCCTGTTATTC/TCCTGTCTGACT...TCCAG|GTC | 0 | 1 | 98.77 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL
,PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);