introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
35 rows where transcript_id = 27368757
This data as json, CSV (advanced)
Suggested facets: phase
| id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 152380437 | GT-AG | 0 | 1.000000099473604e-05 | 5547 | rna-XM_032606435.1 27368757 | 1 | 2148225 | 2153771 | Phocoena sinus 42100 | CAT|GTGAGGAGCG...GTGTTGGTGACA/AGAGTGCTGACC...CCCAG|GCC | 2 | 1 | 2.588 |
| 152380438 | GT-AG | 0 | 1.000000099473604e-05 | 5437 | rna-XM_032606435.1 27368757 | 2 | 2142653 | 2148089 | Phocoena sinus 42100 | CAG|GTGCGTGCAG...TTTCCCTTCTCT/TCGACGGTAACG...CCTAG|AAC | 2 | 1 | 5.6 |
| 152380439 | GT-AG | 0 | 1.000000099473604e-05 | 10663 | rna-XM_032606435.1 27368757 | 3 | 2131880 | 2142542 | Phocoena sinus 42100 | CCT|GTGAGTGCCC...TTCTCCCTCCCT/CCCCTCCTCACG...CACAG|CCC | 1 | 1 | 8.054 |
| 152380440 | GT-AG | 0 | 1.000000099473604e-05 | 45461 | rna-XM_032606435.1 27368757 | 4 | 2086314 | 2131774 | Phocoena sinus 42100 | ACG|GTGAGTGGCA...TGGCTGTTGACC/TGGCTGTTGACC...TGCAG|ACG | 1 | 1 | 10.397 |
| 152380441 | GT-AG | 0 | 1.000000099473604e-05 | 8527 | rna-XM_032606435.1 27368757 | 5 | 2077664 | 2086190 | Phocoena sinus 42100 | TGG|GTAAGCACCC...TCGTCCTGGGCT/CCTGGGCTGACC...CCCAG|CCA | 1 | 1 | 13.141 |
| 152380442 | GT-AG | 0 | 1.000000099473604e-05 | 910 | rna-XM_032606435.1 27368757 | 6 | 2076628 | 2077537 | Phocoena sinus 42100 | TCC|GTGAGTGCCA...CATCTCTTGCTT/TTTGCCTCCATG...CGTAG|GGA | 1 | 1 | 15.953 |
| 152380443 | GT-AG | 0 | 1.000000099473604e-05 | 1322 | rna-XM_032606435.1 27368757 | 7 | 2075183 | 2076504 | Phocoena sinus 42100 | AAG|GTAAGACGCC...CCGCCCATGTCT/CTCAGGCCAATG...CCCAG|ATG | 1 | 1 | 18.697 |
| 152380444 | GT-AG | 0 | 1.000000099473604e-05 | 349 | rna-XM_032606435.1 27368757 | 8 | 2074711 | 2075059 | Phocoena sinus 42100 | ACC|GTGAGTGGAC...CAGCCCCTGTCC/CGGGGTGTCACC...CGCAG|GGA | 1 | 1 | 21.441 |
| 152380445 | GT-AG | 0 | 1.000000099473604e-05 | 6418 | rna-XM_032606435.1 27368757 | 9 | 2068155 | 2074572 | Phocoena sinus 42100 | TTG|GTGCGTGCCA...GGCACCTGGGTG/CTGGGTGCCAGC...TGCAG|ATG | 1 | 1 | 24.52 |
| 152380446 | GT-AG | 0 | 1.000000099473604e-05 | 322 | rna-XM_032606435.1 27368757 | 10 | 2067713 | 2068034 | Phocoena sinus 42100 | AGG|GTGAGCGGTA...GAGACCCCGACC/GAGACCCCGACC...CGCAG|ACG | 1 | 1 | 27.198 |
| 152380447 | GT-AG | 0 | 1.000000099473604e-05 | 512 | rna-XM_032606435.1 27368757 | 11 | 2067078 | 2067589 | Phocoena sinus 42100 | GCC|GTGAGTCTCT...CCCTCTGTGCTC/CAGGGGCTCAGG...CCCAG|CCC | 1 | 1 | 29.942 |
| 152380448 | GT-AG | 0 | 1.000000099473604e-05 | 361 | rna-XM_032606435.1 27368757 | 12 | 2066552 | 2066912 | Phocoena sinus 42100 | TTG|GTGAGAGGTC...AGGCCCTTCTCC/CAGGCTCTGAGG...CCCAG|TCT | 1 | 1 | 33.623 |
| 152380449 | GT-AG | 0 | 1.000000099473604e-05 | 738 | rna-XM_032606435.1 27368757 | 13 | 2065682 | 2066419 | Phocoena sinus 42100 | AGG|GTGAGGCTGC...TTGTGCTTCTCT/GGGCCAGTGACC...TGCAG|CTT | 1 | 1 | 36.568 |
| 152380450 | GT-AG | 0 | 1.000000099473604e-05 | 1625 | rna-XM_032606435.1 27368757 | 14 | 2063928 | 2065552 | Phocoena sinus 42100 | ACG|GTGGGTGCAG...GGGGCCATGTCC/TGGGCAGTCAAG...CCCAG|GCT | 1 | 1 | 39.447 |
| 152380451 | GT-AG | 0 | 1.000000099473604e-05 | 623 | rna-XM_032606435.1 27368757 | 15 | 2063176 | 2063798 | Phocoena sinus 42100 | TGG|GTGAGTCCCA...GGCCCCCTCACG/GGCCCCCTCACG...TGCAG|CCT | 1 | 1 | 42.325 |
| 152380452 | GT-AG | 0 | 1.000000099473604e-05 | 76 | rna-XM_032606435.1 27368757 | 16 | 2062965 | 2063040 | Phocoena sinus 42100 | ACG|GTGAGTGCAG...CCCCTCTCACCC/TCCCCTCTCACC...TGCAG|AGT | 1 | 1 | 45.337 |
| 152380453 | GT-AG | 0 | 1.000000099473604e-05 | 2026 | rna-XM_032606435.1 27368757 | 17 | 2060804 | 2062829 | Phocoena sinus 42100 | AAG|GTGAGCGACC...GAGGCCCCGACC/TGGTGGCTGAGG...CGTAG|AGT | 1 | 1 | 48.349 |
| 152380454 | GT-AG | 0 | 1.000000099473604e-05 | 706 | rna-XM_032606435.1 27368757 | 18 | 2059972 | 2060677 | Phocoena sinus 42100 | CAG|GTGAGTGGCA...CTGGCTGTGGTG/GGGAGACTGACT...GCTAG|ATT | 1 | 1 | 51.16 |
| 152380455 | GT-AG | 0 | 1.000000099473604e-05 | 164 | rna-XM_032606435.1 27368757 | 19 | 2059676 | 2059839 | Phocoena sinus 42100 | ACA|GTGAGTGCCG...TGTCCCGTGTCC/AGTGCGCTCACC...TGCAG|CTT | 1 | 1 | 54.105 |
| 152380456 | GT-AG | 0 | 1.000000099473604e-05 | 127 | rna-XM_032606435.1 27368757 | 20 | 2059420 | 2059546 | Phocoena sinus 42100 | GAG|GTCGGCGGCC...GTCACCTGCACA/GCTGTGGTCACC...TGCAG|CCT | 1 | 1 | 56.983 |
| 152380457 | GT-AG | 0 | 1.000000099473604e-05 | 488 | rna-XM_032606435.1 27368757 | 21 | 2058800 | 2059287 | Phocoena sinus 42100 | AGC|GTGAGTCCCG...GTCCCCCCAGCT/CCCCAGCTGATG...CTCAG|GGT | 1 | 1 | 59.929 |
| 152380458 | GT-AG | 0 | 1.000000099473604e-05 | 89 | rna-XM_032606435.1 27368757 | 22 | 2058582 | 2058670 | Phocoena sinus 42100 | GCG|GTGAGCCAGG...GGACCCATGACC/TCTCCCATCACC...TGCAG|CCT | 1 | 1 | 62.807 |
| 152380459 | GT-AG | 0 | 1.000000099473604e-05 | 187 | rna-XM_032606435.1 27368757 | 23 | 2058266 | 2058452 | Phocoena sinus 42100 | AGA|GTGCGTGAGG...CGGCCCCTGACC/CGGCCCCTGACC...CGCAG|CCT | 1 | 1 | 65.685 |
| 152380460 | GT-AG | 0 | 1.000000099473604e-05 | 373 | rna-XM_032606435.1 27368757 | 24 | 2057764 | 2058136 | Phocoena sinus 42100 | AGG|GTAAGCCCTG...GGTCACTTCTCT/CTGCAGGTCACT...CGCAG|CCT | 1 | 1 | 68.563 |
| 152380461 | GT-AG | 0 | 1.000000099473604e-05 | 280 | rna-XM_032606435.1 27368757 | 25 | 2057355 | 2057634 | Phocoena sinus 42100 | AAG|GTGAGCGCCG...GGACCCTCACCG/TGGACCCTCACC...CCCAG|CCT | 1 | 1 | 71.441 |
| 152380462 | GT-AG | 0 | 1.000000099473604e-05 | 389 | rna-XM_032606435.1 27368757 | 26 | 2056828 | 2057216 | Phocoena sinus 42100 | GCC|GTGAGTGGGG...GGGCCCTTCACA/GGGCCCTTCACA...TGCAG|CCT | 1 | 1 | 74.52 |
| 152380463 | GT-AG | 0 | 1.000000099473604e-05 | 407 | rna-XM_032606435.1 27368757 | 27 | 2056292 | 2056698 | Phocoena sinus 42100 | AGG|GTAGGTGACC...TGGCCCCTGGCA/GGCTTTGTGAGC...TGCAG|GGT | 1 | 1 | 77.398 |
| 152380464 | GT-AG | 0 | 1.000000099473604e-05 | 533 | rna-XM_032606435.1 27368757 | 28 | 2055630 | 2056162 | Phocoena sinus 42100 | TCG|GTGAGTGGCC...CCTGCCTGGAGC/CACCCTCCCATT...GGCAG|CCT | 1 | 1 | 80.277 |
| 152380465 | GT-AG | 0 | 1.000000099473604e-05 | 1003 | rna-XM_032606435.1 27368757 | 29 | 2054498 | 2055500 | Phocoena sinus 42100 | ACG|GTGAGAGCCT...GAGCTCTGAGCA/AGAGCTCTGAGC...CCCAG|GTT | 1 | 1 | 83.155 |
| 152380466 | GT-AG | 0 | 1.000000099473604e-05 | 281 | rna-XM_032606435.1 27368757 | 30 | 2054088 | 2054368 | Phocoena sinus 42100 | TGG|GTGAGTGTCT...TCTCCCGTGGCA/AGGAGGCTGACC...CCCAG|CCT | 1 | 1 | 86.033 |
| 152380467 | GT-AG | 0 | 1.000000099473604e-05 | 233 | rna-XM_032606435.1 27368757 | 31 | 2053726 | 2053958 | Phocoena sinus 42100 | ACC|GTGAGTCGGG...CTTTGCTCACTC/CCTTTGCTCACT...CTCAG|CCT | 1 | 1 | 88.911 |
| 152380468 | GT-AG | 0 | 1.000000099473604e-05 | 87 | rna-XM_032606435.1 27368757 | 32 | 2053510 | 2053596 | Phocoena sinus 42100 | AGG|GTGAGTGGTC...TGATGCTAAGCC/GTGATGCTAAGC...ACCAG|GGT | 1 | 1 | 91.789 |
| 152380469 | GT-AG | 0 | 1.000000099473604e-05 | 584 | rna-XM_032606435.1 27368757 | 33 | 2052797 | 2053380 | Phocoena sinus 42100 | TGG|GTGAGTCTGG...TCTCCCGTGTCT/GTGGCACTCAGG...CTCAG|GCT | 1 | 1 | 94.668 |
| 152380470 | GT-AG | 0 | 1.000000099473604e-05 | 1377 | rna-XM_032606435.1 27368757 | 34 | 2051291 | 2052667 | Phocoena sinus 42100 | AGG|GTGAGTCAGG...TTTCTGTTGCCC/CTGTTGCCCACC...TTCAG|GTG | 1 | 1 | 97.546 |
| 152380471 | GT-AG | 0 | 1.000000099473604e-05 | 322 | rna-XM_032606435.1 27368757 | 35 | 2050921 | 2051242 | Phocoena sinus 42100 | CAG|GTAAGGGGGC...GTCTCCTGCCCC/CCCGGGGTGACC...TTCAG|TGG | 1 | 1 | 98.617 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL
,PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);