introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
36 rows where transcript_id = 3555670
This data as json, CSV (advanced)
Suggested facets: dinucleotide_pair, score, phase
| id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 17676098 | GT-AG | 0 | 1.000000099473604e-05 | 1270 | rna-XM_038317984.1 3555670 | 1 | 187576150 | 187577419 | Arvicola amphibius 1047088 | GAG|GTGCGACATC...GCTCTCTTTTCT/ATTACACTGATG...CAAAG|ATG | 2 | 1 | 1.325 |
| 17676099 | GT-AG | 0 | 0.0102858331170586 | 4622 | rna-XM_038317984.1 3555670 | 2 | 187571274 | 187575895 | Arvicola amphibius 1047088 | AAG|GTATCTGGGA...AAGTTCTTGTCA/TGTGAGTTCATC...TTTAG|GCA | 1 | 1 | 6.504 |
| 17676100 | GT-AG | 0 | 1.000000099473604e-05 | 1960 | rna-XM_038317984.1 3555670 | 3 | 187569217 | 187571176 | Arvicola amphibius 1047088 | CTG|GTAAGTCAGT...TTTTCCTCGAAT/AACAAACTAATT...CTAAG|GAA | 2 | 1 | 8.481 |
| 17676101 | GT-AG | 0 | 1.000000099473604e-05 | 6426 | rna-XM_038317984.1 3555670 | 4 | 187562623 | 187569048 | Arvicola amphibius 1047088 | GAG|GTGGGGACCG...GTCTCCTTTATC/GTCTCCTTTATC...TCCAG|GTT | 2 | 1 | 11.906 |
| 17676102 | GT-AG | 0 | 0.0011356348009758 | 5885 | rna-XM_038317984.1 3555670 | 5 | 187556587 | 187562471 | Arvicola amphibius 1047088 | AGG|GTAATTTTCT...TGGGCCTTGACC/GCTGTCTTAACA...CGTAG|CCG | 0 | 1 | 14.985 |
| 17676103 | GT-AG | 0 | 0.0001121117739768 | 4624 | rna-XM_038317984.1 3555670 | 6 | 187551803 | 187556426 | Arvicola amphibius 1047088 | TGG|GTAAGCAACG...GCTCTCTTAATT/TGATGTCTAATT...CAAAG|ATG | 1 | 1 | 18.247 |
| 17676104 | GT-AG | 0 | 1.000000099473604e-05 | 1971 | rna-XM_038317984.1 3555670 | 7 | 187549764 | 187551734 | Arvicola amphibius 1047088 | ATT|GTAATGTATT...TTTGTCTCACTG/GTTTGTCTCACT...TGTAG|GAT | 0 | 1 | 19.633 |
| 17676105 | GT-AG | 0 | 0.0002628616962031 | 2613 | rna-XM_038317984.1 3555670 | 8 | 187547019 | 187549631 | Arvicola amphibius 1047088 | AAG|GTACATTATT...TATGCATTAATT/TATGCATTAATT...TTCAG|GGG | 0 | 1 | 22.324 |
| 17676106 | GT-AG | 0 | 1.000000099473604e-05 | 1537 | rna-XM_038317984.1 3555670 | 9 | 187545365 | 187546901 | Arvicola amphibius 1047088 | CTG|GTAAGAAAGG...ATGGTCCTAACG/ATGGTCCTAACG...CGCAG|ACG | 0 | 1 | 24.709 |
| 17676107 | GT-AG | 0 | 2.4397547383970047e-05 | 2552 | rna-XM_038317984.1 3555670 | 10 | 187542732 | 187545283 | Arvicola amphibius 1047088 | CCT|GTAAGTGTAC...CGTGTCTTATAC/ACGTGTCTTATA...AACAG|GCA | 0 | 1 | 26.361 |
| 17676108 | GT-AG | 0 | 0.000904591562505 | 6492 | rna-XM_038317984.1 3555670 | 11 | 187536091 | 187542582 | Arvicola amphibius 1047088 | GAT|GTAAGCATCT...GATTTCTTCTCT/CCCCAGCTGATG...CCTAG|CTA | 2 | 1 | 29.399 |
| 17676109 | GT-AG | 0 | 0.0001457363442624 | 1849 | rna-XM_038317984.1 3555670 | 12 | 187534138 | 187535986 | Arvicola amphibius 1047088 | CTG|GTACGCCCTA...TGTGGTTTAAAA/TGTGGTTTAAAA...TGCAG|AGA | 1 | 1 | 31.519 |
| 17676110 | GT-AG | 0 | 1.000000099473604e-05 | 1363 | rna-XM_038317984.1 3555670 | 13 | 187532634 | 187533996 | Arvicola amphibius 1047088 | AAG|GTGACACAAA...CTCCTCTTTTCC/TGAAATCTCAGG...CTTAG|TTC | 1 | 1 | 34.393 |
| 17676111 | GT-AG | 0 | 0.0068731778988435 | 1196 | rna-XM_038317984.1 3555670 | 14 | 187531265 | 187532460 | Arvicola amphibius 1047088 | AAG|GTATGCCAGA...TTTGCTTTGATT/TTTGCTTTGATT...GGCAG|AAT | 0 | 1 | 37.92 |
| 17676112 | GT-AG | 0 | 0.0001670069790019 | 1653 | rna-XM_038317984.1 3555670 | 15 | 187529462 | 187531114 | Arvicola amphibius 1047088 | CTG|GTAAATTTTA...TTATCCCTAAAA/ATTGCATTTATA...TGGAG|GAA | 0 | 1 | 40.979 |
| 17676113 | GT-AG | 0 | 5.9990847973600334e-05 | 1972 | rna-XM_038317984.1 3555670 | 16 | 187527359 | 187529330 | Arvicola amphibius 1047088 | GAA|GTACGTATGG...TCTCTCTCGCTC/CTCTCTCTCTCT...ATTAG|GCG | 2 | 1 | 43.649 |
| 17676114 | GT-AG | 0 | 0.0106593650927014 | 2143 | rna-XM_038317984.1 3555670 | 17 | 187525062 | 187527204 | Arvicola amphibius 1047088 | AAT|GTAGCTAGCT...GTCTCCTTCGAA/ATCTCTCTTAGA...AAAAG|CTG | 0 | 1 | 46.789 |
| 17676115 | GT-AG | 0 | 0.0019140703631896 | 1323 | rna-XM_038317984.1 3555670 | 18 | 187523658 | 187524980 | Arvicola amphibius 1047088 | GAG|GTATTTGTTT...CCCGCCTTACTA/GCCTTACTAAGT...TTCAG|GCC | 0 | 1 | 48.44 |
| 17676116 | GT-AG | 0 | 1.000000099473604e-05 | 1275 | rna-XM_038317984.1 3555670 | 19 | 187522305 | 187523579 | Arvicola amphibius 1047088 | ACT|GTAAGTGAAT...CCTCTCTTCACC/CCTCTCTTCACC...TTCAG|ATT | 0 | 1 | 50.031 |
| 17676117 | GT-AG | 0 | 1.520477071173857e-05 | 1810 | rna-XM_038317984.1 3555670 | 20 | 187520369 | 187522178 | Arvicola amphibius 1047088 | AAG|GTATGAAAGG...TTCATTTTGATC/CATAGTTTCATT...TCAAG|ATA | 0 | 1 | 52.599 |
| 17676118 | GT-AG | 0 | 1.000000099473604e-05 | 998 | rna-XM_038317984.1 3555670 | 21 | 187519266 | 187520263 | Arvicola amphibius 1047088 | AAG|GTAATGCACA...TCTGTCTTGGTT/ATGAACTTGATC...CCGAG|TGC | 0 | 1 | 54.74 |
| 17676119 | GT-AG | 0 | 0.1485441456047225 | 1473 | rna-XM_038317984.1 3555670 | 22 | 187517676 | 187519148 | Arvicola amphibius 1047088 | AAG|GTACCCCTTT...GAATTCTGATTG/GGAATTCTGATT...CTTAG|CTG | 0 | 1 | 57.125 |
| 17676120 | GT-AG | 0 | 1.000000099473604e-05 | 1465 | rna-XM_038317984.1 3555670 | 23 | 187515980 | 187517444 | Arvicola amphibius 1047088 | AAA|GTGAGTGGGT...AACCTATTGATA/AACCTATTGATA...TTCAG|GAT | 0 | 1 | 61.835 |
| 17676121 | GT-AG | 0 | 1.000000099473604e-05 | 1917 | rna-XM_038317984.1 3555670 | 24 | 187513856 | 187515772 | Arvicola amphibius 1047088 | GAG|GTACAGGCCA...TCTATTTTAATT/TCTATTTTAATT...CCCAG|GCC | 0 | 1 | 66.055 |
| 17676122 | GT-AG | 0 | 1.000000099473604e-05 | 1244 | rna-XM_038317984.1 3555670 | 25 | 187512531 | 187513774 | Arvicola amphibius 1047088 | ATG|GTGAGCTACT...GCTCCATTGATG/ATGCAGCTTATG...TCCAG|GTG | 0 | 1 | 67.706 |
| 17676123 | GT-AG | 0 | 1.000000099473604e-05 | 3171 | rna-XM_038317984.1 3555670 | 26 | 187509234 | 187512404 | Arvicola amphibius 1047088 | AAG|GTTAGAAGAA...AAACATTTAATT/CATTTAATTATT...TTTAG|TCT | 0 | 1 | 70.275 |
| 17676124 | GT-AG | 0 | 2.530879072916893e-05 | 2050 | rna-XM_038317984.1 3555670 | 27 | 187507064 | 187509113 | Arvicola amphibius 1047088 | CAG|GTACATGAAC...TTCCTCTTAATA/CTTCCTCTTAAT...CACAG|GAG | 0 | 1 | 72.722 |
| 17676125 | GT-AG | 0 | 1.000000099473604e-05 | 1378 | rna-XM_038317984.1 3555670 | 28 | 187505578 | 187506955 | Arvicola amphibius 1047088 | AAA|GTAAGTCCGG...TTCCAATTAACT/TTCCAATTAACT...TGTAG|GTC | 0 | 1 | 74.924 |
| 17676126 | GT-AG | 0 | 1.000000099473604e-05 | 671 | rna-XM_038317984.1 3555670 | 29 | 187504820 | 187505490 | Arvicola amphibius 1047088 | AAA|GTGAGTCACG...CAATCGTGGACA/GATGTGTTTATC...CTTAG|ATT | 0 | 1 | 76.697 |
| 17676127 | GT-AG | 0 | 0.0001067086571348 | 2427 | rna-XM_038317984.1 3555670 | 30 | 187502304 | 187504730 | Arvicola amphibius 1047088 | AAG|GTTTGTTTCG...CTTTTTTTGCTA/AGGTTGATGATC...ACCAG|GAT | 2 | 1 | 78.512 |
| 17676128 | GT-AG | 0 | 0.0008086057596178 | 1029 | rna-XM_038317984.1 3555670 | 31 | 187501061 | 187502089 | Arvicola amphibius 1047088 | AAA|GTATGTAACT...CAAGCCCTGACA/CAAGCCCTGACA...ACCAG|GTC | 0 | 1 | 82.875 |
| 17676129 | GT-AG | 0 | 5.979206481203824e-05 | 1137 | rna-XM_038317984.1 3555670 | 32 | 187499811 | 187500947 | Arvicola amphibius 1047088 | CCG|GTAATCCCAA...TCTCTTTTGAAG/TGAAGACTAAAT...TTTAG|ATT | 2 | 1 | 85.178 |
| 17676130 | GT-AG | 0 | 1.000000099473604e-05 | 3604 | rna-XM_038317984.1 3555670 | 33 | 187496047 | 187499650 | Arvicola amphibius 1047088 | CGG|GTAATTATAG...CAGCTCTGAATA/TCAGCTCTGAAT...TTCAG|GCT | 0 | 1 | 88.44 |
| 17676131 | GC-AG | 0 | 1.000000099473604e-05 | 1547 | rna-XM_038317984.1 3555670 | 34 | 187494335 | 187495881 | Arvicola amphibius 1047088 | AAG|GCAAGGCTCC...TTATCTTTACAT/ATTATCTTTACA...TAAAG|TAC | 0 | 1 | 91.804 |
| 17676132 | GT-AG | 0 | 1.000000099473604e-05 | 544 | rna-XM_038317984.1 3555670 | 35 | 187493710 | 187494253 | Arvicola amphibius 1047088 | AAG|GTCAGTCTTA...TTTTTTTCAAAT/CTTTTTTTCAAA...TTCAG|ACT | 0 | 1 | 93.456 |
| 17676133 | GT-AG | 0 | 1.000000099473604e-05 | 175 | rna-XM_038317984.1 3555670 | 36 | 187493381 | 187493555 | Arvicola amphibius 1047088 | TGG|GTGAGCGTTT...TAATTCTTTTCT/ACTCAACTAATT...ATTAG|GGT | 1 | 1 | 96.595 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL
,PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);