introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
  - INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
  - TEXT, terminal dinucleotide sequences of the intron
- is_minor
  - INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score
  - REAL, score representing the probability (0-100%) of the intron being minor
- length
  - INTEGER, length of the intron in base pairs
- transcript_id
  - INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
  - INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
  - INTEGER, start position of the intron in the genome
- end
  - INTEGER, end position of the intron in the genome
- taxonomy_id
  - INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs
  - TEXT, motifs scored for the intron
- phase
  - INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds
  - INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
  - REAL, relative position of the intron within the transcript (as a percentage of coding length)
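The coordinate columns can be cross-checked against length. The following is a minimal sketch, assuming coordinates are 1-based and inclusive (so length = end - start + 1). That convention is not stated in the schema, but it matches the rows listed on this page. The in-memory table is a cut-down stand-in for the real one, holding three rows copied from the listing.

```python
import sqlite3

# Cut-down stand-in for the introns table (assumption: only the columns
# needed for the check). "end" is an SQL keyword, so it is quoted.
conn = sqlite3.connect(":memory:")
conn.execute(
    'CREATE TABLE introns ('
    '"id" INTEGER PRIMARY KEY, "length" INTEGER, "start" INTEGER, "end" INTEGER)'
)

# Three rows copied from the listing for transcript_id = 6061910.
rows = [
    (31219505, 92626, 188178597, 188271222),
    (31219530, 632, 187961368, 187961999),
    (31238745, 89940, 188274535, 188364474),
]
conn.executemany("INSERT INTO introns VALUES (?, ?, ?, ?)", rows)

# Any row whose length disagrees with inclusive coordinates would show up here.
mismatches = conn.execute(
    'SELECT id FROM introns WHERE length <> "end" - start + 1'
).fetchall()
print(mismatches)
```

An empty result means every sampled row agrees with the inclusive-coordinate assumption.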
27 rows where transcript_id = 6061910
id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
31219505 | GT-AG | 0 | 1.000000099473604e-05 | 92626 | rna-XM_030449568.1 6061910 | 2 | 188178597 | 188271222 | Calypte anna 9244 | CTG|GTGAGTTTTG...GTTTCTTTGATC/GTTTCTTTGATC...CACAG|GAG | 1 | 1 | 24.352 |
31219506 | GT-AG | 0 | 1.000000099473604e-05 | 86644 | rna-XM_030449568.1 6061910 | 3 | 188091638 | 188178281 | Calypte anna 9244 | CAG|GTAAGGAAAT...TTATCCTTTTTT/TATATTTTCAAT...CACAG|GTC | 1 | 1 | 26.627 |
31219507 | GT-AG | 0 | 1.000000099473604e-05 | 25741 | rna-XM_030449568.1 6061910 | 4 | 188065835 | 188091575 | Calypte anna 9244 | GAG|GTAAGATTAA...TGTTCCTATACA/TGTAGTCCCACT...TTCAG|GTA | 0 | 1 | 27.074 |
31219508 | GT-AG | 0 | 1.2619566752055228e-05 | 1594 | rna-XM_030449568.1 6061910 | 5 | 188063926 | 188065519 | Calypte anna 9244 | ACA|GTAAGTGCTC...CTTTTCTTTGTA/GTGATGTTCAGT...GCCAG|ATA | 0 | 1 | 29.349 |
31219509 | GT-AG | 0 | 1.000000099473604e-05 | 9354 | rna-XM_030449568.1 6061910 | 6 | 188054361 | 188063714 | Calypte anna 9244 | TCG|GTAAGTTGGG...GCTAATTTAATT/GCTAATTTAATT...ATTAG|GAG | 1 | 1 | 30.873 |
31219510 | GT-AG | 0 | 1.000000099473604e-05 | 9993 | rna-XM_030449568.1 6061910 | 7 | 188044228 | 188054220 | Calypte anna 9244 | CAG|GTAGAATATC...TTTCTTTTATTA/TTTTCTTTTATT...TATAG|GTT | 0 | 1 | 31.884 |
31219511 | GT-AG | 0 | 1.000000099473604e-05 | 1548 | rna-XM_030449568.1 6061910 | 8 | 188042404 | 188043951 | Calypte anna 9244 | ATG|GTAAGAATGC...GGAGTTTTACTA/TGGAGTTTTACT...TATAG|GTC | 0 | 1 | 33.877 |
31219512 | GT-AG | 0 | 1.000000099473604e-05 | 3040 | rna-XM_030449568.1 6061910 | 9 | 188039153 | 188042192 | Calypte anna 9244 | CAG|GTTAGGGCAG...TTGCTTCTAATA/CAATTGCTAATT...TTCAG|GAA | 1 | 1 | 35.401 |
31219513 | GT-AG | 0 | 1.000000099473604e-05 | 7069 | rna-XM_030449568.1 6061910 | 10 | 188028010 | 188035078 | Calypte anna 9244 | CAG|GTAAGAAACA...TGTGTGTTATCT/TTGTGTGTTATC...TTCAG|GAG | 1 | 1 | 64.823 |
31219514 | GT-AG | 0 | 1.000000099473604e-05 | 1108 | rna-XM_030449568.1 6061910 | 11 | 188026705 | 188027812 | Calypte anna 9244 | CAG|GTGAGTTGGC...ATTTCTTTTTTT/AATACTGTTACA...TTTAG|GTT | 0 | 1 | 66.245 |
31219515 | GT-AG | 0 | 1.000000099473604e-05 | 1386 | rna-XM_030449568.1 6061910 | 12 | 188025165 | 188026550 | Calypte anna 9244 | ATG|GTAAGACAAC...CTACTTTTAATG/CTTTTAATGATT...TTTAG|GTG | 1 | 1 | 67.358 |
31219516 | GT-AG | 0 | 1.000000099473604e-05 | 12565 | rna-XM_030449568.1 6061910 | 13 | 188012366 | 188024930 | Calypte anna 9244 | TAG|GTAAGTGAGC...GTTGACTTATTT/ATTTATTTCATT...TTCAG|GTG | 1 | 1 | 69.047 |
31219517 | GT-AG | 0 | 1.000000099473604e-05 | 2955 | rna-XM_030449568.1 6061910 | 14 | 188009021 | 188011975 | Calypte anna 9244 | CAG|GTTTGTAAAG...CCTTCCTTGCTT/TGGCTACTGACA...CAAAG|GAG | 1 | 1 | 71.864 |
31219518 | GT-AG | 0 | 1.000000099473604e-05 | 666 | rna-XM_030449568.1 6061910 | 15 | 188008140 | 188008805 | Calypte anna 9244 | ATG|GTGAGCGAGG...GATGGCTTAATA/CTAATTTTCACT...AAAAG|TTG | 0 | 1 | 73.417 |
31219519 | GT-AG | 0 | 1.000000099473604e-05 | 2674 | rna-XM_030449568.1 6061910 | 16 | 188005328 | 188008001 | Calypte anna 9244 | AGG|GTAAGATATG...ATTTCGTTATTA/AATGTTCTCACC...TGCAG|ATA | 0 | 1 | 74.413 |
31219520 | GT-AG | 0 | 0.0001626347772662 | 520 | rna-XM_030449568.1 6061910 | 17 | 188004664 | 188005183 | Calypte anna 9244 | CAA|GTAAGTTTAT...TTTTCTTTATAC/CTTTTCTTTATA...TAAAG|GAA | 0 | 1 | 75.453 |
31219521 | GT-AG | 0 | 0.0001493391801537 | 3327 | rna-XM_030449568.1 6061910 | 18 | 188001139 | 188004465 | Calypte anna 9244 | CAG|GTATGGCTTG...TTATTTTTATTT/TTTATTTTTATT...ATTAG|GCA | 0 | 1 | 76.883 |
31219522 | GT-AG | 0 | 1.000000099473604e-05 | 8369 | rna-XM_030449568.1 6061910 | 19 | 187991971 | 188000339 | Calypte anna 9244 | ATG|GTGAGTCCTG...AGTATTTTAACT/AGTATTTTAACT...TGCAG|GAG | 1 | 1 | 82.653 |
31219523 | GT-AG | 0 | 1.000000099473604e-05 | 1001 | rna-XM_030449568.1 6061910 | 20 | 187990835 | 187991835 | Calypte anna 9244 | CAG|GTGGGTGTCA...TGTGTCTTTTCA/AGTTATTTCATG...TTCAG|GCC | 1 | 1 | 83.628 |
31219524 | GT-AG | 0 | 1.000000099473604e-05 | 7951 | rna-XM_030449568.1 6061910 | 21 | 187982726 | 187990676 | Calypte anna 9244 | AAG|GTAATTAAAA...AGCACTTTGATA/TCTTTTTCCATC...TGTAG|ATC | 0 | 1 | 84.769 |
31219525 | GT-AG | 0 | 1.000000099473604e-05 | 9448 | rna-XM_030449568.1 6061910 | 22 | 187972809 | 187982256 | Calypte anna 9244 | GTG|GTAAAGATTT...GTGTCCTTCACC/CTCTGATTTATT...CCCAG|GCT | 1 | 1 | 88.156 |
31219526 | GT-AG | 0 | 1.000000099473604e-05 | 1579 | rna-XM_030449568.1 6061910 | 23 | 187971076 | 187972654 | Calypte anna 9244 | AAC|GTAAGTGACC...TGTTTCTTTTTT/TTGCTAGGCATT...TCAAG|GTG | 2 | 1 | 89.268 |
31219527 | GT-AG | 0 | 3.388910097489049e-05 | 2130 | rna-XM_030449568.1 6061910 | 24 | 187968290 | 187970419 | Calypte anna 9244 | AAA|GTAAGTATTG...TTTGCTTTGTTT/TAAGTATGTATA...TGAAG|GTA | 1 | 1 | 94.006 |
31219528 | GT-AG | 0 | 1.000000099473604e-05 | 3528 | rna-XM_030449568.1 6061910 | 25 | 187964648 | 187968175 | Calypte anna 9244 | ATG|GTAAGAAATG...CTCCCTTTATTT/CCTCCCTTTATT...GCAAG|CTT | 1 | 1 | 94.829 |
31219529 | GT-AG | 0 | 6.439440334233942e-05 | 2552 | rna-XM_030449568.1 6061910 | 26 | 187962036 | 187964587 | Calypte anna 9244 | AAG|GTATTTGAAT...GTTTTTTTCACT/GTTTTTTTCACT...GACAG|TGA | 1 | 1 | 95.263 |
31219530 | GT-AG | 0 | 1.000000099473604e-05 | 632 | rna-XM_030449568.1 6061910 | 27 | 187961368 | 187961999 | Calypte anna 9244 | GAG|GTGATTTTTC...GCAGCCCTACCT/CCTACCCTCACA...TGCAG|CCT | 1 | 1 | 95.522 |
31238745 | GT-AG | 0 | 1.000000099473604e-05 | 89940 | rna-XM_030449568.1 6061910 | 1 | 188274535 | 188364474 | Calypte anna 9244 | CCG|GTGCGTACCC...TGTCTCTTGACC/CTCTTTTTCATT...CACAG|GGT | | 0 | 0.556 |
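The listing is sorted by id, which places the first intron (ordinal_index 1) on the last row. Below is a minimal sketch of reading the introns in transcript order instead, using an in-memory stand-in table with four rows copied from the listing. The `descending` check is only an observation on these sample rows: start coordinates fall as ordinal_index rises, which would be expected for a transcript annotated on the reverse strand, though the table itself does not record strand.

```python
import sqlite3

# Cut-down stand-in for the introns table (assumption: only the columns
# needed to show ordering). "end" is an SQL keyword, so it is quoted.
conn = sqlite3.connect(":memory:")
conn.execute(
    'CREATE TABLE introns ('
    '"id" INTEGER PRIMARY KEY, "transcript_id" INTEGER, '
    '"ordinal_index" INTEGER, "start" INTEGER, "end" INTEGER)'
)

# Four rows copied from the listing, in the id order shown on the page.
sample = [
    (31219505, 6061910, 2, 188178597, 188271222),
    (31219506, 6061910, 3, 188091638, 188178281),
    (31219530, 6061910, 27, 187961368, 187961999),
    (31238745, 6061910, 1, 188274535, 188364474),
]
conn.executemany("INSERT INTO introns VALUES (?, ?, ?, ?, ?)", sample)

# Order by position within the transcript rather than by id.
ordered = conn.execute(
    "SELECT ordinal_index, start FROM introns "
    "WHERE transcript_id = ? ORDER BY ordinal_index",
    (6061910,),
).fetchall()

ordinals = [o for o, _ in ordered]
starts = [s for _, s in ordered]
# True when each successive intron starts at a lower genomic coordinate.
descending = all(a > b for a, b in zip(starts, starts[1:]))
print(ordinals, descending)
```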
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id] ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase] ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor] ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair] ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score] ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds] ON [introns] ([in_cds]);
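The index on transcript_id is what a filter such as `transcript_id = 6061910` would be expected to use. The following is a minimal sketch that recreates the table and that one index in an empty in-memory SQLite database and asks the query planner for its plan. The exact wording of the plan text varies between SQLite versions, so the sketch only checks whether the index name appears in it.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Table definition and one index taken from the schema; the referenced
# transcripts and genomes tables are not created here, which SQLite permits
# because foreign keys are not enforced by default.
conn.executescript(
    """
    CREATE TABLE "introns" (
      "id" INTEGER, "dinucleotide_pair" TEXT, "is_minor" INTEGER,
      "score" REAL, "length" INTEGER, "transcript_id" INTEGER,
      "ordinal_index" INTEGER, "start" INTEGER, "end" INTEGER,
      "taxonomy_id" INTEGER, "scored_motifs" TEXT, "phase" INTEGER,
      "in_cds" INTEGER, "relative_position" REAL,
      PRIMARY KEY ([id]),
      FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
      FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
    );
    CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
    """
)

# Each plan row is (id, parent, notused, detail); keep the detail text.
plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT * FROM introns WHERE transcript_id = ?",
    (6061910,),
).fetchall()
plan_text = " ".join(row[3] for row in plan)
uses_index = "idx_introns_transcript_id" in plan_text
print(plan_text)
```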