introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
22 rows where transcript_id = 9114876
This data as json, CSV (advanced)
Suggested facets: score, phase
id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
49389507 | GT-AG | 0 | 1.000000099473604e-05 | 406 | rna-gnl|WGS:VYZG|COLPIC_R00056_mrna 9114876 | 1 | 2439136 | 2439541 | Columbina picui 115618 | CCG|GTGAGGGGGA...CAACTCTTTGCA/CCTGGCCCAACT...GGCAG|GAC | 1 | 1 | 4.98 |
49389508 | GT-AG | 0 | 1.000000099473604e-05 | 504 | rna-gnl|WGS:VYZG|COLPIC_R00056_mrna 9114876 | 2 | 2438484 | 2438987 | Columbina picui 115618 | GAT|GTGAGTACAG...TCCCCCTTCTCT/CTAGTGCTCAGC...TGCAG|ACG | 2 | 1 | 9.502 |
49389509 | GT-AG | 0 | 1.000000099473604e-05 | 133 | rna-gnl|WGS:VYZG|COLPIC_R00056_mrna 9114876 | 5 | 2437470 | 2437602 | Columbina picui 115618 | CCT|GTGAGTGCCC...CTGTCCCCAGCG/CGGTGGGTCACC...CACAG|GCA | 2 | 1 | 20.593 |
49389510 | GT-AG | 0 | 0.0023654856828465 | 585 | rna-gnl|WGS:VYZG|COLPIC_R00056_mrna 9114876 | 6 | 2436706 | 2437290 | Columbina picui 115618 | CCC|GTCCCTGCCA...CTGTCCTCACCG/GCTGTCCTCACC...TGCAG|GGA | 1 | 1 | 26.062 |
49389511 | GT-AG | 0 | 1.000000099473604e-05 | 112 | rna-gnl|WGS:VYZG|COLPIC_R00056_mrna 9114876 | 7 | 2436449 | 2436560 | Columbina picui 115618 | GCT|GTAAGAACCT...ATCTCCTGAGCT/AGGCCTTTCACA...CCCAG|GCG | 2 | 1 | 30.492 |
49389512 | GT-AG | 0 | 1.000000099473604e-05 | 228 | rna-gnl|WGS:VYZG|COLPIC_R00056_mrna 9114876 | 8 | 2436146 | 2436373 | Columbina picui 115618 | AAT|GTGAGTATTG...CCTGCCTTCCCC/GAGTCCCCCATC...TGTAG|CAA | 2 | 1 | 32.783 |
49389513 | GT-AG | 0 | 1.000000099473604e-05 | 284 | rna-gnl|WGS:VYZG|COLPIC_R00056_mrna 9114876 | 9 | 2435718 | 2436001 | Columbina picui 115618 | CCT|GTGAGTTCCC...ATGCCCTCATCC/GATGCCCTCATC...TGCAG|GAT | 2 | 1 | 37.183 |
49389514 | GT-AG | 0 | 0.0233790065333449 | 122 | rna-gnl|WGS:VYZG|COLPIC_R00056_mrna 9114876 | 10 | 2435452 | 2435573 | Columbina picui 115618 | ACT|GTATGTCTCA...GATGCCTTTGTG/TGGCTGCTCAAG...TGTAG|GAA | 2 | 1 | 41.583 |
49389515 | GT-AG | 0 | 0.0001052332313904 | 1103 | rna-gnl|WGS:VYZG|COLPIC_R00056_mrna 9114876 | 11 | 2434182 | 2435284 | Columbina picui 115618 | AAG|GTAACTCACG...GACTTTGTGACT/GACTTTGTGACT...TGCAG|GTC | 1 | 1 | 46.685 |
49389516 | GT-AG | 0 | 1.000000099473604e-05 | 1031 | rna-gnl|WGS:VYZG|COLPIC_R00056_mrna 9114876 | 12 | 2433018 | 2434048 | Columbina picui 115618 | ACT|GTGAGTGGTG...TGCTTCTCAGCC/GGTACTTTCATT...TGCAG|CTA | 2 | 1 | 50.749 |
49389517 | GT-AG | 0 | 1.000000099473604e-05 | 359 | rna-gnl|WGS:VYZG|COLPIC_R00056_mrna 9114876 | 13 | 2432590 | 2432948 | Columbina picui 115618 | TGT|GTAAGTAGGA...CTGGCTCCAATC/CTGGCTCCAATC...TCCAG|TGA | 2 | 1 | 52.857 |
49389518 | GT-AG | 0 | 1.000000099473604e-05 | 241 | rna-gnl|WGS:VYZG|COLPIC_R00056_mrna 9114876 | 14 | 2432277 | 2432517 | Columbina picui 115618 | CCT|GTAAGTGCGG...GTTGCCTTCGGT/CTGTGTTTCCTC...TGCAG|GAT | 2 | 1 | 55.057 |
49389519 | GT-AG | 0 | 1.000000099473604e-05 | 300 | rna-gnl|WGS:VYZG|COLPIC_R00056_mrna 9114876 | 15 | 2431905 | 2432204 | Columbina picui 115618 | GCT|GTAAGTACCT...CGCACCTTGGAG/TTGCCACGCACC...TGCAG|GTC | 2 | 1 | 57.256 |
49389520 | GT-AG | 0 | 1.000000099473604e-05 | 1485 | rna-gnl|WGS:VYZG|COLPIC_R00056_mrna 9114876 | 16 | 2430348 | 2431832 | Columbina picui 115618 | CCT|GTGAGTATCT...TTGCCCTTTCTC/TTCCTCTGCAGG...TGCAG|AGC | 2 | 1 | 59.456 |
49389521 | GT-AG | 0 | 1.000000099473604e-05 | 627 | rna-gnl|WGS:VYZG|COLPIC_R00056_mrna 9114876 | 17 | 2429544 | 2430170 | Columbina picui 115618 | CCT|GTGCTATTTG...CCTCCCTGCATA/CTGGGGGTCACT...CTCAG|TGC | 2 | 1 | 64.864 |
49389522 | GT-AG | 0 | 1.000000099473604e-05 | 794 | rna-gnl|WGS:VYZG|COLPIC_R00056_mrna 9114876 | 18 | 2428567 | 2429360 | Columbina picui 115618 | CAG|GTGAGTGCTG...CACCTCCTGACG/CACCTCCTGACG...GGCAG|GTG | 2 | 1 | 70.455 |
49389523 | GT-AG | 0 | 1.000000099473604e-05 | 366 | rna-gnl|WGS:VYZG|COLPIC_R00056_mrna 9114876 | 19 | 2428061 | 2428426 | Columbina picui 115618 | CAG|GTAGGGGCCT...CATCCCCTGGCA/TTGCTGTGCATC...TGCAG|GGG | 1 | 1 | 74.733 |
49389524 | GT-AG | 0 | 1.000000099473604e-05 | 161 | rna-gnl|WGS:VYZG|COLPIC_R00056_mrna 9114876 | 20 | 2427806 | 2427966 | Columbina picui 115618 | CAG|GTGAGAACCC...AGCCCCTCACCT/AAGCCCCTCACC...CCCAG|GTG | 2 | 1 | 77.605 |
49389525 | GT-AG | 0 | 1.000000099473604e-05 | 395 | rna-gnl|WGS:VYZG|COLPIC_R00056_mrna 9114876 | 21 | 2427273 | 2427667 | Columbina picui 115618 | CAG|GTACTACCAG...ACACCCTTGGCC/GTGCTGTCCACC...TGCAG|TGG | 2 | 1 | 81.821 |
49389526 | GT-AG | 0 | 1.000000099473604e-05 | 87 | rna-gnl|WGS:VYZG|COLPIC_R00056_mrna 9114876 | 22 | 2426948 | 2427034 | Columbina picui 115618 | CAG|GTGAGCGGTG...TCCTGCTTCTCC/CAGTGGTGCAGC...GCCAG|GTC | 0 | 1 | 89.093 |
49389527 | GT-AG | 0 | 0.0023278747424618 | 91 | rna-gnl|WGS:VYZG|COLPIC_R00056_mrna 9114876 | 23 | 2426726 | 2426816 | Columbina picui 115618 | CAG|GTACCTGCCA...CAGTCCTCAGTG/GCAGTCCTCAGT...ACCAG|TGC | 2 | 1 | 93.095 |
49389528 | GT-AG | 0 | 1.000000099473604e-05 | 65 | rna-gnl|WGS:VYZG|COLPIC_R00056_mrna 9114876 | 24 | 2426506 | 2426570 | Columbina picui 115618 | GAG|GTAGGAAAGG...CAGCCCTGTTCT/GGCATGCTGACA...CCCAG|GGA | 1 | 1 | 97.831 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" ( "id" INTEGER, "dinucleotide_pair" TEXT, "is_minor" INTEGER, "score" REAL, "length" INTEGER, "transcript_id" INTEGER, "ordinal_index" INTEGER, "start" INTEGER, "end" INTEGER, "taxonomy_id" INTEGER, "scored_motifs" TEXT, "phase" INTEGER, "in_cds" INTEGER, "relative_position" REAL ,PRIMARY KEY ([id]), FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]), FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id]) ); CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]); CREATE INDEX [idx_introns_taxonomy_id] ON [introns] ([taxonomy_id]); CREATE INDEX [idx_introns_phase] ON [introns] ([phase]); CREATE INDEX [idx_introns_is_minor] ON [introns] ([is_minor]); CREATE INDEX [idx_introns_dinucleotide_pair] ON [introns] ([dinucleotide_pair]); CREATE INDEX [idx_introns_score] ON [introns] ([score]); CREATE INDEX [idx_introns_in_cds] ON [introns] ([in_cds]);