introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) that the intron is minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
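The column descriptions do not say whether `start` and `end` are inclusive. The rows listed for transcript 9114825 are consistent with both ends being inclusive, so that `length = end - start + 1`. The sketch below checks that reading against three rows copied from the listing; the helper name `inclusive_length` is only illustrative, and the relationship is an inference from the displayed data rather than a documented guarantee.

```python
# Coordinates copied from three rows of the transcript 9114825 listing.
rows = [
    {"id": 49389070, "start": 2301003, "end": 2304320, "length": 3318},
    {"id": 49389077, "start": 2265579, "end": 2265773, "length": 195},
    {"id": 49389090, "start": 2233860, "end": 2234764, "length": 905},
]


def inclusive_length(start: int, end: int) -> int:
    """Length of an interval whose start and end are both inclusive."""
    return end - start + 1


# Collect the ids of any rows whose stored length disagrees with the formula.
mismatches = [
    r["id"] for r in rows
    if inclusive_length(r["start"], r["end"]) != r["length"]
]
print(mismatches)  # [] for the three rows above
```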
21 rows where transcript_id = 9114825
id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
49389070 | GT-AG | 0 | 1.000000099473604e-05 | 3318 | rna-gnl\|WGS:VYZG\|COLPIC_R05513_mrna 9114825 | 1 | 2301003 | 2304320 | Columbina picui 115618 | CAA\|GTGAGTACAA...TCTTTTTTATTT/CTCTTTTTTATT...ACCAG\|GAA | 2 | 1 | 3.504 |
49389071 | GT-AG | 0 | 1.000000099473604e-05 | 3949 | rna-gnl\|WGS:VYZG\|COLPIC_R05513_mrna 9114825 | 2 | 2297023 | 2300971 | Columbina picui 115618 | AAT\|GTAAGTACAT...TTTTTATTAACT/TTTTTATTAACT...CACAG\|GTA | 0 | 1 | 4.519 |
49389072 | GT-AG | 0 | 3.2474387592645635 | 4369 | rna-gnl\|WGS:VYZG\|COLPIC_R05513_mrna 9114825 | 3 | 2292543 | 2296911 | Columbina picui 115618 | AAG\|GTATCTTTTT...GGCTTCTTTATG/GGCTTCTTTATG...TGCAG\|GAA | 0 | 1 | 8.153 |
49389073 | GT-AG | 0 | 0.0004532112142486 | 5342 | rna-gnl\|WGS:VYZG\|COLPIC_R05513_mrna 9114825 | 4 | 2287001 | 2292342 | Columbina picui 115618 | TAA\|GTACTGCTTG...GATTTTTTACTC/TCCTTCCTCACT...GGGAG\|AGG | 2 | 1 | 14.702 |
49389074 | GT-AG | 0 | 1.000000099473604e-05 | 13680 | rna-gnl\|WGS:VYZG\|COLPIC_R05513_mrna 9114825 | 5 | 2273075 | 2286754 | Columbina picui 115618 | CAG\|GTAAGACCAG...ATGTTTTTATCA/AATGTTTTTATC...TGTAG\|ATT | 2 | 1 | 22.757 |
49389075 | GT-AG | 0 | 0.0003399353147025 | 4806 | rna-gnl\|WGS:VYZG\|COLPIC_R05513_mrna 9114825 | 6 | 2268233 | 2273038 | Columbina picui 115618 | AGA\|GTAAGTTGTC...TACTTCTTATTC/TTACTTCTTATT...TTTAG\|GAG | 2 | 1 | 23.936 |
49389076 | GT-AG | 0 | 1.000000099473604e-05 | 2177 | rna-gnl\|WGS:VYZG\|COLPIC_R05513_mrna 9114825 | 7 | 2265929 | 2268105 | Columbina picui 115618 | GAT\|GTGAGTATGA...CAGTTTTTAGCT/CCAGTTTTTAGC...CACAG\|GAC | 0 | 1 | 28.094 |
49389077 | GT-AG | 0 | 0.1026978485038782 | 195 | rna-gnl\|WGS:VYZG\|COLPIC_R05513_mrna 9114825 | 8 | 2265579 | 2265773 | Columbina picui 115618 | GAA\|GTATGTTGTT...ATTATTTTAATG/ATTATTTTAATG...TGCAG\|AGA | 2 | 1 | 33.17 |
49389078 | GT-AG | 0 | 1.000000099473604e-05 | 3435 | rna-gnl\|WGS:VYZG\|COLPIC_R05513_mrna 9114825 | 9 | 2261741 | 2265175 | Columbina picui 115618 | AAG\|GTAGGATACA...ATAGCCTTAGCA/ATTTATTTAACT...TGCAG\|GTA | 0 | 1 | 46.365 |
49389079 | GT-AG | 0 | 1.000000099473604e-05 | 5894 | rna-gnl\|WGS:VYZG\|COLPIC_R05513_mrna 9114825 | 10 | 2255625 | 2261518 | Columbina picui 115618 | ACA\|GTAAGACCAA...GCTTCCTCACCT/GGCTTCCTCACC...TGTAG\|GCA | 0 | 1 | 53.635 |
49389080 | GT-AG | 0 | 1.000000099473604e-05 | 6185 | rna-gnl\|WGS:VYZG\|COLPIC_R05513_mrna 9114825 | 11 | 2249355 | 2255539 | Columbina picui 115618 | GCA\|GTGCTGAGAG...AAATTCTTAACC/AAATTCTTAACC...ATTAG\|GTT | 1 | 1 | 56.418 |
49389081 | GT-AG | 0 | 0.0009245936821595 | 658 | rna-gnl\|WGS:VYZG\|COLPIC_R05513_mrna 9114825 | 12 | 2248524 | 2249181 | Columbina picui 115618 | GTG\|GTATGTACAG...TGATTTTTGAAT/TGATTTTTGAAT...AACAG\|ATG | 0 | 1 | 62.083 |
49389082 | GT-AG | 0 | 1.000000099473604e-05 | 3091 | rna-gnl\|WGS:VYZG\|COLPIC_R05513_mrna 9114825 | 13 | 2245274 | 2248364 | Columbina picui 115618 | TGT\|GTGATGTTCA...GCAGCTTTCTCT/TCTCTGGTCACC...AACAG\|CTT | 0 | 1 | 67.289 |
49389083 | GT-AG | 0 | 1.000000099473604e-05 | 888 | rna-gnl\|WGS:VYZG\|COLPIC_R05513_mrna 9114825 | 14 | 2244281 | 2245168 | Columbina picui 115618 | CAG\|GTAAGGAAGA...TTACTCCTAATA/TTACTCCTAATA...TACAG\|GAG | 0 | 1 | 70.727 |
49389084 | GT-AG | 0 | 1.000000099473604e-05 | 1028 | rna-gnl\|WGS:VYZG\|COLPIC_R05513_mrna 9114825 | 15 | 2243160 | 2244187 | Columbina picui 115618 | GAG\|GTAAAAGGCT...GTATTTTTAAGT/TCTCTTCTGATC...TTCAG\|GAA | 0 | 1 | 73.772 |
49389085 | GT-AG | 0 | 1.000000099473604e-05 | 1373 | rna-gnl\|WGS:VYZG\|COLPIC_R05513_mrna 9114825 | 16 | 2241682 | 2243054 | Columbina picui 115618 | GTG\|GTGAGTAGGA...TCGTTTTTCCCT/GCCAGATTCAAC...TCTAG\|AAC | 0 | 1 | 77.21 |
49389086 | GT-AG | 0 | 0.0002928225948266 | 762 | rna-gnl\|WGS:VYZG\|COLPIC_R05513_mrna 9114825 | 17 | 2240783 | 2241544 | Columbina picui 115618 | GGG\|GTATGGATCA...TTTTCCCTGACT/CTTGTTTTTATT...AGCAG\|TGG | 2 | 1 | 81.696 |
49389087 | GT-AG | 0 | 1.000000099473604e-05 | 1440 | rna-gnl\|WGS:VYZG\|COLPIC_R05513_mrna 9114825 | 18 | 2239221 | 2240660 | Columbina picui 115618 | AGG\|GTGAGGGTCA...AACATTTTATCA/CATTTTATCACC...TGTAG\|ATG | 1 | 1 | 85.691 |
49389088 | GT-AG | 0 | 1.000000099473604e-05 | 1357 | rna-gnl\|WGS:VYZG\|COLPIC_R05513_mrna 9114825 | 19 | 2237740 | 2239096 | Columbina picui 115618 | GAG\|GTGAGAACGT...TGTTCTTTAGCT/GCTGTATTAATG...TTTAG\|GGC | 2 | 1 | 89.751 |
49389089 | GT-AG | 0 | 1.000000099473604e-05 | 2788 | rna-gnl\|WGS:VYZG\|COLPIC_R05513_mrna 9114825 | 20 | 2234874 | 2237661 | Columbina picui 115618 | CAG\|GTACAAAAGG...GCTTTCTTGAAG/CTTGAAGTCATG...CCTAG\|GTC | 2 | 1 | 92.305 |
49389090 | GT-AG | 0 | 0.0080660286033051 | 905 | rna-gnl\|WGS:VYZG\|COLPIC_R05513_mrna 9114825 | 21 | 2233860 | 2234764 | Columbina picui 115618 | AGG\|GTATGTCTCT...TCTCCCTTACTT/GTCTCCCTTACT...TACAG\|TGT | 0 | 1 | 95.874 |
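The listing above is the `introns` table filtered on `transcript_id = 9114825` and ordered by `id`. A minimal sketch of that filter using Python's built-in `sqlite3` module follows. It assumes a local SQLite copy of the data, which is simulated here with an in-memory table holding a subset of the columns; three rows are copied from the listing, and the row with `transcript_id = 1` is a hypothetical addition so that the filter has something to exclude.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Subset of the introns columns; "end" is quoted because END is an SQL keyword.
conn.execute(
    """
    CREATE TABLE introns (
        id INTEGER PRIMARY KEY,
        transcript_id INTEGER,
        ordinal_index INTEGER,
        start INTEGER,
        "end" INTEGER,
        phase INTEGER
    )
    """
)

conn.executemany(
    "INSERT INTO introns VALUES (?, ?, ?, ?, ?, ?)",
    [
        (49389070, 9114825, 1, 2301003, 2304320, 2),
        (49389071, 9114825, 2, 2297023, 2300971, 0),
        (49389072, 9114825, 3, 2292543, 2296911, 0),
        (1, 1, 1, 100, 200, 0),  # hypothetical row from another transcript
    ],
)

# Parameterized equivalent of the "where transcript_id = 9114825" filter.
rows = conn.execute(
    'SELECT id, ordinal_index, start, "end", phase '
    "FROM introns WHERE transcript_id = ? ORDER BY id",
    (9114825,),
).fetchall()

for row in rows:
    print(row)
```

Against the full database, the same query would be expected to return the 21 rows shown in the listing.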
CREATE TABLE "introns" (
    "id" INTEGER,
    "dinucleotide_pair" TEXT,
    "is_minor" INTEGER,
    "score" REAL,
    "length" INTEGER,
    "transcript_id" INTEGER,
    "ordinal_index" INTEGER,
    "start" INTEGER,
    "end" INTEGER,
    "taxonomy_id" INTEGER,
    "scored_motifs" TEXT,
    "phase" INTEGER,
    "in_cds" INTEGER,
    "relative_position" REAL,
    PRIMARY KEY ([id]),
    FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
    FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id] ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase] ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor] ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair] ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score] ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds] ON [introns] ([in_cds]);
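The schema defines an index on `transcript_id`, so an equality filter on that column should not need a full table scan. The sketch below illustrates this with `sqlite3` and `EXPLAIN QUERY PLAN` on an empty in-memory copy of the table. It is an assumption-laden simplification: the foreign key clauses are omitted because the `transcripts` and `genomes` tables are not defined here, only one of the seven indexes is created, and the exact wording of the plan text varies between SQLite versions.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Table definition from the schema, minus the FOREIGN KEY clauses
# (the referenced tables are not part of this sketch).
conn.executescript(
    """
    CREATE TABLE "introns" (
        "id" INTEGER,
        "dinucleotide_pair" TEXT,
        "is_minor" INTEGER,
        "score" REAL,
        "length" INTEGER,
        "transcript_id" INTEGER,
        "ordinal_index" INTEGER,
        "start" INTEGER,
        "end" INTEGER,
        "taxonomy_id" INTEGER,
        "scored_motifs" TEXT,
        "phase" INTEGER,
        "in_cds" INTEGER,
        "relative_position" REAL,
        PRIMARY KEY ([id])
    );
    CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
    """
)

# Each plan row is (id, parent, notused, detail); the detail text names
# the index SQLite intends to use for the lookup.
plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT * FROM introns WHERE transcript_id = ?",
    (9114825,),
).fetchall()
details = [row[3] for row in plan]

for detail in details:
    print(detail)
```

If the printed plan mentions `idx_introns_transcript_id`, the per-transcript lookup is served by the index rather than by scanning every intron.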