introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
24 rows where transcript_id = 9114892
id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
49389767 | GT-AG | 0 | 1.000000099473604e-05 | 471 | rna-gnl|WGS:VYZG|COLPIC_R00059_mrna 9114892 | 1 | 2245799 | 2246269 | Columbina picui 115618 | ATG|GTAAGACTTG...TTTTTCTTGTTT/TTTGCTCTCATG...TTCAG|ACA | 0 | 1 | 4.935 |
49389768 | GT-AG | 0 | 2.1394347962947547e-05 | 123 | rna-gnl|WGS:VYZG|COLPIC_R00059_mrna 9114892 | 2 | 2245619 | 2245741 | Columbina picui 115618 | AAG|GTAACAGGAA...TGCTTCTTAACC/TGCTTCTTAACC...GACAG|GTC | 0 | 1 | 7.692 |
49389769 | GT-AG | 0 | 0.0134953665743061 | 1001 | rna-gnl|WGS:VYZG|COLPIC_R00059_mrna 9114892 | 3 | 2244471 | 2245471 | Columbina picui 115618 | CAG|GTACCATGCT...GTTGCCTTACAT/TGTTGCCTTACA...TGCAG|ACA | 0 | 1 | 14.804 |
49389770 | GT-AG | 0 | 1.000000099473604e-05 | 1105 | rna-gnl|WGS:VYZG|COLPIC_R00059_mrna 9114892 | 4 | 2243258 | 2244362 | Columbina picui 115618 | AAG|GTGAGGCCCT...TTACTTTTAAAT/TTACTTTTAAAT...TTTAG|AAT | 0 | 1 | 20.029 |
49389771 | GT-AG | 0 | 1.000000099473604e-05 | 853 | rna-gnl|WGS:VYZG|COLPIC_R00059_mrna 9114892 | 5 | 2242312 | 2243164 | Columbina picui 115618 | ACG|GTAAAGAAAT...GAGATTTTAAAA/GAGATTTTAAAA...CTCAG|GTG | 0 | 1 | 24.528 |
49389772 | GT-AG | 0 | 1.000000099473604e-05 | 669 | rna-gnl|WGS:VYZG|COLPIC_R00059_mrna 9114892 | 6 | 2241568 | 2242236 | Columbina picui 115618 | CAG|GTAAGGGGCT...GCTTCCTGAAGC/AGTGTGATTATT...TCCAG|ATC | 0 | 1 | 28.157 |
49389773 | GT-AG | 0 | 2.3241355923495185e-05 | 5502 | rna-gnl|WGS:VYZG|COLPIC_R00059_mrna 9114892 | 7 | 2235979 | 2241480 | Columbina picui 115618 | GAG|GTAGGTATTC...TCTTCCTTCTTC/TCTTCTGTTACC...AGCAG|TCA | 0 | 1 | 32.366 |
49389774 | GT-AG | 0 | 1.000000099473604e-05 | 1033 | rna-gnl|WGS:VYZG|COLPIC_R00059_mrna 9114892 | 8 | 2234889 | 2235921 | Columbina picui 115618 | AAT|GTGAGTATTT...GATTTTGTAACA/GATTTTGTAACA...TTCAG|GGT | 0 | 1 | 35.123 |
49389775 | GT-AG | 0 | 1.000000099473604e-05 | 1594 | rna-gnl|WGS:VYZG|COLPIC_R00059_mrna 9114892 | 9 | 2233251 | 2234844 | Columbina picui 115618 | CAG|GTAAAACCTA...TCATCCTTATTC/CCATTTCTAACC...TTCAG|GCA | 2 | 1 | 37.252 |
49389776 | GT-AG | 0 | 1.000000099473604e-05 | 971 | rna-gnl|WGS:VYZG|COLPIC_R00059_mrna 9114892 | 10 | 2232219 | 2233189 | Columbina picui 115618 | ATG|GTAAGCAGCG...ATAGCTTTCATA/ATAGCTTTCATA...CATAG|GTG | 0 | 1 | 40.203 |
49389777 | GT-AG | 0 | 2.3572349633232116e-05 | 685 | rna-gnl|WGS:VYZG|COLPIC_R00059_mrna 9114892 | 11 | 2231487 | 2232171 | Columbina picui 115618 | CCA|GTAAGTGCAC...TAGCCCTTATCT/CCTTATCTAACT...CACAG|AAC | 2 | 1 | 42.477 |
49389778 | GT-AG | 0 | 1.000000099473604e-05 | 1880 | rna-gnl|WGS:VYZG|COLPIC_R00059_mrna 9114892 | 12 | 2229531 | 2231410 | Columbina picui 115618 | CAA|GTGAGTTTCA...AAGCTGTTGACC/AAGCTGTTGACC...GCTAG|AGC | 0 | 1 | 46.154 |
49389779 | GT-AG | 0 | 1.000000099473604e-05 | 4807 | rna-gnl|WGS:VYZG|COLPIC_R00059_mrna 9114892 | 13 | 2224630 | 2229436 | Columbina picui 115618 | ATG|GTAAGAAATA...TGTGTCTTATCT/ATGTGTCTTATC...TCTAG|TTA | 1 | 1 | 50.701 |
49389780 | GT-AG | 0 | 1.000000099473604e-05 | 4592 | rna-gnl|WGS:VYZG|COLPIC_R00059_mrna 9114892 | 14 | 2219936 | 2224527 | Columbina picui 115618 | AAG|GTAAATTCTG...TGTTTTTTCTTC/GTGTGGCTCATG...TGCAG|ATG | 1 | 1 | 55.636 |
49389781 | GT-AG | 0 | 6.1009138292110014e-05 | 1423 | rna-gnl|WGS:VYZG|COLPIC_R00059_mrna 9114892 | 15 | 2218454 | 2219876 | Columbina picui 115618 | GAG|GTAGGTTTCT...CTTTTCTTGCTC/GCTATTTGTATC...AATAG|GAT | 0 | 1 | 58.491 |
49389782 | GT-AG | 0 | 1.000000099473604e-05 | 1093 | rna-gnl|WGS:VYZG|COLPIC_R00059_mrna 9114892 | 16 | 2217295 | 2218387 | Columbina picui 115618 | ATG|GTGAGTCTTT...GACTTCTTTTCA/CTTCTTTTCATC...TGAAG|CCA | 0 | 1 | 61.684 |
49389783 | GT-AG | 0 | 1.000000099473604e-05 | 1520 | rna-gnl|WGS:VYZG|COLPIC_R00059_mrna 9114892 | 17 | 2215705 | 2217224 | Columbina picui 115618 | TCG|GTGAGTGTCC...CTTACTTTATCA/TCGATACTTACT...GTCAG|ATT | 1 | 1 | 65.07 |
49389784 | GT-AG | 0 | 1.000000099473604e-05 | 2866 | rna-gnl|WGS:VYZG|COLPIC_R00059_mrna 9114892 | 18 | 2212778 | 2215643 | Columbina picui 115618 | TGT|GTAAGTGTCT...TTTGCCATCTCT/ATGTAATTGATG...TCCAG|ACG | 2 | 1 | 68.021 |
49389785 | GT-AG | 0 | 0.0009002941487499 | 14649 | rna-gnl|WGS:VYZG|COLPIC_R00059_mrna 9114892 | 19 | 2198040 | 2212688 | Columbina picui 115618 | CAG|GTACTTTACA...TTGCTTTTAATT/TTTAATTTTATT...TCTAG|CTT | 1 | 1 | 72.327 |
49389786 | GT-AG | 0 | 1.000000099473604e-05 | 514 | rna-gnl|WGS:VYZG|COLPIC_R00059_mrna 9114892 | 20 | 2197443 | 2197956 | Columbina picui 115618 | ATG|GTGAGTACAG...ACTTCTTTACAC/CCTGTTGTAACT...TCTAG|GTT | 0 | 1 | 76.343 |
49389787 | GT-AG | 0 | 1.000000099473604e-05 | 2623 | rna-gnl|WGS:VYZG|COLPIC_R00059_mrna 9114892 | 21 | 2194693 | 2197315 | Columbina picui 115618 | TGG|GTGAGTACCT...GCACTGTTAACT/GCACTGTTAACT...TTCAG|TTC | 1 | 1 | 82.487 |
49389788 | GT-AG | 0 | 1.000000099473604e-05 | 34581 | rna-gnl|WGS:VYZG|COLPIC_R00059_mrna 9114892 | 22 | 2160036 | 2194616 | Columbina picui 115618 | TAG|GTAAGGAATA...CAAGCCTAAGCT/CGAGACCTGAAA...TTCAG|GGC | 2 | 1 | 86.164 |
49389789 | GT-AG | 0 | 1.000000099473604e-05 | 25658 | rna-gnl|WGS:VYZG|COLPIC_R00059_mrna 9114892 | 23 | 2134299 | 2159956 | Columbina picui 115618 | CAG|GTAAGCCAGG...CTGTCTTTGGCT/TGGATGCTGACT...GGCAG|GTT | 0 | 1 | 89.985 |
49389790 | GT-AG | 0 | 1.25235327471417e-05 | 5650 | rna-gnl|WGS:VYZG|COLPIC_R00059_mrna 9114892 | 24 | 2128532 | 2134181 | Columbina picui 115618 | CTG|GTAAGCCGCC...AATCCCTTGGCT/CCTTGGCTAAAA...GTCAG|GTT | 0 | 1 | 95.646 |
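
The coordinates in the rows above appear to be 1-based and inclusive, so `length` should equal `end - start + 1`. The sketch below checks this for three rows copied from the table. It also confirms that `start` decreases as `ordinal_index` increases, which would be consistent with a transcript on the reverse strand, although the table itself does not state the strand. The tuple layout and the function name `inclusive_length` are illustrative assumptions, not part of the database.

```python
# Three rows copied from the table above: (ordinal_index, start, end, length).
ROWS = [
    (1, 2245799, 2246269, 471),
    (2, 2245619, 2245741, 123),
    (24, 2128532, 2134181, 5650),
]


def inclusive_length(start: int, end: int) -> int:
    """Length in base pairs of the closed interval [start, end]."""
    return end - start + 1


# Compare the stored length with the length implied by the coordinates.
lengths_match = all(
    inclusive_length(start, end) == length for _, start, end, length in ROWS
)

# With rows sorted by ordinal_index, check that start positions only decrease.
starts = [start for _, start, _, _ in sorted(ROWS)]
starts_decrease = all(a > b for a, b in zip(starts, starts[1:]))

print(lengths_match, starts_decrease)
```

If the same check failed on other transcripts, the coordinate convention assumed here (closed, 1-based intervals) would need to be revisited.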
CREATE TABLE "introns" (
    "id" INTEGER,
    "dinucleotide_pair" TEXT,
    "is_minor" INTEGER,
    "score" REAL,
    "length" INTEGER,
    "transcript_id" INTEGER,
    "ordinal_index" INTEGER,
    "start" INTEGER,
    "end" INTEGER,
    "taxonomy_id" INTEGER,
    "scored_motifs" TEXT,
    "phase" INTEGER,
    "in_cds" INTEGER,
    "relative_position" REAL,
    PRIMARY KEY ([id]),
    FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
    FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id] ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase] ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor] ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair] ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score] ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds] ON [introns] ([in_cds]);
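
As a rough illustration of how this schema can be queried, the sketch below rebuilds a reduced copy of the table in an in-memory SQLite database using Python's standard `sqlite3` module. It is a minimal sketch under several assumptions: the foreign-key clauses are dropped because the `transcripts` and `genomes` tables are not shown here, only four of the 24 rows are loaded, and `scored_motifs` is left as NULL for brevity.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Reduced copy of the schema: foreign keys omitted (assumption, see lead-in).
conn.executescript("""
CREATE TABLE "introns" (
    "id" INTEGER PRIMARY KEY,
    "dinucleotide_pair" TEXT,
    "is_minor" INTEGER,
    "score" REAL,
    "length" INTEGER,
    "transcript_id" INTEGER,
    "ordinal_index" INTEGER,
    "start" INTEGER,
    "end" INTEGER,
    "taxonomy_id" INTEGER,
    "scored_motifs" TEXT,
    "phase" INTEGER,
    "in_cds" INTEGER,
    "relative_position" REAL
);
CREATE INDEX idx_introns_transcript_id ON "introns" ("transcript_id");
""")

# Four rows copied from the table; scored_motifs is set to None for brevity.
rows = [
    (49389767, "GT-AG", 0, 1.000000099473604e-05, 471, 9114892, 1,
     2245799, 2246269, 115618, None, 0, 1, 4.935),
    (49389775, "GT-AG", 0, 1.000000099473604e-05, 1594, 9114892, 9,
     2233251, 2234844, 115618, None, 2, 1, 37.252),
    (49389779, "GT-AG", 0, 1.000000099473604e-05, 4807, 9114892, 13,
     2224630, 2229436, 115618, None, 1, 1, 50.701),
    (49389790, "GT-AG", 0, 1.25235327471417e-05, 5650, 9114892, 24,
     2128532, 2134181, 115618, None, 0, 1, 95.646),
]
conn.executemany(
    'INSERT INTO "introns" VALUES (?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?)',
    rows,
)

# Introns of one transcript in transcript order (the filter used on this page).
by_ordinal = conn.execute(
    'SELECT "ordinal_index", "length", "phase" FROM "introns" '
    'WHERE "transcript_id" = ? ORDER BY "ordinal_index"',
    (9114892,),
).fetchall()

# Count introns per phase for the same transcript.
phase_counts = conn.execute(
    'SELECT "phase", COUNT(*) FROM "introns" '
    'WHERE "transcript_id" = ? GROUP BY "phase" ORDER BY "phase"',
    (9114892,),
).fetchall()

print(by_ordinal)
print(phase_counts)
```

The column names are quoted throughout because `end` is a reserved word in SQLite; the original schema quotes them for the same reason.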