introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron (e.g., GT-AG)
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
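
In the rows listed on this page, length, start, and end appear to be related by length = end - start + 1, which would mean the coordinates are inclusive on both sides. The column descriptions do not state this, so treat it as an observation from the data rather than a documented guarantee. A minimal sketch that checks it against three rows copied from this page (the helper names are hypothetical):

```python
# (id, start, end, length) copied from rows shown on this page.
ROWS = [
    (122607887, 52330152, 52333194, 3043),
    (122607898, 52282732, 52282912, 181),
    (122607906, 52269191, 52269279, 89),
]


def inclusive_length(start: int, end: int) -> int:
    """Length in base pairs of a closed interval [start, end]."""
    return end - start + 1


def mismatches(rows):
    """Return the ids of rows whose stored length differs from end - start + 1."""
    return [
        intron_id
        for intron_id, start, end, length in rows
        if inclusive_length(start, end) != length
    ]


print(mismatches(ROWS))
```

An empty list means every sampled row is consistent with inclusive coordinates.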
37 rows where transcript_id = 22607907
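
The same filter can be run against a local SQLite copy of the table. The sketch below is an illustration only: it builds a small in-memory stand-in with a subset of the columns instead of opening the real database, and `introns_for_transcript` is a hypothetical helper. Note that `end` is an SQL keyword and has to be quoted.

```python
import sqlite3


def introns_for_transcript(conn, transcript_id):
    """Return (ordinal_index, start, end, phase) rows for one transcript, in transcript order."""
    sql = (
        'SELECT ordinal_index, start, "end", phase '
        "FROM introns WHERE transcript_id = ? ORDER BY ordinal_index"
    )
    return conn.execute(sql, (transcript_id,)).fetchall()


# In-memory stand-in for the real database (assumption: only a subset of columns).
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE introns ("
    "id INTEGER PRIMARY KEY, transcript_id INTEGER, ordinal_index INTEGER, "
    'start INTEGER, "end" INTEGER, phase INTEGER)'
)
conn.executemany(
    "INSERT INTO introns VALUES (?, ?, ?, ?, ?, ?)",
    [
        # Two rows copied from this page, inserted out of order on purpose.
        (122607888, 22607907, 2, 52300680, 52330051, 2),
        (122607887, 22607907, 1, 52330152, 52333194, 1),
        # Hypothetical row for a different transcript, to show the filter works.
        (1, 999, 1, 100, 200, 0),
    ],
)

rows = introns_for_transcript(conn, 22607907)
for row in rows:
    print(row)
```

Passing the transcript id as a bound parameter (`?`) rather than formatting it into the SQL string keeps the query safe to reuse with untrusted input.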
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 122607887 | GT-AG | 0 | 1.000000099473604e-05 | 3043 | rna-XM_021214354.1 22607907 | 1 | 52330152 | 52333194 | Mus pahari 10093 | GCT\|GTGAGTGCCG...CTTCCCTTTTCG/ATGGCGTTTACA...TTTAG\|CTG | 1 | 1 | 1.106 |
| 122607888 | GT-AG | 0 | 1.000000099473604e-05 | 29372 | rna-XM_021214354.1 22607907 | 2 | 52300680 | 52330051 | Mus pahari 10093 | GAG\|GTACAGAGCG...CCGTCCTTTTCT/GGATTTCTGACC...TGCAG\|GTG | 2 | 1 | 3.116 |
| 122607889 | GT-AG | 0 | 1.000000099473604e-05 | 2249 | rna-XM_021214354.1 22607907 | 3 | 52298274 | 52300522 | Mus pahari 10093 | AAG\|GTGAGTCTTT...GTGTTCTTGTGC/CGCTTGCCAACA...TCCAG\|GCT | 0 | 1 | 6.273 |
| 122607890 | GT-AG | 0 | 0.0002785226916999 | 1713 | rna-XM_021214354.1 22607907 | 4 | 52296483 | 52298195 | Mus pahari 10093 | AAG\|GTAAGCTACG...ACTTCTTTGATT/ATTCTTCTAACC...CCTAG\|ATT | 0 | 1 | 7.841 |
| 122607891 | GT-AG | 0 | 0.0010940970645529 | 2920 | rna-XM_021214354.1 22607907 | 5 | 52293486 | 52296405 | Mus pahari 10093 | CAG\|GTAACCAGTG...TCACTCTTTGCA/CGAGGCCTCACT...TCTAG\|TTT | 2 | 1 | 9.389 |
| 122607892 | GT-AG | 0 | 1.000000099473604e-05 | 1722 | rna-XM_021214354.1 22607907 | 6 | 52291696 | 52293417 | Mus pahari 10093 | CAG\|GTAAGGAGTT...GTTGCTTGAGCA/CAGGTCTTCATC...TGCAG\|AAG | 1 | 1 | 10.756 |
| 122607893 | GT-AG | 0 | 1.000000099473604e-05 | 767 | rna-XM_021214354.1 22607907 | 7 | 52290815 | 52291581 | Mus pahari 10093 | CGC\|GTAAGAGGGG...TGTGTCTTGATT/TGTGTCTTGATT...TTTAG\|TAC | 1 | 1 | 13.048 |
| 122607894 | GT-AG | 0 | 2.861847389849745e-05 | 1646 | rna-XM_021214354.1 22607907 | 8 | 52288990 | 52290635 | Mus pahari 10093 | AGG\|GTACAAATGC...CTGTTTTTATTC/TTTTTATTCATC...CTCAG\|ACG | 0 | 1 | 16.647 |
| 122607895 | GT-AG | 0 | 0.0002800723268052 | 628 | rna-XM_021214354.1 22607907 | 9 | 52288277 | 52288904 | Mus pahari 10093 | ACA\|GTAAGTATCG...ATGCTTTTAACC/ATGCTTTTAACC...CTCAG\|CAT | 1 | 1 | 18.355 |
| 122607896 | GT-AG | 0 | 1.000000099473604e-05 | 1197 | rna-XM_021214354.1 22607907 | 10 | 52286916 | 52288112 | Mus pahari 10093 | CAG\|GTGAGCATGC...ATGTCCTTTGCT/AGCCATCTCATC...CCTAG\|AGT | 0 | 1 | 21.653 |
| 122607897 | GT-AG | 0 | 1.000000099473604e-05 | 3754 | rna-XM_021214354.1 22607907 | 11 | 52283077 | 52286830 | Mus pahari 10093 | GAC\|GTAAGAGTCC...GGGCTCTAAGCC/TGCATGCTGACC...TTCAG\|GGT | 1 | 1 | 23.361 |
| 122607898 | GT-AG | 0 | 1.000000099473604e-05 | 181 | rna-XM_021214354.1 22607907 | 12 | 52282732 | 52282912 | Mus pahari 10093 | GAG\|GTAAGTTAAC...GAGTCCTTCCTC/TCCTTCCTCACT...CACAG\|CAT | 0 | 1 | 26.659 |
| 122607899 | GT-AG | 0 | 1.000000099473604e-05 | 443 | rna-XM_021214354.1 22607907 | 13 | 52282128 | 52282570 | Mus pahari 10093 | GAG\|GTGGGGGCCC...CTTTTTTTATTC/TTTTTTTTCACC...GGCAG\|GTA | 2 | 1 | 29.895 |
| 122607900 | GT-AG | 0 | 1.000000099473604e-05 | 2029 | rna-XM_021214354.1 22607907 | 14 | 52279974 | 52282002 | Mus pahari 10093 | AGC\|GTGAGTAACC...GCCTCCTGAGTA/AGCCTCCTGAGT...CTTAG\|GGA | 1 | 1 | 32.409 |
| 122607901 | GT-AG | 0 | 1.000000099473604e-05 | 4921 | rna-XM_021214354.1 22607907 | 15 | 52274889 | 52279809 | Mus pahari 10093 | CAG\|GTGAGCCACC...CATACCCTGATG/CTGATGTTCACC...TGTAG\|GAA | 0 | 1 | 35.706 |
| 122607902 | GT-AG | 0 | 0.00032795137646 | 157 | rna-XM_021214354.1 22607907 | 16 | 52274641 | 52274797 | Mus pahari 10093 | GGT\|GTAAGTATCC...AGCTCCTAAACT/CCGTTGATAACT...GCCAG\|TTG | 1 | 1 | 37.535 |
| 122607903 | GT-AG | 0 | 1.000000099473604e-05 | 607 | rna-XM_021214354.1 22607907 | 17 | 52273866 | 52274472 | Mus pahari 10093 | CAG\|GTGGGTCATG...TTTCTGTTTTCT/CTCTGGGTTACT...GCCAG\|GAG | 1 | 1 | 40.913 |
| 122607904 | GT-AG | 0 | 1.374173768206728e-05 | 3277 | rna-XM_021214354.1 22607907 | 18 | 52270446 | 52273722 | Mus pahari 10093 | CAG\|GTAGGCCACC...TGTCCCTGAATG/TTGTCCCTGAAT...TTCAG\|AGC | 0 | 1 | 43.788 |
| 122607905 | GT-AG | 0 | 0.111786124066319 | 911 | rna-XM_021214354.1 22607907 | 19 | 52269364 | 52270274 | Mus pahari 10093 | CAG\|GTACCCCCAC...GATTCTTTGAAA/CTTTCACTGATT...TTTAG\|TCA | 0 | 1 | 47.226 |
| 122607906 | GT-AG | 0 | 2.4373256858078764e-05 | 89 | rna-XM_021214354.1 22607907 | 20 | 52269191 | 52269279 | Mus pahari 10093 | AAG\|GTATGGTAGC...ACCCTCTAAGTT/GCAAGCCTGACA...CACAG\|ATT | 0 | 1 | 48.914 |
| 122607907 | GT-AG | 0 | 1.326783171727218e-05 | 91 | rna-XM_021214354.1 22607907 | 21 | 52269028 | 52269118 | Mus pahari 10093 | CAT\|GTAAGTAGCC...TTCTTCTTTGTT/AAGTGAGTAAAG...CTTAG\|ATA | 0 | 1 | 50.362 |
| 122607908 | GT-AG | 0 | 1.000000099473604e-05 | 349 | rna-XM_021214354.1 22607907 | 22 | 52268609 | 52268957 | Mus pahari 10093 | TCA\|GTGAGTAGCC...TGATCCATATTT/TTCCTGGTAACT...GTCAG\|TTA | 1 | 1 | 51.769 |
| 122607909 | GT-AG | 0 | 0.0002650472028474 | 1972 | rna-XM_021214354.1 22607907 | 23 | 52266428 | 52268399 | Mus pahari 10093 | CAG\|GTATGCACGG...TTGTTTTTTATT/TTGTTTTTTATT...TCTAG\|GAT | 0 | 1 | 55.971 |
| 122607910 | GC-AG | 0 | 1.000000099473604e-05 | 497 | rna-XM_021214354.1 22607907 | 24 | 52265775 | 52266271 | Mus pahari 10093 | CAG\|GCAAGTGGCT...AAGTTTCTGATG/AAGTTTCTGATG...TTCAG\|ACC | 0 | 1 | 59.107 |
| 122607911 | GT-AG | 0 | 2.8390972661958596e-05 | 1277 | rna-XM_021214354.1 22607907 | 25 | 52264334 | 52265610 | Mus pahari 10093 | CAA\|GTATGAACGG...GTTAGCTTAAAT/TAGATTCTGAGG...CATAG\|GTC | 2 | 1 | 62.405 |
| 122607912 | GT-AG | 0 | 1.000000099473604e-05 | 3828 | rna-XM_021214354.1 22607907 | 26 | 52260281 | 52264108 | Mus pahari 10093 | GAG\|GTTGGTAACC...TTGGTCTAACAT/GTTGGTCTAACA...CTCAG\|TAA | 2 | 1 | 66.928 |
| 122607913 | GT-AG | 0 | 1.000000099473604e-05 | 88 | rna-XM_021214354.1 22607907 | 27 | 52260052 | 52260139 | Mus pahari 10093 | CCC\|GTGAGTACCG...GTGGCCTGACCT/TGTGGCCTGACC...CACAG\|TTA | 2 | 1 | 69.763 |
| 122607914 | GT-AG | 0 | 1.000000099473604e-05 | 1045 | rna-XM_021214354.1 22607907 | 28 | 52258922 | 52259966 | Mus pahari 10093 | AAG\|GTAAGAATCT...TGGTCTTTAAGG/CTGGTCTTTAAG...TTCAG\|ATT | 0 | 1 | 71.472 |
| 122607915 | GT-AG | 0 | 1.000000099473604e-05 | 637 | rna-XM_021214354.1 22607907 | 29 | 52258052 | 52258688 | Mus pahari 10093 | CAG\|GTAAGAGACA...GCTGCCCTGACA/CTGGTATTTACT...TTCAG\|ACG | 2 | 1 | 76.156 |
| 122607916 | GT-AG | 0 | 1.000000099473604e-05 | 1461 | rna-XM_021214354.1 22607907 | 30 | 52256458 | 52257918 | Mus pahari 10093 | ACT\|GTAAGTGCTT...AGAGGGTTGAAG/GGACCAATCACT...TGCAG\|CTC | 0 | 1 | 78.83 |
| 122607917 | GT-AG | 0 | 0.0005686961646232 | 629 | rna-XM_021214354.1 22607907 | 31 | 52255726 | 52256354 | Mus pahari 10093 | TAG\|GTAACATTCT...ATTGCATTGATT/ATTGCATTGATT...TGCAG\|GAG | 1 | 1 | 80.901 |
| 122607918 | GT-AG | 0 | 1.000000099473604e-05 | 288 | rna-XM_021214354.1 22607907 | 32 | 52255302 | 52255589 | Mus pahari 10093 | GAA\|GTGAGTACTG...GCCATGTTGAAA/CACGTTCCCATC...TACAG\|TAC | 2 | 1 | 83.635 |
| 122607919 | GT-AG | 0 | 1.000000099473604e-05 | 1321 | rna-XM_021214354.1 22607907 | 33 | 52253893 | 52255213 | Mus pahari 10093 | CAG\|GTACAAATGA...TGGTTTTTGTTG/GGTGAACTGACA...TGTAG\|GAA | 0 | 1 | 85.404 |
| 122607920 | GT-AG | 0 | 1.000000099473604e-05 | 650 | rna-XM_021214354.1 22607907 | 34 | 52253030 | 52253679 | Mus pahari 10093 | AAG\|GTACAGCACC...TTTTTCCTAATT/TTTTTCCTAATT...CTCAG\|GAT | 0 | 1 | 89.686 |
| 122607921 | GT-AG | 0 | 1.000000099473604e-05 | 3895 | rna-XM_021214354.1 22607907 | 35 | 52248968 | 52252862 | Mus pahari 10093 | CAA\|GTGAGTCTTG...TGGTTTTTATCT/GTGGTTTTTATC...CACAG\|GGT | 2 | 1 | 93.044 |
| 122607922 | GT-AG | 0 | 1.000000099473604e-05 | 4663 | rna-XM_021214354.1 22607907 | 36 | 52244182 | 52248844 | Mus pahari 10093 | CCA\|GTGAGTGCTT...GTCTCATTACCT/TCAAAGCTCACA...AACAG\|ATT | 2 | 1 | 95.517 |
| 122607923 | GT-AG | 0 | 1.000000099473604e-05 | 3141 | rna-XM_021214354.1 22607907 | 37 | 52240932 | 52244072 | Mus pahari 10093 | CAG\|GTGGGTGTGC...TTGTTCTGGATT/CTGGATTTGATG...AACAG\|GAC | 0 | 1 | 97.708 |
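
In the rows above, the dinucleotide_pair value matches the first and last two bases of the middle segment of scored_motifs, where the segments are separated by the pipe character. The exact layout of scored_motifs is not documented on this page, so the sketch below relies on that observed pattern as an assumption; `terminal_dinucleotides` is a hypothetical helper.

```python
def terminal_dinucleotides(scored_motifs: str) -> str:
    """Derive a 'XX-YY' pair from a scored_motifs string.

    Assumes the layout seen in the rows on this page:
    <upstream flank>|<intron sequence with elisions>|<downstream flank>
    """
    parts = scored_motifs.split("|")
    if len(parts) != 3:
        raise ValueError(f"unexpected scored_motifs layout: {scored_motifs!r}")
    intron = parts[1]
    # First two and last two bases of the intron segment.
    return f"{intron[:2]}-{intron[-2:]}"


# Raw values copied from introns 122607887 and 122607910 on this page.
gt_ag = terminal_dinucleotides(
    "GCT|GTGAGTGCCG...CTTCCCTTTTCG/ATGGCGTTTACA...TTTAG|CTG"
)
gc_ag = terminal_dinucleotides(
    "CAG|GCAAGTGGCT...AAGTTTCTGATG/AAGTTTCTGATG...TTCAG|ACC"
)
print(gt_ag, gc_ag)
```

For these two rows the derived values agree with the stored dinucleotide_pair column (GT-AG and GC-AG).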
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL,
PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
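
The indexes above suggest that lookups by transcript_id, taxonomy_id, phase, is_minor, dinucleotide_pair, score, and in_cds are meant to be cheap. One way to confirm that a given query can use an index is SQLite's EXPLAIN QUERY PLAN. The sketch below recreates a reduced version of the schema in memory (a subset of columns and indexes, chosen for illustration) and inspects the plan; `query_plan` is a hypothetical helper, and the exact wording of the plan text varies between SQLite versions.

```python
import sqlite3

# Reduced copy of the schema above: only the columns and indexes needed here.
SCHEMA = """
CREATE TABLE introns (
    id INTEGER PRIMARY KEY,
    transcript_id INTEGER,
    is_minor INTEGER,
    score REAL
);
CREATE INDEX idx_introns_transcript_id ON introns (transcript_id);
CREATE INDEX idx_introns_is_minor ON introns (is_minor);
CREATE INDEX idx_introns_score ON introns (score);
"""


def query_plan(conn, sql, params=()):
    """Return the human-readable 'detail' strings from EXPLAIN QUERY PLAN."""
    cursor = conn.execute("EXPLAIN QUERY PLAN " + sql, params)
    # The detail text is the fourth column of each plan row.
    return [row[3] for row in cursor]


conn = sqlite3.connect(":memory:")
conn.executescript(SCHEMA)

plan = query_plan(
    conn, "SELECT * FROM introns WHERE transcript_id = ?", (22607907,)
)
for detail in plan:
    print(detail)
```

If the printed plan names idx_introns_transcript_id, the filter is resolved through the index instead of a full table scan.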