introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron (e.g., GT-AG)
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
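In the rows listed on this page, `length` appears to equal `end - start + 1`, which suggests inclusive coordinates. The sketch below checks that relation on three rows copied from the table. The in-memory table and its reduced column set are assumptions made for illustration, not the real database file.

```python
import sqlite3

# In-memory stand-in for the introns table, reduced to the columns needed here
# (an assumption for illustration; the real table has more columns).
conn = sqlite3.connect(":memory:")
conn.execute(
    'CREATE TABLE introns ('
    'id INTEGER PRIMARY KEY, length INTEGER, "start" INTEGER, "end" INTEGER)'
)

# Three rows copied from the listing on this page: (id, length, start, end).
rows = [
    (53881989, 2165, 201992442, 201994606),
    (53881990, 28613, 201994779, 202023391),
    (53881997, 100, 202045492, 202045591),
]
conn.executemany("INSERT INTO introns VALUES (?, ?, ?, ?)", rows)

# Count rows whose stored length disagrees with inclusive coordinates.
mismatches = conn.execute(
    'SELECT COUNT(*) FROM introns WHERE length != "end" - "start" + 1'
).fetchone()[0]
print(mismatches)
```

The column names `start` and `end` are quoted because `END` is an SQL keyword.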
58 rows where transcript_id = 9848188
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 53881989 | GT-AG | 0 | 1.000000099473604e-05 | 2165 | rna-XM_019542502.1 9848188 | 2 | 201992442 | 201994606 | Crocodylus porosus 8502 | AAG|GTGAGAATAA...TTTGTATTACTA/TAGTTACTAATG...TTCAG|ATA | 0 | 1 | 8.605 |
| 53881990 | GT-AG | 0 | 2.035383091123925e-05 | 28613 | rna-XM_019542502.1 9848188 | 3 | 201994779 | 202023391 | Crocodylus porosus 8502 | ATA|GTAAGTCCGT...CTTTTTTTATCA/TCTTTTTTTATC...TATAG|CTG | 1 | 1 | 10.605 |
| 53881991 | GT-AG | 0 | 1.8702776394995764e-05 | 4437 | rna-XM_019542502.1 9848188 | 4 | 202023556 | 202027992 | Crocodylus porosus 8502 | ACT|GTAAGTAATA...TTTTCTATATTC/ATACATTTAATC...TATAG|ATT | 0 | 1 | 12.512 |
| 53881992 | GT-AG | 0 | 1.000000099473604e-05 | 1962 | rna-XM_019542502.1 9848188 | 5 | 202028080 | 202030041 | Crocodylus porosus 8502 | GAG|GTGAGGACTC...TGGGACTTAAAT/TTAAATATCATT...TTTAG|GTG | 0 | 1 | 13.523 |
| 53881993 | GT-AG | 0 | 1.000000099473604e-05 | 8049 | rna-XM_019542502.1 9848188 | 6 | 202030175 | 202038223 | Crocodylus porosus 8502 | CAG|GTAGGTGAAC...CATTTTTTAATT/CATTTTTTAATT...GCCAG|TGA | 1 | 1 | 15.07 |
| 53881994 | GT-AG | 0 | 1.000000099473604e-05 | 577 | rna-XM_019542502.1 9848188 | 7 | 202038376 | 202038952 | Crocodylus porosus 8502 | CAA|GTGAGTGACA...TGATTCTTATTT/GTGATTCTTATT...TGCAG|ACT | 0 | 1 | 16.837 |
| 53881995 | GT-AG | 0 | 1.000000099473604e-05 | 3932 | rna-XM_019542502.1 9848188 | 8 | 202039032 | 202042963 | Crocodylus porosus 8502 | CAG|GTCAGGAATT...TTTTTTTTAAAT/TTTTTTTTAAAT...CTTAG|GAA | 1 | 1 | 17.756 |
| 53881996 | GT-AG | 0 | 1.000000099473604e-05 | 2375 | rna-XM_019542502.1 9848188 | 9 | 202043040 | 202045414 | Crocodylus porosus 8502 | CAG|GTTAGTGGTT...TGGATTTGGATA/TCAAGACTAAAT...CCTAG|ATG | 2 | 1 | 18.64 |
| 53881997 | GT-AG | 0 | 0.0001488248783735 | 100 | rna-XM_019542502.1 9848188 | 10 | 202045492 | 202045591 | Crocodylus porosus 8502 | CAA|GTAAGCGTGA...TTCTCTTTATCT/TATATACTAATC...TCCAG|GGC | 1 | 1 | 19.535 |
| 53881998 | GT-AG | 0 | 2.1773775804904564e-05 | 181 | rna-XM_019542502.1 9848188 | 11 | 202045624 | 202045804 | Crocodylus porosus 8502 | ACT|GTAAAGACTT...TTTCTTTTATTC/TTTTCTTTTATT...AATAG|TGG | 0 | 1 | 19.907 |
| 53881999 | GT-AG | 0 | 1.000000099473604e-05 | 7012 | rna-XM_019542502.1 9848188 | 12 | 202045935 | 202052946 | Crocodylus porosus 8502 | ATG|GTAAAATACT...AATATTTTAATT/AATATTTTAATT...CTTAG|GTC | 1 | 1 | 21.419 |
| 53882000 | GT-AG | 0 | 1.5237379781562234e-05 | 1950 | rna-XM_019542502.1 9848188 | 13 | 202053051 | 202055000 | Crocodylus porosus 8502 | ATG|GTATGAAATT...ATTTTCTTGCTT/GTATGATTGATT...GTTAG|GAC | 0 | 1 | 22.628 |
| 53882001 | GT-AG | 0 | 1.000000099473604e-05 | 897 | rna-XM_019542502.1 9848188 | 14 | 202055175 | 202056071 | Crocodylus porosus 8502 | AAG|GTAAGATGGG...TGTTTCTTGGTT/GTGCTATTTACA...TAAAG|GGT | 0 | 1 | 24.651 |
| 53882002 | GT-AG | 0 | 3.2686693444641605e-05 | 1561 | rna-XM_019542502.1 9848188 | 15 | 202056285 | 202057845 | Crocodylus porosus 8502 | ACA|GTAAGCAGTT...TTGAGCTTATAT/GTTGAGCTTATA...CTTAG|GGC | 0 | 1 | 27.128 |
| 53882003 | GT-AG | 0 | 0.0489685420686496 | 3451 | rna-XM_019542502.1 9848188 | 16 | 202058024 | 202061474 | Crocodylus porosus 8502 | AAT|GTATGCAGCT...CATACTTTAAAT/CATACTTTAAAT...TTTAG|TTG | 1 | 1 | 29.198 |
| 53882004 | GT-AG | 0 | 0.0003721054838272 | 12585 | rna-XM_019542502.1 9848188 | 17 | 202061607 | 202074191 | Crocodylus porosus 8502 | CAA|GTAAGTTTTA...TGGATTTTGACC/TGGATTTTGACC...CACAG|AGG | 1 | 1 | 30.733 |
| 53882005 | GT-AG | 0 | 2.353180481328383e-05 | 822 | rna-XM_019542502.1 9848188 | 18 | 202074395 | 202075216 | Crocodylus porosus 8502 | CAG|GTAGATAATA...TCATTCTTAGTT/TTATGCTTCATT...ATCAG|GTT | 0 | 1 | 33.093 |
| 53882006 | GT-AG | 0 | 1.000000099473604e-05 | 2990 | rna-XM_019542502.1 9848188 | 19 | 202075304 | 202078293 | Crocodylus porosus 8502 | AAG|GTAAGAATTT...TGGTTATTACTT/ATGGTTATTACT...CCTAG|GTG | 0 | 1 | 34.105 |
| 53882007 | GT-AG | 0 | 0.0006093508376957 | 208 | rna-XM_019542502.1 9848188 | 20 | 202078427 | 202078634 | Crocodylus porosus 8502 | CAG|GTATGTACTT...GTTTTCTTCTCT/TGTGGACTCAGT...GGAAG|TTA | 1 | 1 | 35.651 |
| 53882008 | GT-AG | 0 | 1.000000099473604e-05 | 3179 | rna-XM_019542502.1 9848188 | 21 | 202078787 | 202081965 | Crocodylus porosus 8502 | ATG|GTAAAGTTGA...TGGCTCTTGTCT/GACTTTCTCAGT...TCTAG|GTT | 0 | 1 | 37.419 |
| 53882009 | GT-AG | 0 | 1.3767208148149128e-05 | 4764 | rna-XM_019542502.1 9848188 | 22 | 202082121 | 202086884 | Crocodylus porosus 8502 | GAG|GTAGAATTTT...TCTTTCTTGTTT/TGATGCCTGACT...CTTAG|GTG | 2 | 1 | 39.221 |
| 53882010 | GT-AG | 0 | 1.000000099473604e-05 | 302 | rna-XM_019542502.1 9848188 | 23 | 202086944 | 202087245 | Crocodylus porosus 8502 | AAG|GTAAGGGCTA...CTTTTATTAACT/CTTTTATTAACT...GGCAG|AAT | 1 | 1 | 39.907 |
| 53882011 | GT-AG | 0 | 1.000000099473604e-05 | 3343 | rna-XM_019542502.1 9848188 | 24 | 202087291 | 202090633 | Crocodylus porosus 8502 | ATG|GTAAGTCCTA...TAATTTTTACTT/AGTGTTTTCATT...TGCAG|AAG | 1 | 1 | 40.43 |
| 53882012 | GT-AG | 0 | 1.000000099473604e-05 | 19292 | rna-XM_019542502.1 9848188 | 25 | 202090798 | 202110089 | Crocodylus porosus 8502 | AAG|GTACAAGGCT...CTATTTTTACTA/TCTATTTTTACT...TAAAG|GTT | 0 | 1 | 42.337 |
| 53882013 | GT-AG | 0 | 0.0038455263045769 | 8554 | rna-XM_019542502.1 9848188 | 26 | 202110319 | 202118872 | Crocodylus porosus 8502 | CAT|GTATGTGTTC...TTCCTCTTTTTT/CAAATAATCAAA...TTTAG|TGG | 1 | 1 | 45.0 |
| 53882014 | GT-AG | 0 | 1.000000099473604e-05 | 892 | rna-XM_019542502.1 9848188 | 27 | 202119028 | 202119919 | Crocodylus porosus 8502 | CAG|GTAAAGAAAA...TCAGTCTTATTT/ATCAGTCTTATT...TACAG|GTG | 0 | 1 | 46.802 |
| 53882015 | GT-AG | 0 | 1.000000099473604e-05 | 8030 | rna-XM_019542502.1 9848188 | 28 | 202119999 | 202128028 | Crocodylus porosus 8502 | CAG|GTAAGAGTTA...CCCCCCTCAAAC/CTTGTTGTCATT...CACAG|GCA | 1 | 1 | 47.721 |
| 53882016 | GT-AG | 0 | 0.0036347608740094 | 2007 | rna-XM_019542502.1 9848188 | 29 | 202128111 | 202130117 | Crocodylus porosus 8502 | TAG|GTATGCCATT...CTCTCCTTCCCC/ACCATCCTAACT...CAAAG|ATG | 2 | 1 | 48.674 |
| 53882017 | GT-AG | 0 | 0.0002494575149237 | 161 | rna-XM_019542502.1 9848188 | 30 | 202130174 | 202130334 | Crocodylus porosus 8502 | AAA|GTAGGTATTC...AAAACTTTAATT/TCTCTCTTCATT...TTCAG|AAA | 1 | 1 | 49.326 |
| 53882018 | GT-AG | 0 | 1.000000099473604e-05 | 8219 | rna-XM_019542502.1 9848188 | 31 | 202130368 | 202138586 | Crocodylus porosus 8502 | GAG|GTGAGAAACC...ATATTTTTACTT/GATATTTTTACT...TTTAG|GAC | 1 | 1 | 49.709 |
| 53882019 | GT-AG | 0 | 1.000000099473604e-05 | 7588 | rna-XM_019542502.1 9848188 | 32 | 202138757 | 202146344 | Crocodylus porosus 8502 | CTA|GTAAGTCATA...ACAGTTTTGATA/CTCTTCTTCATG...TGTAG|GTT | 0 | 1 | 51.686 |
| 53882020 | GT-AG | 0 | 1.000000099473604e-05 | 2174 | rna-XM_019542502.1 9848188 | 33 | 202146565 | 202148738 | Crocodylus porosus 8502 | CAG|GTAAAAATAG...TTTCTCTTTTTT/TTTTATTTCAAT...TATAG|TCC | 1 | 1 | 54.244 |
| 53882021 | GT-AG | 0 | 1.000000099473604e-05 | 3226 | rna-XM_019542502.1 9848188 | 34 | 202148894 | 202152119 | Crocodylus porosus 8502 | AAG|GTAAAAAGTG...GTTGCTTTACTG/TGTTGCTTTACT...TGTAG|GTG | 0 | 1 | 56.047 |
| 53882022 | GT-AG | 0 | 0.0003692797807014 | 1550 | rna-XM_019542502.1 9848188 | 35 | 202152334 | 202153883 | Crocodylus porosus 8502 | TTG|GTAGGTTCTT...TTCTCTTTGATT/GCTGTTTTCATT...TAAAG|GTA | 1 | 1 | 58.535 |
| 53882023 | GT-AG | 0 | 1.000000099473604e-05 | 5227 | rna-XM_019542502.1 9848188 | 36 | 202154075 | 202159301 | Crocodylus porosus 8502 | AAA|GTAGTTCAAA...ATGTATTTGACT/ATGTATTTGACT...CATAG|GAT | 0 | 1 | 60.756 |
| 53882024 | GT-AG | 0 | 0.0001193694728655 | 16492 | rna-XM_019542502.1 9848188 | 37 | 202159513 | 202176004 | Crocodylus porosus 8502 | CTG|GTAATCTAAG...ATTTCTTTCATG/ATTTCTTTCATG...TTCAG|TTG | 1 | 1 | 63.209 |
| 53882025 | GT-AG | 0 | 1.000000099473604e-05 | 1213 | rna-XM_019542502.1 9848188 | 38 | 202176157 | 202177369 | Crocodylus porosus 8502 | CAG|GTAAAACAAC...ATTATTTTCATT/ATTATTTTCATT...CACAG|GTG | 0 | 1 | 64.977 |
| 53882026 | GT-AG | 0 | 0.0008570351158092 | 3907 | rna-XM_019542502.1 9848188 | 39 | 202177449 | 202181355 | Crocodylus porosus 8502 | AAG|GTATAATACA...ATTGTTTTAAAT/ATTGTTTTAAAT...ATCAG|ATG | 1 | 1 | 65.895 |
| 53882027 | GT-AG | 0 | 1.000000099473604e-05 | 144 | rna-XM_019542502.1 9848188 | 40 | 202181488 | 202181631 | Crocodylus porosus 8502 | AAG|GTAATACCAG...GTATTCTGAATA/TTTTGGTTAACT...TGTAG|AAA | 1 | 1 | 67.43 |
| 53882028 | GT-AG | 0 | 1.000000099473604e-05 | 1475 | rna-XM_019542502.1 9848188 | 41 | 202181814 | 202183288 | Crocodylus porosus 8502 | TTG|GTTGGTTTCT...TATTGCATAATG/TTCAAGTTCACA...AACAG|GTG | 0 | 1 | 69.547 |
| 53882029 | GT-AG | 0 | 0.0003863375235208 | 6930 | rna-XM_019542502.1 9848188 | 42 | 202183376 | 202190305 | Crocodylus porosus 8502 | AAG|GTATTTCACA...TTTTCTTTTTCT/ACATTTCTCACA...TGCAG|CTT | 0 | 1 | 70.558 |
| 53882030 | GT-AG | 0 | 0.0026249285721382 | 3271 | rna-XM_019542502.1 9848188 | 43 | 202190439 | 202193709 | Crocodylus porosus 8502 | GAG|GTATGCCACT...CATTCATTGATT/AATGTTTTCATT...GGCAG|AAG | 1 | 1 | 72.105 |
| 53882031 | GT-AG | 0 | 1.000000099473604e-05 | 15572 | rna-XM_019542502.1 9848188 | 44 | 202193862 | 202209433 | Crocodylus porosus 8502 | CAG|GTTAGTATGG...GAAATTTTATCC/GTGTCATTCATT...CTTAG|ATT | 0 | 1 | 73.872 |
| 53882032 | GT-AG | 0 | 1.000000099473604e-05 | 4699 | rna-XM_019542502.1 9848188 | 45 | 202209687 | 202214385 | Crocodylus porosus 8502 | GGG|GTGAGATGCC...AGATTATTAATT/TATTAATTAATT...TCTAG|GGA | 1 | 1 | 76.814 |
| 53882033 | GT-AG | 0 | 1.000000099473604e-05 | 3440 | rna-XM_019542502.1 9848188 | 46 | 202214628 | 202218067 | Crocodylus porosus 8502 | GAG|GTAAAGAACT...TGGCTTTTATTC/GTGGCTTTTATT...TGTAG|GTA | 0 | 1 | 79.628 |
| 53882034 | GT-AG | 0 | 0.0001861705423836 | 2742 | rna-XM_019542502.1 9848188 | 47 | 202218195 | 202220936 | Crocodylus porosus 8502 | CAG|GTTTGTCATG...TATGCCTTAACT/CAATGCCTAATT...AACAG|TAA | 1 | 1 | 81.105 |
| 53882035 | GT-AG | 0 | 1.710867651601541e-05 | 14427 | rna-XM_019542502.1 9848188 | 48 | 202221086 | 202235512 | Crocodylus porosus 8502 | AAG|GTAGGTTATC...TTCATTTTGATC/TTCATTTTGATC...TTCAG|GTG | 0 | 1 | 82.837 |
| 53882036 | GT-AG | 0 | 1.000000099473604e-05 | 1357 | rna-XM_019542502.1 9848188 | 49 | 202235721 | 202237077 | Crocodylus porosus 8502 | CAG|GTCAAAATTA...TTCTCTTCAATC/TTTCTCTTCAAT...ATTAG|AAA | 1 | 1 | 85.256 |
| 53882037 | GT-AG | 0 | 0.0156535355918017 | 322 | rna-XM_019542502.1 9848188 | 50 | 202237138 | 202237459 | Crocodylus porosus 8502 | CAG|GTAGCTCTTA...CATATTTTAATC/CATATTTTAATC...TTCAG|AAG | 1 | 1 | 85.953 |
| 53882038 | GT-AG | 0 | 1.000000099473604e-05 | 614 | rna-XM_019542502.1 9848188 | 51 | 202237639 | 202238252 | Crocodylus porosus 8502 | GAT|GTAAAATACT...TGATGCTGAATG/ATGATGCTGAAT...CCTAG|GTG | 0 | 1 | 88.035 |
| 53882039 | GT-AG | 0 | 1.000000099473604e-05 | 5225 | rna-XM_019542502.1 9848188 | 52 | 202238476 | 202243700 | Crocodylus porosus 8502 | CTG|GTAGGTATAA...CCTGTTTTATTG/GCCTGTTTTATT...TGCAG|TGG | 1 | 1 | 90.628 |
| 53882040 | GT-AG | 0 | 0.0035707535735125 | 112 | rna-XM_019542502.1 9848188 | 53 | 202243856 | 202243967 | Crocodylus porosus 8502 | GAG|GTATTTAATC...TTTATCTTAAAA/AATGTATTTATC...TACAG|TCC | 0 | 1 | 92.43 |
| 53882041 | GT-AG | 0 | 2.6068252868401943e-05 | 1604 | rna-XM_019542502.1 9848188 | 54 | 202244044 | 202245647 | Crocodylus porosus 8502 | CAG|GTAAATTACT...ATTATTTTAATT/ATTATTTTAATT...TGTAG|ACT | 1 | 1 | 93.314 |
| 53882042 | GT-AG | 0 | 0.0003222199899576 | 209 | rna-XM_019542502.1 9848188 | 55 | 202245727 | 202245935 | Crocodylus porosus 8502 | TGA|GTAAGTTTCA...TGTTTTCTAACA/TGTTTTCTAACA...TCCAG|AAT | 2 | 1 | 94.233 |
| 53882043 | GT-AG | 0 | 1.000000099473604e-05 | 1611 | rna-XM_019542502.1 9848188 | 56 | 202246222 | 202247832 | Crocodylus porosus 8502 | AAG|GTAAGTGCTG...CTTTTTTTAATT/CTTTTTTTAATT...TAAAG|GTT | 0 | 1 | 97.558 |
| 53882044 | GT-AG | 0 | 0.0004317126537374 | 8982 | rna-XM_019542502.1 9848188 | 57 | 202247912 | 202256893 | Crocodylus porosus 8502 | TTG|GTATGTATCC...GGTGTATTAAAT/CGTGTTTTCATT...TACAG|GAA | 1 | 1 | 98.477 |
| 53882045 | GT-AG | 0 | 1.000000099473604e-05 | 4694 | rna-XM_019542502.1 9848188 | 58 | 202256970 | 202261663 | Crocodylus porosus 8502 | CAG|GTCTGTATTC...GGGTTTTTATGT/TGGGTTTTTATG...TCAAG|GTG | 2 | 1 | 99.36 |
| 53894266 | GT-AG | 0 | 5.1775590210334285e-05 | 57446 | rna-XM_019542502.1 9848188 | 1 | 201934348 | 201991793 | Crocodylus porosus 8502 | GCA|GTAAGCAAGA...AAAATTTTATAT/AAAAATTTTATA...TTCAG|GTT |  | 0 | 1.209 |
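The rows above are in `id` order, which is why the first intron of the transcript (`ordinal_index` 1, id 53894266) appears last. A minimal sketch of the same filter, ordered by position in the transcript instead, follows. It uses Python's `sqlite3` with a three-row in-memory stand-in; the reduced table is an assumption, and against the real database only the `SELECT` would be needed.

```python
import sqlite3

# Reduced in-memory copy of the introns table (illustrative assumption).
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE introns ("
    "id INTEGER PRIMARY KEY, transcript_id INTEGER, "
    "ordinal_index INTEGER, length INTEGER)"
)

# Three rows copied from the listing: (id, transcript_id, ordinal_index, length).
conn.executemany(
    "INSERT INTO introns VALUES (?, ?, ?, ?)",
    [
        (53881989, 9848188, 2, 2165),
        (53881990, 9848188, 3, 28613),
        (53894266, 9848188, 1, 57446),
    ],
)

# Same filter as the page, but sorted by intron order within the transcript.
ordered = conn.execute(
    "SELECT ordinal_index, id, length FROM introns "
    "WHERE transcript_id = ? ORDER BY ordinal_index",
    (9848188,),
).fetchall()

for row in ordered:
    print(row)
```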
CREATE TABLE "introns" (
    "id" INTEGER,
    "dinucleotide_pair" TEXT,
    "is_minor" INTEGER,
    "score" REAL,
    "length" INTEGER,
    "transcript_id" INTEGER,
    "ordinal_index" INTEGER,
    "start" INTEGER,
    "end" INTEGER,
    "taxonomy_id" INTEGER,
    "scored_motifs" TEXT,
    "phase" INTEGER,
    "in_cds" INTEGER,
    "relative_position" REAL,
    PRIMARY KEY ([id]),
    FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
    FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
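The indexes above cover the columns most likely to appear in filters. As a rough check, assuming the database is SQLite (as the bracket-quoted identifiers suggest), `EXPLAIN QUERY PLAN` can show whether a filter on `transcript_id` uses `idx_introns_transcript_id`. The sketch recreates only part of the schema in memory, and the exact wording of the plan text varies between SQLite versions.

```python
import sqlite3

# Partial, in-memory recreation of the schema (illustrative assumption):
# only the columns and the one index relevant to this check.
conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE introns (
        id INTEGER PRIMARY KEY,
        transcript_id INTEGER,
        score REAL
    );
    CREATE INDEX [idx_introns_transcript_id]
        ON [introns] ([transcript_id]);
    """
)

# Ask SQLite how it would execute the page's filter.
plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT * FROM introns WHERE transcript_id = ?",
    (9848188,),
).fetchall()

# The human-readable description is the last (fourth) column of each plan row.
plan_text = " ".join(row[3] for row in plan)
print(plan_text)
```

If the index name appears in the printed plan, the lookup is an index search rather than a full table scan.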