introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
62 rows where transcript_id = 3555597
This data as json, CSV (advanced)
Suggested facets: dinucleotide_pair, score, phase
| id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 17673770 | GT-AG | 0 | 1.000000099473604e-05 | 24902 | rna-XM_038321673.2 3555597 | 1 | 120149471 | 120174372 | Arvicola amphibius 1047088 | CAG|GTGGGTGACC...TTTTTCTTCTCT/CTAATATTAATT...CCCAG|AAC | 2 | 1 | 0.81 |
| 17673771 | GT-AG | 0 | 2.324064995455014e-05 | 687 | rna-XM_038321673.2 3555597 | 2 | 120148620 | 120149306 | Arvicola amphibius 1047088 | AGG|GTATAAATGC...AAACATTTACTT/TAAACATTTACT...TTCAG|GTG | 1 | 1 | 2.304 |
| 17673772 | GT-AG | 0 | 1.000000099473604e-05 | 3129 | rna-XM_038321673.2 3555597 | 3 | 120145335 | 120148463 | Arvicola amphibius 1047088 | AGG|GTAAGACTTG...ATATTTTTAGTC/ATATGCTTCATA...CACAG|AGC | 1 | 1 | 3.724 |
| 17673773 | GT-AG | 0 | 1.000000099473604e-05 | 1246 | rna-XM_038321673.2 3555597 | 4 | 120143952 | 120145197 | Arvicola amphibius 1047088 | CTG|GTGAGTAAAC...CTGCTTTTAAAT/GTTTTTATAATT...TTAAG|GTA | 0 | 1 | 4.971 |
| 17673774 | GT-AG | 0 | 1.000000099473604e-05 | 992 | rna-XM_038321673.2 3555597 | 5 | 120142901 | 120143892 | Arvicola amphibius 1047088 | AAG|GTAAGTAACT...TTTGCATTAAAA/TTTGTTTGCATT...CACAG|TAG | 2 | 1 | 5.509 |
| 17673775 | GT-AG | 0 | 1.000000099473604e-05 | 3299 | rna-XM_038321673.2 3555597 | 6 | 120139388 | 120142686 | Arvicola amphibius 1047088 | CAG|GTGAGAGCCA...TTCTCCCTACAT/AGGATTATAACT...TTCAG|CTC | 0 | 1 | 7.457 |
| 17673776 | GT-AG | 0 | 1.000000099473604e-05 | 392 | rna-XM_038321673.2 3555597 | 7 | 120138870 | 120139261 | Arvicola amphibius 1047088 | AGG|GTGAGTTTCT...GAAACCTTAAAA/ACTGGCCTCATC...TCAAG|GAT | 0 | 1 | 8.604 |
| 17673777 | GT-AG | 0 | 1.000000099473604e-05 | 1043 | rna-XM_038321673.2 3555597 | 8 | 120137754 | 120138796 | Arvicola amphibius 1047088 | CTG|GTAAGTCCTA...TTTTTTTTACCT/TTTTTTTTTACC...TCAAG|GGT | 1 | 1 | 9.269 |
| 17673778 | GT-AG | 0 | 1.000000099473604e-05 | 1597 | rna-XM_038321673.2 3555597 | 9 | 120136059 | 120137655 | Arvicola amphibius 1047088 | GAG|GTAAGTCGTA...AATCCTTCAGTG/TGTATATTTACA...TCTAG|GAC | 0 | 1 | 10.161 |
| 17673779 | GT-AG | 0 | 1.000000099473604e-05 | 2446 | rna-XM_038321673.2 3555597 | 10 | 120133439 | 120135884 | Arvicola amphibius 1047088 | GAT|GTAAGTGTCT...TTGTGTTTAAAA/TTGTGTTTAAAA...TGCAG|GTT | 0 | 1 | 11.745 |
| 17673780 | GT-AG | 0 | 0.0001075714271183 | 896 | rna-XM_038321673.2 3555597 | 11 | 120132318 | 120133213 | Arvicola amphibius 1047088 | TTG|GTATGTGAAC...CTTATTTTAAAA/CTTTTTGTAACT...TTTAG|ATT | 0 | 1 | 13.794 |
| 17673781 | GT-AG | 0 | 1.000000099473604e-05 | 658 | rna-XM_038321673.2 3555597 | 12 | 120131558 | 120132215 | Arvicola amphibius 1047088 | GAG|GTAAGTAATA...TGATTTTTAACC/TGATTTTTAACC...TGTAG|GTT | 0 | 1 | 14.723 |
| 17673782 | GT-AG | 0 | 1.000000099473604e-05 | 2334 | rna-XM_038321673.2 3555597 | 13 | 120128954 | 120131287 | Arvicola amphibius 1047088 | GGG|GTGAGTCTTT...TATACTTTGTCT/CACAAATATACT...GGCAG|ATG | 0 | 1 | 17.181 |
| 17673783 | GT-AG | 0 | 1.0307174532658682e-05 | 109 | rna-XM_038321673.2 3555597 | 14 | 120128714 | 120128822 | Arvicola amphibius 1047088 | CAG|GTACTGTACT...GTTGTCTTCATT/GTTGTCTTCATT...TTCAG|GCA | 2 | 1 | 18.374 |
| 17673784 | GT-AG | 0 | 2.396286049821079e-05 | 568 | rna-XM_038321673.2 3555597 | 15 | 120127984 | 120128551 | Arvicola amphibius 1047088 | CAG|GTACATAAAT...TTTTTTTTAAAA/TTTTTTTTTAAA...TTCAG|GAA | 2 | 1 | 19.849 |
| 17673785 | GT-AG | 0 | 1.000000099473604e-05 | 406 | rna-XM_038321673.2 3555597 | 16 | 120127459 | 120127864 | Arvicola amphibius 1047088 | ACA|GTAAGAGACT...TTTTCCTTTTTA/TTCCTTTTTAAT...AATAG|CTC | 1 | 1 | 20.932 |
| 17673786 | GT-AG | 0 | 4.434443679143405e-05 | 85 | rna-XM_038321673.2 3555597 | 17 | 120127286 | 120127370 | Arvicola amphibius 1047088 | GAG|GTATGGACTA...CCTGCTTTATCT/TTATCTGTTATT...CATAG|GTG | 2 | 1 | 21.734 |
| 17673787 | GT-AG | 0 | 0.0001221591945624 | 1630 | rna-XM_038321673.2 3555597 | 18 | 120125554 | 120127183 | Arvicola amphibius 1047088 | AAG|GTATAACTAT...TCTTTCATGACG/ACATCTTTCATG...AACAG|CAA | 2 | 1 | 22.662 |
| 17673788 | GT-AG | 0 | 1.000000099473604e-05 | 506 | rna-XM_038321673.2 3555597 | 19 | 120124907 | 120125412 | Arvicola amphibius 1047088 | AGG|GTAAGTAAGG...TTGTCATTGTCT/GAAGTTGTCATT...CCCAG|GAA | 2 | 1 | 23.946 |
| 17673789 | GT-AG | 0 | 0.0019454019373042 | 374 | rna-XM_038321673.2 3555597 | 20 | 120124333 | 120124706 | Arvicola amphibius 1047088 | AAG|GTAACCAGCA...TTTTCTCTGATT/TTTTCTCTGATT...TTTAG|GTA | 1 | 1 | 25.767 |
| 17673790 | GT-AG | 0 | 1.000000099473604e-05 | 159 | rna-XM_038321673.2 3555597 | 21 | 120123977 | 120124135 | Arvicola amphibius 1047088 | AAG|GTTGGTGTCC...TTTTTTTTAATT/TTTTTTTTAATT...TTTAG|GTC | 0 | 1 | 27.561 |
| 17673791 | GT-AG | 0 | 1.000000099473604e-05 | 2409 | rna-XM_038321673.2 3555597 | 22 | 120121403 | 120123811 | Arvicola amphibius 1047088 | CAG|GTAAGGTCAT...ATTTTTTTAGTG/CTGTGACTAATC...TCCAG|GGG | 0 | 1 | 29.063 |
| 17673792 | GT-AG | 0 | 1.3638234802993687e-05 | 1239 | rna-XM_038321673.2 3555597 | 23 | 120120009 | 120121247 | Arvicola amphibius 1047088 | GAG|GTAAGTTAGA...ACTGCTTTAATT/ACTGCTTTAATT...TAAAG|GTT | 2 | 1 | 30.474 |
| 17673793 | GT-AG | 0 | 1.000000099473604e-05 | 156 | rna-XM_038321673.2 3555597 | 24 | 120119710 | 120119865 | Arvicola amphibius 1047088 | ATG|GTGAGCATTC...TCTTTCTTAATC/TCTTTCTTAATC...TTCAG|GCG | 1 | 1 | 31.776 |
| 17673794 | GT-AG | 0 | 1.000000099473604e-05 | 1574 | rna-XM_038321673.2 3555597 | 25 | 120117934 | 120119507 | Arvicola amphibius 1047088 | CAA|GTAATGGCTG...GAGGTCTAACAC/TGAGGTCTAACA...TACAG|GTC | 2 | 1 | 33.616 |
| 17673795 | GT-AG | 0 | 0.0002140922976393 | 1611 | rna-XM_038321673.2 3555597 | 26 | 120116213 | 120117823 | Arvicola amphibius 1047088 | TAG|GTATTTGAGA...AATACCTTTGTA/AACAAATTAACA...TTTAG|ATA | 1 | 1 | 34.617 |
| 17673796 | GT-AG | 0 | 1.000000099473604e-05 | 247 | rna-XM_038321673.2 3555597 | 27 | 120115818 | 120116064 | Arvicola amphibius 1047088 | TGA|GTGAGTCGGA...ATCTTGTTAATT/TGTTAATTAATT...TTCAG|AAA | 2 | 1 | 35.965 |
| 17673797 | GT-AG | 0 | 1.000000099473604e-05 | 1177 | rna-XM_038321673.2 3555597 | 28 | 120114481 | 120115657 | Arvicola amphibius 1047088 | GAG|GTGTGTAGAT...ACTGTTTTATAC/GACTGTTTTATA...TCCAG|GAC | 0 | 1 | 37.421 |
| 17673798 | GT-AG | 0 | 0.0003393022921539 | 1211 | rna-XM_038321673.2 3555597 | 29 | 120113149 | 120114359 | Arvicola amphibius 1047088 | AAG|GTAACTATCT...ACTTTTTTTTTT/GAGTTAGTGACT...CCCAG|AAC | 1 | 1 | 38.523 |
| 17673799 | GT-AG | 0 | 1.000000099473604e-05 | 1003 | rna-XM_038321673.2 3555597 | 30 | 120111897 | 120112899 | Arvicola amphibius 1047088 | CAG|GTTAGTAAAA...GTGGTTTTGTAT/AAATAAATCAGT...TATAG|GCC | 1 | 1 | 40.79 |
| 17673800 | GT-AG | 0 | 7.303468303995763e-05 | 1693 | rna-XM_038321673.2 3555597 | 31 | 120109916 | 120111608 | Arvicola amphibius 1047088 | CAG|GTACAATTTT...TTCTTGTTATTT/GTTCTTGTTATT...GGCAG|TGC | 1 | 1 | 43.413 |
| 17673801 | GT-AG | 0 | 0.0003529702439942 | 114 | rna-XM_038321673.2 3555597 | 32 | 120109657 | 120109770 | Arvicola amphibius 1047088 | CAG|GTATGTATGT...AAATACTTGATG/GACTTGTTTATT...TATAG|TCA | 2 | 1 | 44.733 |
| 17673802 | GC-AG | 0 | 1.000000099473604e-05 | 1240 | rna-XM_038321673.2 3555597 | 33 | 120108278 | 120109517 | Arvicola amphibius 1047088 | CAG|GCAAGGAAAA...ATGTTCATATTT/TCTATGTTCATA...TTTAG|GAT | 0 | 1 | 45.998 |
| 17673803 | GT-AG | 0 | 9.42962460555022e-05 | 105 | rna-XM_038321673.2 3555597 | 34 | 120107942 | 120108046 | Arvicola amphibius 1047088 | CAA|GTAGGTGGCT...TATCCCTTAATT/GGTTTCCTTATC...CTTAG|ATT | 0 | 1 | 48.102 |
| 17673804 | GT-AG | 0 | 1.000000099473604e-05 | 502 | rna-XM_038321673.2 3555597 | 35 | 120107262 | 120107763 | Arvicola amphibius 1047088 | GAG|GTAATGACCA...ATCACTTTATCT/CAGGTTTTTATC...TGAAG|GAA | 1 | 1 | 49.722 |
| 17673805 | GT-AG | 0 | 0.0009528579038604 | 552 | rna-XM_038321673.2 3555597 | 36 | 120106548 | 120107099 | Arvicola amphibius 1047088 | CAG|GTATGCAACA...TTTTCCATACCA/TTGTTTTCCATA...TGTAG|GAA | 1 | 1 | 51.197 |
| 17673806 | GT-AG | 0 | 1.000000099473604e-05 | 245 | rna-XM_038321673.2 3555597 | 37 | 120106085 | 120106329 | Arvicola amphibius 1047088 | CAG|GTACAAAAAA...TCTGATTTGATG/CCGTGTCTGATT...TGCAG|GTT | 0 | 1 | 53.182 |
| 17673807 | GT-AG | 0 | 0.0455971922562692 | 2154 | rna-XM_038321673.2 3555597 | 38 | 120103776 | 120105929 | Arvicola amphibius 1047088 | CAA|GTATGTCTTC...AAAACTTTAATA/AAGTTTTTGAAT...GTTAG|AGA | 2 | 1 | 54.593 |
| 17673808 | GT-AG | 0 | 1.000000099473604e-05 | 2488 | rna-XM_038321673.2 3555597 | 39 | 120101065 | 120103552 | Arvicola amphibius 1047088 | GAG|GTACAGAGTA...TCCACCTTATGA/TGTTTTTCCACC...CATAG|ATA | 0 | 1 | 56.624 |
| 17673809 | GT-AG | 0 | 1.000000099473604e-05 | 393 | rna-XM_038321673.2 3555597 | 40 | 120100425 | 120100817 | Arvicola amphibius 1047088 | AAG|GTAGAGTTAG...TGTTTGTTACTC/TTGTTACTCATC...TATAG|GAT | 1 | 1 | 58.873 |
| 17673810 | GT-AG | 0 | 1.000000099473604e-05 | 1068 | rna-XM_038321673.2 3555597 | 41 | 120099127 | 120100194 | Arvicola amphibius 1047088 | AAG|GTTGGGTAAA...TTTTCTTTAAAA/CTTTTCCTCAAG...TTAAG|GCC | 0 | 1 | 60.967 |
| 17673811 | GT-AG | 0 | 1.000000099473604e-05 | 775 | rna-XM_038321673.2 3555597 | 42 | 120098103 | 120098877 | Arvicola amphibius 1047088 | CAG|GTACGTGACA...GCTCCCTTCCCA/ATGATGTTTACC...TTTAG|TCC | 0 | 1 | 63.234 |
| 17673812 | GT-AG | 0 | 1.000000099473604e-05 | 98 | rna-XM_038321673.2 3555597 | 43 | 120097866 | 120097963 | Arvicola amphibius 1047088 | AAG|GTAAGATAAA...TTTTTTTTGATG/TTTTTTTTGATG...TCAAG|GTA | 1 | 1 | 64.5 |
| 17673813 | GT-AG | 0 | 1.5796730210154173e-05 | 124 | rna-XM_038321673.2 3555597 | 44 | 120097632 | 120097755 | Arvicola amphibius 1047088 | CAG|GTATGACTTT...AGTTTATTGAAA/ATTGAGTTTATT...AATAG|GTT | 0 | 1 | 65.501 |
| 17673814 | GT-AG | 0 | 1.000000099473604e-05 | 467 | rna-XM_038321673.2 3555597 | 45 | 120096946 | 120097412 | Arvicola amphibius 1047088 | AAG|GTGAAGCCCA...TCTTCCTGATCA/GTCTTCCTGATC...TCCAG|GTG | 0 | 1 | 67.495 |
| 17673815 | GT-AG | 0 | 1.000000099473604e-05 | 838 | rna-XM_038321673.2 3555597 | 46 | 120095917 | 120096754 | Arvicola amphibius 1047088 | CAG|GTTTAAAATA...TGATCTTTGGCT/CTACAAGTGATC...ATCAG|ATA | 2 | 1 | 69.234 |
| 17673816 | GT-AG | 0 | 7.064242479749119e-05 | 115 | rna-XM_038321673.2 3555597 | 47 | 120095629 | 120095743 | Arvicola amphibius 1047088 | TTG|GTATGTGAGC...GTTTCTTTGTTA/ACTATCTTTACT...ACAAG|GGC | 1 | 1 | 70.809 |
| 17673817 | GT-AG | 0 | 1.316943299095935e-05 | 1684 | rna-XM_038321673.2 3555597 | 48 | 120093665 | 120095348 | Arvicola amphibius 1047088 | GAA|GTAAGTTGAA...CCTGCTTTGTTT/GGAATATTAATA...TATAG|ATA | 2 | 1 | 73.359 |
| 17673818 | GT-AG | 0 | 1.000000099473604e-05 | 535 | rna-XM_038321673.2 3555597 | 49 | 120092857 | 120093391 | Arvicola amphibius 1047088 | AAG|GTGAGCTAGC...TAGTCTGTAACA/TCTGTTTTCATT...CATAG|GCG | 2 | 1 | 75.844 |
| 17673819 | GT-AG | 0 | 1.000000099473604e-05 | 1182 | rna-XM_038321673.2 3555597 | 50 | 120091440 | 120092621 | Arvicola amphibius 1047088 | CAG|GTGAACATGT...GAAACATTAACA/GAAACATTAACA...TATAG|GAA | 0 | 1 | 77.984 |
| 17673820 | GT-AG | 0 | 1.000000099473604e-05 | 2556 | rna-XM_038321673.2 3555597 | 51 | 120088657 | 120091212 | Arvicola amphibius 1047088 | CAG|GTAAGTGTGC...AGGTTTTTAATG/AGGTTTTTAATG...TCTAG|AAT | 2 | 1 | 80.051 |
| 17673821 | GT-AG | 0 | 0.000120624585072 | 86 | rna-XM_038321673.2 3555597 | 52 | 120088420 | 120088505 | Arvicola amphibius 1047088 | AAG|GTAAACATTA...CATTCCTTCTTT/TAGTCATTCACA...CATAG|TTA | 0 | 1 | 81.426 |
| 17673822 | GT-AG | 0 | 1.000000099473604e-05 | 159 | rna-XM_038321673.2 3555597 | 53 | 120088071 | 120088229 | Arvicola amphibius 1047088 | CAG|GTAATATTAG...TTAATCTTACCA/TTTAATCTTACC...TTTAG|GAT | 1 | 1 | 83.156 |
| 17673823 | GT-AG | 0 | 1.000000099473604e-05 | 1765 | rna-XM_038321673.2 3555597 | 54 | 120085836 | 120087600 | Arvicola amphibius 1047088 | CAG|GTAAGCAGTT...CATTCCTTTTTT/ATTTATCTGAAG...AATAG|TGG | 0 | 1 | 87.435 |
| 17673824 | GT-AG | 0 | 1.000000099473604e-05 | 449 | rna-XM_038321673.2 3555597 | 55 | 120085252 | 120085700 | Arvicola amphibius 1047088 | CAG|GTGAGTCTGA...ATTCTGCTGACT/ATTCTGCTGACT...TACAG|GAG | 0 | 1 | 88.664 |
| 17673825 | GT-AG | 0 | 1.000000099473604e-05 | 255 | rna-XM_038321673.2 3555597 | 56 | 120084844 | 120085098 | Arvicola amphibius 1047088 | CAG|GTACTGCATC...TGTTTTTTACAT/TTTTACATCATC...CACAG|GTC | 0 | 1 | 90.057 |
| 17673826 | GT-AG | 0 | 1.000000099473604e-05 | 133 | rna-XM_038321673.2 3555597 | 57 | 120084531 | 120084663 | Arvicola amphibius 1047088 | GTG|GTAAGTCCTG...TAATGCTTATTC/TTAATGCTTATT...TTTAG|GAC | 0 | 1 | 91.696 |
| 17673827 | GT-AG | 0 | 1.000000099473604e-05 | 578 | rna-XM_038321673.2 3555597 | 58 | 120083725 | 120084302 | Arvicola amphibius 1047088 | AAG|GTAATGCATC...ATTGCTTGAACA/CTGTTGCTAATG...GGCAG|GAT | 0 | 1 | 93.772 |
| 17673828 | GT-AG | 0 | 1.000000099473604e-05 | 117 | rna-XM_038321673.2 3555597 | 59 | 120083396 | 120083512 | Arvicola amphibius 1047088 | GAG|GTAAGGGTCA...CATTCCATAAAT/AAATGTTTCATA...TTCAG|TAT | 2 | 1 | 95.702 |
| 17673829 | GT-AG | 0 | 1.000000099473604e-05 | 3908 | rna-XM_038321673.2 3555597 | 60 | 120079381 | 120083288 | Arvicola amphibius 1047088 | CAG|GTATGAAAGA...CAATTCTTTCTC/TCTGTTTGCATG...TGTAG|CTG | 1 | 1 | 96.677 |
| 17673830 | GT-AG | 0 | 1.276834271473932e-05 | 86 | rna-XM_038321673.2 3555597 | 61 | 120079115 | 120079200 | Arvicola amphibius 1047088 | AAG|GTAATCAGCT...ACTGGCTTGACT/TCTATATTAAAT...TATAG|CTG | 1 | 1 | 98.316 |
| 17673831 | GT-AG | 0 | 4.6491858006694826e-05 | 1186 | rna-XM_038321673.2 3555597 | 62 | 120077822 | 120079007 | Arvicola amphibius 1047088 | CAG|GTGACTATTT...TCTCTCTTACCT/TTCTCTCTTACC...CACAG|GTT | 0 | 1 | 99.29 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL
,PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);