introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron (e.g., GT-AG)
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, probability (0-100%) that the intron is a minor intron
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
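As a quick sanity check on the column definitions, the sketch below models a subset of the columns as a Python dataclass and checks that `length` agrees with inclusive coordinates (`end - start + 1`). That relationship is inferred from the sample rows on this page rather than stated by the database, and the class name `Intron` and the method `span` are hypothetical names used only for illustration.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Intron:
    """Hypothetical container for a subset of the introns columns."""
    id: int
    dinucleotide_pair: str
    is_minor: int          # 1 = minor intron, 0 = not
    length: int            # base pairs
    start: int
    end: int
    phase: Optional[int]   # 0, 1, 2, or None outside the coding sequence
    in_cds: int            # 1 = within the coding sequence, 0 = not

    def span(self) -> int:
        # Assumption: start and end are both inclusive coordinates.
        return self.end - self.start + 1


# Values copied from the first two rows of the table on this page.
rows = [
    Intron(122605760, "GT-AG", 0, 31430, 87908508, 87939937, 2, 1),
    Intron(122605761, "GT-AG", 0, 310, 87908034, 87908343, 1, 1),
]

for intron in rows:
    # The stored length should equal the inclusive coordinate span.
    assert intron.span() == intron.length
```

If the assertion ever fails on other rows, the inclusive-coordinate assumption does not hold for them and `length` should be treated as the authoritative value.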
62 rows where transcript_id = 22607837
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 122605760 | GT-AG | 0 | 1.000000099473604e-05 | 31430 | rna-XM_021193999.2 22607837 | 1 | 87908508 | 87939937 | Mus pahari 10093 | CAG\|GTGGGTGACC...TTTTCTTTTTCT/CTGCTACTAATA...TTCAG\|AAC | 2 | 1 | 0.783 |
| 122605761 | GT-AG | 0 | 3.3083394688725606e-05 | 310 | rna-XM_021193999.2 22607837 | 2 | 87908034 | 87908343 | Mus pahari 10093 | AGG\|GTATTTAGAT...ATAATGTAAACA/GTTAAGATAATG...TTCAG\|GTG | 1 | 1 | 2.277 |
| 122605762 | GT-AG | 0 | 1.000000099473604e-05 | 1622 | rna-XM_021193999.2 22607837 | 3 | 87906256 | 87907877 | Mus pahari 10093 | AGG\|GTAAGACTTG...ATATTTTTAGTG/CATATTTTTAGT...CACAG\|AGC | 1 | 1 | 3.698 |
| 122605763 | GT-AG | 0 | 1.000000099473604e-05 | 1392 | rna-XM_021193999.2 22607837 | 4 | 87904727 | 87906118 | Mus pahari 10093 | CTG\|GTGAGTAAAT...CTGTTTTTAAGT/CTGTTTTTAAGT...TTAAG\|GTA | 0 | 1 | 4.945 |
| 122605764 | GT-AG | 0 | 1.000000099473604e-05 | 821 | rna-XM_021193999.2 22607837 | 5 | 87903847 | 87904667 | Mus pahari 10093 | AAG\|GTAAGTAACA...TTTGCATTAAAA/TTTTGTTTCATT...CACAG\|TAG | 2 | 1 | 5.483 |
| 122605765 | GT-AG | 0 | 1.000000099473604e-05 | 3459 | rna-XM_021193999.2 22607837 | 6 | 87900174 | 87903632 | Mus pahari 10093 | CAG\|GTGAGAGCCA...CTGATCTTGATT/ATGTTTCTGATC...TTCAG\|CTT | 0 | 1 | 7.432 |
| 122605766 | GT-AG | 0 | 1.000000099473604e-05 | 399 | rna-XM_021193999.2 22607837 | 7 | 87899649 | 87900047 | Mus pahari 10093 | AGG\|GTGAGTTTCT...GAAACCTTAAAA/AAATGACTGACC...TCAAG\|GAT | 0 | 1 | 8.579 |
| 122605767 | GT-AG | 0 | 1.000000099473604e-05 | 898 | rna-XM_021193999.2 22607837 | 8 | 87898678 | 87899575 | Mus pahari 10093 | CTG\|GTAAGTGTTT...TTTTTTTTAAAC/TTTTTTTTAAAC...TCAAG\|GGT | 1 | 1 | 9.244 |
| 122605768 | GT-AG | 0 | 1.000000099473604e-05 | 2680 | rna-XM_021193999.2 22607837 | 9 | 87895900 | 87898579 | Mus pahari 10093 | GAG\|GTAAGTTGTA...CAAATCTTATAC/TGTGTATTCACA...TCTAG\|GAC | 0 | 1 | 10.137 |
| 122605769 | GT-AG | 0 | 1.000000099473604e-05 | 2127 | rna-XM_021193999.2 22607837 | 10 | 87893599 | 87895725 | Mus pahari 10093 | GAT\|GTAAGTGTCT...GAATGTTTAAAA/GTTTTCCCCATT...TGCAG\|GTT | 0 | 1 | 11.721 |
| 122605770 | GT-AG | 0 | 0.0001502370231122 | 682 | rna-XM_021193999.2 22607837 | 11 | 87892692 | 87893373 | Mus pahari 10093 | TTG\|GTATGTGAAT...CTTATTTTAAAA/CTTTTTGTAACT...TTTAG\|ATT | 0 | 1 | 13.77 |
| 122605771 | GT-AG | 0 | 1.000000099473604e-05 | 1218 | rna-XM_021193999.2 22607837 | 12 | 87891372 | 87892589 | Mus pahari 10093 | GAG\|GTAAGTAATA...TGATTTTTAACC/TGATTTTTAACC...TGTAG\|GTT | 0 | 1 | 14.699 |
| 122605772 | GT-AG | 0 | 1.000000099473604e-05 | 2765 | rna-XM_021193999.2 22607837 | 13 | 87888337 | 87891101 | Mus pahari 10093 | GGG\|GTAAGTAATT...TGGTTTTTACCT/TTGGTTTTTACC...GGCAG\|ATG | 0 | 1 | 17.158 |
| 122605773 | GT-AG | 0 | 4.946569971325715e-05 | 120 | rna-XM_021193999.2 22607837 | 14 | 87888086 | 87888205 | Mus pahari 10093 | CAG\|GTACTGTGCT...CTTTTTTTGATG/CTTTTTTTGATG...TTCAG\|GCA | 2 | 1 | 18.352 |
| 122605774 | GT-AG | 0 | 0.0082463500864679 | 503 | rna-XM_021193999.2 22607837 | 15 | 87887421 | 87887923 | Mus pahari 10093 | AAG\|GTACCTAAGT...TTTGTTTTAATG/TTTGTTTTAATG...TTCAG\|GAA | 2 | 1 | 19.827 |
| 122605775 | GT-AG | 0 | 1.000000099473604e-05 | 104 | rna-XM_021193999.2 22607837 | 16 | 87887198 | 87887301 | Mus pahari 10093 | ACA\|GTAAGAGACT...TTTTCCTTTTTA/TGAGTGTTAACT...AATAG\|CTC | 1 | 1 | 20.911 |
| 122605776 | GT-AG | 0 | 1.535798348995535e-05 | 86 | rna-XM_021193999.2 22607837 | 17 | 87887024 | 87887109 | Mus pahari 10093 | GAG\|GTACAGATGA...CCTGCTTTATCT/TTATCTATTATT...CATAG\|GTG | 2 | 1 | 21.712 |
| 122605777 | GT-AG | 0 | 0.000109974666125 | 2140 | rna-XM_021193999.2 22607837 | 18 | 87884782 | 87886921 | Mus pahari 10093 | AAG\|GTATAACTAT...TTCTCCTAAAAC/ATAATAATGACT...AACAG\|CAA | 2 | 1 | 22.641 |
| 122605778 | GT-AG | 0 | 1.000000099473604e-05 | 468 | rna-XM_021193999.2 22607837 | 19 | 87884173 | 87884640 | Mus pahari 10093 | AGG\|GTAAGACTCT...TGATTATTAAAA/GAAGTTGTCATT...CCCAG\|GAA | 2 | 1 | 23.925 |
| 122605779 | GT-AG | 0 | 0.00213583733268 | 473 | rna-XM_021193999.2 22607837 | 20 | 87883500 | 87883972 | Mus pahari 10093 | AAG\|GTAACCAACA...TTTTCTCTGATT/TTTTCTCTGATT...TTCAG\|GTA | 1 | 1 | 25.747 |
| 122605780 | GT-AG | 0 | 1.000000099473604e-05 | 156 | rna-XM_021193999.2 22607837 | 21 | 87883147 | 87883302 | Mus pahari 10093 | AAG\|GTTGGTGTTG...GAGTCCTTTTCT/GGCATCTTCACC...TAAAG\|GTC | 0 | 1 | 27.541 |
| 122605781 | GT-AG | 0 | 1.000000099473604e-05 | 564 | rna-XM_021193999.2 22607837 | 22 | 87882418 | 87882981 | Mus pahari 10093 | CAG\|GTAATGTGCT...TAATCTTTTGCC/CTGTGATTAATC...TCCAG\|GGG | 0 | 1 | 29.044 |
| 122605782 | GT-AG | 0 | 1.000000099473604e-05 | 856 | rna-XM_021193999.2 22607837 | 23 | 87881407 | 87882262 | Mus pahari 10093 | GAG\|GTAAGTTAGA...GCTTCTATATTT/AATTTTTTCAAG...TTAAG\|GTT | 2 | 1 | 30.455 |
| 122605783 | GT-AG | 0 | 1.000000099473604e-05 | 166 | rna-XM_021193999.2 22607837 | 24 | 87881098 | 87881263 | Mus pahari 10093 | ATG\|GTGAGCATTC...TTTTTTTTAAAA/TTTTTTTTTAAA...TTCAG\|GTG | 1 | 1 | 31.758 |
| 122605784 | GT-AG | 0 | 1.000000099473604e-05 | 1341 | rna-XM_021193999.2 22607837 | 25 | 87879555 | 87880895 | Mus pahari 10093 | CAA\|GTAAGGGCTG...TAAGGCCTAACA/TAAGGCCTAACA...TACAG\|GTC | 2 | 1 | 33.597 |
| 122605785 | GT-AG | 0 | 0.0001157762745484 | 1490 | rna-XM_021193999.2 22607837 | 26 | 87877955 | 87879444 | Mus pahari 10093 | TAG\|GTATTTGAGA...ATTAATTTATCT/ATTTATCTAACT...TTTAG\|ATA | 1 | 1 | 34.599 |
| 122605786 | GT-AG | 0 | 1.000000099473604e-05 | 226 | rna-XM_021193999.2 22607837 | 27 | 87877581 | 87877806 | Mus pahari 10093 | CGA\|GTGAGTCTCT...AGAATGTTAATT/TGTTAATTGACA...TTCAG\|AAA | 2 | 1 | 35.947 |
| 122605787 | GT-AG | 0 | 0.0003667307469418 | 1331 | rna-XM_021193999.2 22607837 | 28 | 87876090 | 87877420 | Mus pahari 10093 | GAG\|GTATGTTGAT...ATTGTTTTACAC/GATTGTTTTACA...GGCAG\|GAC | 0 | 1 | 37.404 |
| 122605788 | GT-AG | 0 | 2.3117351814536988e-05 | 3779 | rna-XM_021193999.2 22607837 | 29 | 87872190 | 87875968 | Mus pahari 10093 | AAG\|GTAACATATT...ACTGCCTGGAGT/TGGGAACTAATG...TCCAG\|AAC | 1 | 1 | 38.506 |
| 122605789 | GT-AG | 0 | 1.000000099473604e-05 | 1092 | rna-XM_021193999.2 22607837 | 30 | 87870849 | 87871940 | Mus pahari 10093 | CAG\|GTTAGTAAAC...TCTCTCTCAAAA/CTCTCTCTCAAA...TGTAG\|GCC | 1 | 1 | 40.774 |
| 122605790 | GT-AG | 0 | 0.0002397249383209 | 1978 | rna-XM_021193999.2 22607837 | 31 | 87868586 | 87870563 | Mus pahari 10093 | CAG\|GTATGGCTTC...CTCTTCTTATTT/ACTCTTCTTATT...GGCAG\|TGC | 1 | 1 | 43.37 |
| 122605791 | GT-AG | 0 | 0.0003043604428367 | 110 | rna-XM_021193999.2 22607837 | 32 | 87868331 | 87868440 | Mus pahari 10093 | CAG\|GTATGTATGT...ACTTCTTTGTTT/AAATGTTTGATG...TATAG\|TCA | 2 | 1 | 44.69 |
| 122605792 | GC-AG | 0 | 1.000000099473604e-05 | 297 | rna-XM_021193999.2 22607837 | 33 | 87867895 | 87868191 | Mus pahari 10093 | CAG\|GCAAGGAAAA...TCTTCCTTTTCA/TTCCTTTTCACG...TTTAG\|GAT | 0 | 1 | 45.956 |
| 122605793 | GT-AG | 0 | 1.000000099473604e-05 | 103 | rna-XM_021193999.2 22607837 | 34 | 87867561 | 87867663 | Mus pahari 10093 | CAG\|GTAGGTGGCA...GTTTCCTTATCC/GGTTTCCTTATC...CTTAG\|GTT | 0 | 1 | 48.06 |
| 122605794 | GT-AG | 0 | 1.000000099473604e-05 | 510 | rna-XM_021193999.2 22607837 | 35 | 87866870 | 87867379 | Mus pahari 10093 | GAG\|GTAATGGCCT...GTTTTATTAACT/GTTTTATTAACT...TGAAG\|GAA | 1 | 1 | 49.709 |
| 122605795 | GT-AG | 0 | 0.0006347859694797 | 549 | rna-XM_021193999.2 22607837 | 36 | 87866159 | 87866707 | Mus pahari 10093 | CAG\|GTATGCAACA...CATTCCATACCC/TTGCATTCCATA...TGTAG\|GAA | 1 | 1 | 51.184 |
| 122605796 | GT-AG | 0 | 1.000000099473604e-05 | 258 | rna-XM_021193999.2 22607837 | 37 | 87865683 | 87865940 | Mus pahari 10093 | CAG\|GTATAAGAAA...TCTGTATTAATT/TCTGTATTAATT...TGCAG\|GTC | 0 | 1 | 53.169 |
| 122605797 | GT-AG | 0 | 0.069276075326287 | 2249 | rna-XM_021193999.2 22607837 | 38 | 87863279 | 87865527 | Mus pahari 10093 | CAA\|GTATGTCTTC...TCTTTTTTGATA/TCTTTTTTGATA...GTCAG\|AGA | 2 | 1 | 54.581 |
| 122605798 | GT-AG | 0 | 8.327297316046744e-05 | 2537 | rna-XM_021193999.2 22607837 | 39 | 87860519 | 87863055 | Mus pahari 10093 | GAG\|GTATAGAGTA...TTTTTTTTTTCC/AGAATGCAGACT...CATAG\|ATA | 0 | 1 | 56.612 |
| 122605799 | GT-AG | 0 | 1.000000099473604e-05 | 381 | rna-XM_021193999.2 22607837 | 40 | 87859891 | 87860271 | Mus pahari 10093 | AAG\|GTAGAGCTAA...GTAATTTTGATT/TTGTCACTCATC...CATAG\|GAC | 1 | 1 | 58.862 |
| 122605800 | GT-AG | 0 | 1.000000099473604e-05 | 1065 | rna-XM_021193999.2 22607837 | 41 | 87858596 | 87859660 | Mus pahari 10093 | AAG\|GTTTGGTAAA...CTTTCCTTAAGT/TTTCTTTTGAAA...TTAAG\|GCC | 0 | 1 | 60.956 |
| 122605801 | GT-AG | 0 | 0.0012194962506333 | 982 | rna-XM_021193999.2 22607837 | 42 | 87857365 | 87858346 | Mus pahari 10093 | CAG\|GTATGCAGCA...TGGCCCATAATG/ATAATGTTTACC...CTTAG\|TCC | 0 | 1 | 63.224 |
| 122605802 | GT-AG | 0 | 1.000000099473604e-05 | 86 | rna-XM_021193999.2 22607837 | 43 | 87857140 | 87857225 | Mus pahari 10093 | AAG\|GTAAGATGGA...GTTATTTTGGTG/AGTAAAATGATT...TCAAG\|GTA | 1 | 1 | 64.49 |
| 122605803 | GT-AG | 0 | 0.0119243476466622 | 128 | rna-XM_021193999.2 22607837 | 44 | 87856902 | 87857029 | Mus pahari 10093 | CAG\|GTATGCCTTT...TTGACTTTACTG/CTTGTGTTGACT...ACTAG\|GTT | 0 | 1 | 65.492 |
| 122605804 | GT-AG | 0 | 1.000000099473604e-05 | 401 | rna-XM_021193999.2 22607837 | 45 | 87856282 | 87856682 | Mus pahari 10093 | AAG\|GTGAAGCCCA...TGTTCCTGATCA/CCTGTACTTACT...TCCAG\|GTG | 0 | 1 | 67.486 |
| 122605805 | GT-AG | 0 | 1.000000099473604e-05 | 1099 | rna-XM_021193999.2 22607837 | 46 | 87854992 | 87856090 | Mus pahari 10093 | CAG\|GTTGAAAATA...TGAACTTTGACT/TGAACTTTGACT...ATTAG\|GTA | 2 | 1 | 69.226 |
| 122605806 | GT-AG | 0 | 2.1641317780580745e-05 | 127 | rna-XM_021193999.2 22607837 | 47 | 87854692 | 87854818 | Mus pahari 10093 | TTG\|GTATGTGAGC...TTCATTGTGACT/AATGGTTTCATT...TTTAG\|GTC | 1 | 1 | 70.801 |
| 122605807 | GT-AG | 0 | 8.626369835790981e-05 | 1064 | rna-XM_021193999.2 22607837 | 48 | 87853348 | 87854411 | Mus pahari 10093 | GAA\|GTAAGTTTAA...CCTGTTTTATTT/ACCTGTTTTATT...TATAG\|GTA | 2 | 1 | 73.352 |
| 122605808 | GT-AG | 0 | 1.000000099473604e-05 | 166 | rna-XM_021193999.2 22607837 | 49 | 87852909 | 87853074 | Mus pahari 10093 | AAG\|GTGAGCTAGT...TTTGTTTTCATT/TTTGTTTTCATT...CATAG\|GCG | 2 | 1 | 75.838 |
| 122605809 | GT-AG | 0 | 1.000000099473604e-05 | 3274 | rna-XM_021193999.2 22607837 | 50 | 87849400 | 87852673 | Mus pahari 10093 | CAG\|GTGAACACTT...TAACACTTAGTA/ATAACACTTAGT...TGTAG\|GAA | 0 | 1 | 77.978 |
| 122605810 | GT-AG | 0 | 1.000000099473604e-05 | 2502 | rna-XM_021193999.2 22607837 | 51 | 87846671 | 87849172 | Mus pahari 10093 | CAG\|GTAAATGTGT...CTCTCCTTCATG/AGGGTTCTAACA...TCTAG\|AAT | 2 | 1 | 80.046 |
| 122605811 | GT-AG | 0 | 0.0001379643809897 | 93 | rna-XM_021193999.2 22607837 | 52 | 87846427 | 87846519 | Mus pahari 10093 | AAG\|GTAAACATTA...ATTCCCTTTTCC/TAGTCACTGAAT...CATAG\|TTA | 0 | 1 | 81.421 |
| 122605812 | GT-AG | 0 | 1.000000099473604e-05 | 160 | rna-XM_021193999.2 22607837 | 53 | 87846077 | 87846236 | Mus pahari 10093 | CAG\|GTAATGTTAG...ATTTTCTTTCTA/ATGATTTTCATT...TGTAG\|GAT | 1 | 1 | 83.151 |
| 122605813 | GT-AG | 0 | 1.000000099473604e-05 | 2093 | rna-XM_021193999.2 22607837 | 54 | 87843514 | 87845606 | Mus pahari 10093 | CAG\|GTAAGCAATT...AAAGTTTTATTC/CAAAGTTTTATT...AATAG\|TGG | 0 | 1 | 87.432 |
| 122605814 | GT-AG | 0 | 1.000000099473604e-05 | 1715 | rna-XM_021193999.2 22607837 | 55 | 87841664 | 87843378 | Mus pahari 10093 | CAG\|GTGTGTCTGA...CAGTTTTTCTTC/TAAAGGCTCAGT...TACAG\|GAG | 0 | 1 | 88.661 |
| 122605815 | GT-AG | 0 | 1.000000099473604e-05 | 241 | rna-XM_021193999.2 22607837 | 56 | 87841270 | 87841510 | Mus pahari 10093 | CAG\|GTACTGCACG...ATCACTTTCTCC/CACATCATCACT...CACAG\|GTC | 0 | 1 | 90.055 |
| 122605816 | GT-AG | 0 | 1.000000099473604e-05 | 654 | rna-XM_021193999.2 22607837 | 57 | 87840436 | 87841089 | Mus pahari 10093 | GTG\|GTGAGTCCTG...TAATACTTATTC/TTAATACTTATT...TTTAG\|GAC | 0 | 1 | 91.694 |
| 122605817 | GT-AG | 0 | 1.000000099473604e-05 | 1083 | rna-XM_021193999.2 22607837 | 58 | 87839125 | 87840207 | Mus pahari 10093 | AAG\|GTAATGAATC...ATGTTTCTAATG/ATGTTTCTAATG...GGTAG\|GAC | 0 | 1 | 93.77 |
| 122605818 | GT-AG | 0 | 1.000000099473604e-05 | 134 | rna-XM_021193999.2 22607837 | 59 | 87838779 | 87838912 | Mus pahari 10093 | GAG\|GTAAGGGTGC...CATTCCATAAAT/AAATATTTCATA...TTCAG\|TAT | 2 | 1 | 95.701 |
| 122605819 | GT-AG | 0 | 1.000000099473604e-05 | 2844 | rna-XM_021193999.2 22607837 | 60 | 87835828 | 87838671 | Mus pahari 10093 | CAG\|GTATGAAAGA...CAATTCTTTTTC/TGTATGCAAATT...TTTAG\|CGG | 1 | 1 | 96.676 |
| 122605820 | GT-AG | 0 | 1.000000099473604e-05 | 81 | rna-XM_021193999.2 22607837 | 61 | 87835567 | 87835647 | Mus pahari 10093 | AAG\|GTAATTGGTC...GTTGGTTTGACT/GTTGGTTTGACT...CATAG\|CTG | 1 | 1 | 98.315 |
| 122605821 | GT-AG | 0 | 1.6899469981604092e-05 | 1187 | rna-XM_021193999.2 22607837 | 62 | 87834273 | 87835459 | Mus pahari 10093 | CAG\|GTGACTATTT...CGGTTCTTACTG/TTCTTACTGATT...CACAG\|GTT | 0 | 1 | 99.29 |
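
The filtered view above (`transcript_id = 22607837`) can be reproduced against a local copy of the data with a plain SQL filter. The sketch below is a minimal illustration using Python's standard `sqlite3` module and an in-memory table that holds only a subset of the columns and three rows copied from the table above; the reduced column set and the variable names are assumptions made for brevity, not the database's own tooling.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Trimmed-down version of the introns table (subset of columns only).
conn.execute(
    "CREATE TABLE introns ("
    "id INTEGER PRIMARY KEY, dinucleotide_pair TEXT, is_minor INTEGER, "
    "length INTEGER, transcript_id INTEGER, ordinal_index INTEGER, phase INTEGER)"
)

# Three rows copied from the table above (introns 1, 2 and 33).
sample = [
    (122605760, "GT-AG", 0, 31430, 22607837, 1, 2),
    (122605761, "GT-AG", 0, 310, 22607837, 2, 1),
    (122605792, "GC-AG", 0, 297, 22607837, 33, 0),
]
conn.executemany("INSERT INTO introns VALUES (?, ?, ?, ?, ?, ?, ?)", sample)

# Same filter as the page: all introns of one transcript, ordered by id.
filtered = conn.execute(
    "SELECT id, ordinal_index, length FROM introns "
    "WHERE transcript_id = ? ORDER BY id",
    (22607837,),
).fetchall()

# Counting rows per terminal dinucleotide pair within that transcript.
facet = conn.execute(
    "SELECT dinucleotide_pair, COUNT(*) FROM introns "
    "WHERE transcript_id = ? GROUP BY dinucleotide_pair "
    "ORDER BY dinucleotide_pair",
    (22607837,),
).fetchall()

print(filtered)
print(facet)
```

Run against the full table, the same `GROUP BY` would summarize all 62 rows rather than the three sample rows used here.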
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
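
To see how an index such as `idx_introns_transcript_id` affects a lookup by transcript, SQLite's `EXPLAIN QUERY PLAN` can be run against the filter. The sketch below is an illustration only: it uses Python's standard `sqlite3` module with an empty in-memory table that keeps just three of the columns, so the exact plan text may differ from what the full database reports.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Trimmed-down schema: only the columns needed to exercise one index.
conn.executescript(
    """
    CREATE TABLE introns (
      id INTEGER PRIMARY KEY,
      transcript_id INTEGER,
      is_minor INTEGER
    );
    CREATE INDEX idx_introns_transcript_id ON introns (transcript_id);
    """
)

# Ask SQLite how it would execute the per-transcript filter.
plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT * FROM introns WHERE transcript_id = ?",
    (22607837,),
).fetchall()

# The last field of each plan row is the human-readable description.
details = [row[-1] for row in plan]
for line in details:
    print(line)
```

The wording of the plan varies between SQLite versions, so checking for the index name in the description is more robust than matching the full string.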