introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
59 rows where transcript_id = 9059367
This data as json, CSV (advanced)
Suggested facets: score, phase
| id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 48952828 | GT-AG | 0 | 1.000000099473604e-05 | 85109 | rna-XM_036566096.1 9059367 | 1 | 26529039 | 26614147 | Colossoma macropomum 42526 | CAG|GTAGGATGGA...CTCTCTTTGTCT/GATTGAGTAATC...TGCAG|GGT | 1 | 1 | 0.751 |
| 48952829 | GT-AG | 0 | 1.000000099473604e-05 | 5403 | rna-XM_036566096.1 9059367 | 2 | 26614223 | 26619625 | Colossoma macropomum 42526 | CAG|GTGAGTGTAT...TTGTGTTTCTCT/TCTGTGTTTAGG...TTCAG|GTG | 1 | 1 | 1.592 |
| 48952830 | GT-AG | 0 | 1.000000099473604e-05 | 6729 | rna-XM_036566096.1 9059367 | 3 | 26619741 | 26626469 | Colossoma macropomum 42526 | CAG|GTCAGCAGCA...TGTTTTCTGACA/CTCTTTCTCACT...TCCAG|TGA | 2 | 1 | 2.881 |
| 48952831 | GT-AG | 0 | 1.000000099473604e-05 | 444 | rna-XM_036566096.1 9059367 | 4 | 26626663 | 26627106 | Colossoma macropomum 42526 | GAG|GTACTGAAAA...TCCTCTTTACTG/TTCCTCTTTACT...GACAG|ACT | 0 | 1 | 5.044 |
| 48952832 | GT-AG | 0 | 1.000000099473604e-05 | 4392 | rna-XM_036566096.1 9059367 | 5 | 26627620 | 26632011 | Colossoma macropomum 42526 | AAG|GTGAGGGGGT...AGTGTCTTGACT/CTTATGCTCATT...TGTAG|ATG | 0 | 1 | 10.794 |
| 48952833 | GT-AG | 0 | 0.004961843476796 | 1234 | rna-XM_036566096.1 9059367 | 6 | 26632135 | 26633368 | Colossoma macropomum 42526 | ATG|GTATGCAGTT...TATTTCTTATGA/TTATTTCTTATG...CTCAG|AAT | 0 | 1 | 12.172 |
| 48952834 | GT-AG | 0 | 1.000000099473604e-05 | 345 | rna-XM_036566096.1 9059367 | 7 | 26633561 | 26633905 | Colossoma macropomum 42526 | CAA|GTAATGCACA...GTGTGTATGATT/GTGTGTATGATT...TCCAG|TTC | 0 | 1 | 14.324 |
| 48952835 | GT-AG | 0 | 1.5617026246392117e-05 | 3654 | rna-XM_036566096.1 9059367 | 8 | 26634038 | 26637691 | Colossoma macropomum 42526 | GAG|GTAAGCAGCT...GTCTCTTTATCT/CTATCACTCATT...TGCAG|GTG | 0 | 1 | 15.804 |
| 48952836 | GT-AG | 0 | 1.000000099473604e-05 | 8632 | rna-XM_036566096.1 9059367 | 9 | 26637923 | 26646554 | Colossoma macropomum 42526 | CAG|GTAATAGAGC...GTTTTTGTATTT/TGTTTTTGTATT...CAAAG|GTG | 0 | 1 | 18.393 |
| 48952837 | GT-AG | 0 | 1.000000099473604e-05 | 5014 | rna-XM_036566096.1 9059367 | 10 | 26646678 | 26651691 | Colossoma macropomum 42526 | CAG|GTAAGAACAC...TACCTTTTAATG/CTTTTAATGACC...ATCAG|AAC | 0 | 1 | 19.771 |
| 48952838 | GT-AG | 0 | 1.000000099473604e-05 | 10754 | rna-XM_036566096.1 9059367 | 11 | 26651884 | 26662637 | Colossoma macropomum 42526 | GAG|GTAAGGTTTC...TGCTCATTAGTT/CTTGTGCTCATT...TGAAG|CTG | 0 | 1 | 21.923 |
| 48952839 | GT-AG | 0 | 1.000000099473604e-05 | 154 | rna-XM_036566096.1 9059367 | 12 | 26662808 | 26662961 | Colossoma macropomum 42526 | CAG|GTTAGCATTA...AATATCATATCT/TACACACTAATA...CACAG|AGA | 2 | 1 | 23.829 |
| 48952840 | GT-AG | 0 | 1.000000099473604e-05 | 1119 | rna-XM_036566096.1 9059367 | 13 | 26663137 | 26664255 | Colossoma macropomum 42526 | GAG|GTGAGGAGTG...TGTCCTGTAACA/GTAATATTGAGT...TGCAG|GTC | 0 | 1 | 25.79 |
| 48952841 | GT-AG | 0 | 1.000000099473604e-05 | 2555 | rna-XM_036566096.1 9059367 | 14 | 26664452 | 26667006 | Colossoma macropomum 42526 | CAG|GTGAGAGGGA...CTGTACTTAACA/CTGTACTTAACA...TGTAG|GTA | 1 | 1 | 27.987 |
| 48952842 | GT-AG | 0 | 1.2939936942160112e-05 | 2909 | rna-XM_036566096.1 9059367 | 15 | 26667174 | 26670082 | Colossoma macropomum 42526 | CAG|GTAACACATG...GTATCTTTAGGT/TTCTGGCTCATG...TTTAG|GTG | 0 | 1 | 29.859 |
| 48952843 | GT-AG | 0 | 0.0003212072426058 | 3161 | rna-XM_036566096.1 9059367 | 16 | 26670203 | 26673363 | Colossoma macropomum 42526 | GAG|GTACACACAT...AGTGCATTAATT/AGTGCATTAATT...TCCAG|TCT | 0 | 1 | 31.204 |
| 48952844 | GT-AG | 0 | 1.000000099473604e-05 | 627 | rna-XM_036566096.1 9059367 | 17 | 26673583 | 26674209 | Colossoma macropomum 42526 | CAG|GTGAGACACA...TATATTTGAGCA/TTATATTTGAGC...TGTAG|GTG | 0 | 1 | 33.658 |
| 48952845 | GT-AG | 0 | 1.000000099473604e-05 | 1828 | rna-XM_036566096.1 9059367 | 18 | 26674360 | 26676187 | Colossoma macropomum 42526 | AAG|GTAAGACACC...TTTTCTTTTTCA/TTCTTTTTCAGC...TTCAG|GCC | 0 | 1 | 35.34 |
| 48952846 | GT-AG | 0 | 1.000000099473604e-05 | 1461 | rna-XM_036566096.1 9059367 | 19 | 26676303 | 26677763 | Colossoma macropomum 42526 | AAG|GTCAGTCTAC...ATGTTATTAAAT/ATGTTATTAAAT...TGTAG|CTA | 1 | 1 | 36.629 |
| 48952847 | GT-AG | 0 | 1.000000099473604e-05 | 1091 | rna-XM_036566096.1 9059367 | 20 | 26677880 | 26678970 | Colossoma macropomum 42526 | CAG|GTAAGATTCA...TCTGTGTTGAAG/TCCTCGCTCATA...GTCAG|GCA | 0 | 1 | 37.929 |
| 48952848 | GT-AG | 0 | 1.000000099473604e-05 | 481 | rna-XM_036566096.1 9059367 | 21 | 26679094 | 26679574 | Colossoma macropomum 42526 | AGG|GTAGGGAGCA...AATTACTTAGGC/AACACCCTCACA...CCCAG|CAA | 0 | 1 | 39.307 |
| 48952849 | GT-AG | 0 | 1.000000099473604e-05 | 1541 | rna-XM_036566096.1 9059367 | 22 | 26679758 | 26681298 | Colossoma macropomum 42526 | GAG|GTAAGTCAGC...AAGTTCTGAAAA/GAAGTTCTGAAA...CTTAG|GAT | 0 | 1 | 41.358 |
| 48952850 | GT-AG | 0 | 1.000000099473604e-05 | 222 | rna-XM_036566096.1 9059367 | 23 | 26681418 | 26681639 | Colossoma macropomum 42526 | AGA|GTGAGTGAAC...GTCTTCATAGCA/GCTCTGCTAACA...TATAG|ATT | 2 | 1 | 42.692 |
| 48952851 | GT-AG | 0 | 1.000000099473604e-05 | 279 | rna-XM_036566096.1 9059367 | 24 | 26681707 | 26681985 | Colossoma macropomum 42526 | GAG|GTAAGCAGAA...CTGTCTTTGTAT/CAGACAGTTATT...TGTAG|ACA | 0 | 1 | 43.443 |
| 48952852 | GT-AG | 0 | 1.000000099473604e-05 | 256 | rna-XM_036566096.1 9059367 | 25 | 26682096 | 26682351 | Colossoma macropomum 42526 | TAA|GTAAGGAGCT...TTGTTTGTATCT/CTGTGATTAAGC...TTTAG|CAT | 2 | 1 | 44.676 |
| 48952853 | GT-AG | 0 | 1.000000099473604e-05 | 267 | rna-XM_036566096.1 9059367 | 26 | 26682422 | 26682688 | Colossoma macropomum 42526 | TGG|GTAAGTGGCA...AAGATCTTAAGA/ATACATTTCATG...TCCAG|GCG | 0 | 1 | 45.461 |
| 48952854 | GT-AG | 0 | 1.000000099473604e-05 | 429 | rna-XM_036566096.1 9059367 | 27 | 26682779 | 26683207 | Colossoma macropomum 42526 | GAC|GTAAGAGCAA...TGAGTTTTGAAT/TGAGTTTTGAAT...TGCAG|GAA | 0 | 1 | 46.469 |
| 48952855 | GT-AG | 0 | 1.000000099473604e-05 | 592 | rna-XM_036566096.1 9059367 | 28 | 26683301 | 26683892 | Colossoma macropomum 42526 | AAG|GTGCTTGGTT...TTCTTCTTTCCC/TGCAGTTTTATG...CTCAG|GAG | 0 | 1 | 47.512 |
| 48952856 | GT-AG | 0 | 4.672315097391163e-05 | 654 | rna-XM_036566096.1 9059367 | 29 | 26684005 | 26684658 | Colossoma macropomum 42526 | AGG|GTACGTCACA...TGACTTTTGACC/TCTGTTTTGATA...TGTAG|GTT | 1 | 1 | 48.767 |
| 48952857 | GT-AG | 0 | 0.0063277842127054 | 549 | rna-XM_036566096.1 9059367 | 30 | 26684850 | 26685398 | Colossoma macropomum 42526 | CTG|GTACACTGCC...CTGTTCTGAATC/GCTGTTCTGAAT...TTCAG|ACG | 0 | 1 | 50.908 |
| 48952858 | GT-AG | 0 | 3.772713117494251e-05 | 226 | rna-XM_036566096.1 9059367 | 31 | 26685501 | 26685726 | Colossoma macropomum 42526 | AAG|GTGCATTTAT...TCCACTTTAACT/ACTGAATTAATT...CACAG|GCA | 0 | 1 | 52.051 |
| 48952859 | GT-AG | 0 | 1.000000099473604e-05 | 851 | rna-XM_036566096.1 9059367 | 32 | 26685870 | 26686720 | Colossoma macropomum 42526 | GAG|GTTAGATTCC...AAGTATTTATTT/TAAGTATTTATT...TGCAG|GGA | 2 | 1 | 53.654 |
| 48952860 | GT-AG | 0 | 1.000000099473604e-05 | 13288 | rna-XM_036566096.1 9059367 | 33 | 26686824 | 26700111 | Colossoma macropomum 42526 | AAG|GTAGAAACTG...ATCTTCATGATC/TCTTCATTCATC...TGCAG|CTC | 0 | 1 | 54.808 |
| 48952861 | GT-AG | 0 | 1.000000099473604e-05 | 14858 | rna-XM_036566096.1 9059367 | 34 | 26700359 | 26715216 | Colossoma macropomum 42526 | AAG|GTAAGAACCA...ATATTTTTATTC/CATATTTTTATT...TTTAG|AGA | 1 | 1 | 57.577 |
| 48952862 | GT-AG | 0 | 1.000000099473604e-05 | 2390 | rna-XM_036566096.1 9059367 | 35 | 26715483 | 26717872 | Colossoma macropomum 42526 | GAG|GTAATGGAGC...TGTTCCTTGTCT/ACTGCTCTTATA...TGCAG|AGG | 0 | 1 | 60.558 |
| 48952863 | GT-AG | 0 | 0.0037006592637593 | 914 | rna-XM_036566096.1 9059367 | 36 | 26718011 | 26718924 | Colossoma macropomum 42526 | AAG|GTATGCATAA...CTCTCCTTGCTA/TCCTTGCTAACA...TGCAG|TCG | 0 | 1 | 62.105 |
| 48952864 | GT-AG | 0 | 1.000000099473604e-05 | 778 | rna-XM_036566096.1 9059367 | 37 | 26719021 | 26719798 | Colossoma macropomum 42526 | ATG|GTAAGTGCAT...GTGGCTTTGCCT/TCTTGTATGAGT...TTTAG|ACG | 0 | 1 | 63.181 |
| 48952865 | GT-AG | 0 | 1.000000099473604e-05 | 1210 | rna-XM_036566096.1 9059367 | 38 | 26719915 | 26721124 | Colossoma macropomum 42526 | CAT|GTAAGACACA...TGTGCTTTACTT/ATGTGCTTTACT...CTCAG|GTT | 2 | 1 | 64.481 |
| 48952866 | GT-AG | 0 | 1.000000099473604e-05 | 346 | rna-XM_036566096.1 9059367 | 39 | 26721192 | 26721537 | Colossoma macropomum 42526 | GAG|GTGAGAAAAT...GAGCTTTTGACT/ACTCTTTTGATT...TATAG|GGT | 0 | 1 | 65.232 |
| 48952867 | GT-AG | 0 | 1.000000099473604e-05 | 90 | rna-XM_036566096.1 9059367 | 40 | 26721642 | 26721731 | Colossoma macropomum 42526 | AGA|GTGAGTCAAG...TTCTCTTTATTT/GTTCTCTTTATT...TACAG|TTT | 2 | 1 | 66.398 |
| 48952868 | GT-AG | 0 | 1.000000099473604e-05 | 93 | rna-XM_036566096.1 9059367 | 41 | 26721802 | 26721894 | Colossoma macropomum 42526 | CAT|GTACGTGAGA...CTTTGCTGAATG/GCTTTGCTGAAT...CACAG|GAG | 0 | 1 | 67.182 |
| 48952869 | GT-AG | 0 | 0.0001328784767729 | 136 | rna-XM_036566096.1 9059367 | 42 | 26721982 | 26722117 | Colossoma macropomum 42526 | GAG|GTAAACCATC...CTGTCTTTAATA/ACCTTTTTAACT...TTAAG|GTG | 0 | 1 | 68.157 |
| 48952870 | GT-AG | 0 | 1.000000099473604e-05 | 101 | rna-XM_036566096.1 9059367 | 43 | 26722211 | 26722311 | Colossoma macropomum 42526 | AAG|GTAAAACGTT...CTGTTCTTGAAA/ACATAACTTATT...TTAAG|GAC | 0 | 1 | 69.2 |
| 48952871 | GT-AG | 0 | 1.000000099473604e-05 | 116 | rna-XM_036566096.1 9059367 | 44 | 26722363 | 26722478 | Colossoma macropomum 42526 | GAG|GTGAGATGTT...TCCCCCTTACTT/TTGGTTCTCATC...TGTAG|AGA | 0 | 1 | 69.771 |
| 48952872 | GT-AG | 0 | 4.867691026030376e-05 | 898 | rna-XM_036566096.1 9059367 | 45 | 26722557 | 26723454 | Colossoma macropomum 42526 | GAG|GTACTTCATC...TACACATTAACA/TACACATTAACA...TTCAG|GGA | 0 | 1 | 70.646 |
| 48952873 | GT-AG | 0 | 1.000000099473604e-05 | 257 | rna-XM_036566096.1 9059367 | 46 | 26723641 | 26723897 | Colossoma macropomum 42526 | AAG|GTGATCCTCC...TAATTCTGATCA/GTAATTCTGATC...ACTAG|GTG | 0 | 1 | 72.73 |
| 48952874 | GT-AG | 0 | 1.4297578988805553e-05 | 233 | rna-XM_036566096.1 9059367 | 47 | 26724070 | 26724302 | Colossoma macropomum 42526 | CAG|GTAAGCTCAG...GTGCTGTTGATT/TGTATTTTCATG...TGAAG|CTC | 1 | 1 | 74.658 |
| 48952875 | GT-AG | 0 | 1.000000099473604e-05 | 1878 | rna-XM_036566096.1 9059367 | 48 | 26724536 | 26726413 | Colossoma macropomum 42526 | AAG|GTGAGTGTAG...CTCCCATCAACC/CTCCCTCCCATC...TACAG|CAG | 0 | 1 | 77.27 |
| 48952876 | GT-AG | 0 | 1.1355416830657265e-05 | 2412 | rna-XM_036566096.1 9059367 | 49 | 26726671 | 26729082 | Colossoma macropomum 42526 | CAA|GTAAGTGTCA...CATGTTTTAATA/CATGTTTTAATA...AACAG|GAA | 2 | 1 | 80.15 |
| 48952877 | GT-AG | 0 | 1.000000099473604e-05 | 863 | rna-XM_036566096.1 9059367 | 50 | 26729216 | 26730078 | Colossoma macropomum 42526 | AAA|GTGAGTAGTA...CTCCTCCTGAAC/TCTAATGTTACT...CACAG|GAG | 0 | 1 | 81.641 |
| 48952878 | GT-AG | 0 | 1.000000099473604e-05 | 570 | rna-XM_036566096.1 9059367 | 51 | 26730142 | 26730711 | Colossoma macropomum 42526 | GAG|GTAGAGCTTC...TACATTTTAACA/TTAACATTTATT...CACAG|TTA | 0 | 1 | 82.347 |
| 48952879 | GT-AG | 0 | 1.000000099473604e-05 | 1778 | rna-XM_036566096.1 9059367 | 52 | 26730740 | 26732517 | Colossoma macropomum 42526 | AAG|GTAAGATTGT...TACTTCTTCTTT/CATTGTGTGATG...TGAAG|TGG | 1 | 1 | 82.661 |
| 48952880 | GT-AG | 0 | 1.000000099473604e-05 | 432 | rna-XM_036566096.1 9059367 | 53 | 26732681 | 26733112 | Colossoma macropomum 42526 | CAG|GTAAAGCCCT...CGTTTTTTACTG/ACGTTTTTTACT...TCCAG|GGA | 2 | 1 | 84.488 |
| 48952881 | GT-AG | 0 | 1.000000099473604e-05 | 1889 | rna-XM_036566096.1 9059367 | 54 | 26733235 | 26735123 | Colossoma macropomum 42526 | AAG|GTAATGTGGA...ATTGCCTGATTG/CATTGCCTGATT...GTCAG|GTG | 1 | 1 | 85.855 |
| 48952882 | GT-AG | 0 | 0.4994172046187021 | 1543 | rna-XM_036566096.1 9059367 | 55 | 26735256 | 26736798 | Colossoma macropomum 42526 | AGG|GTAACCTGCG...ACATCTTTAATT/AATACATTCATT...TGCAG|ACT | 1 | 1 | 87.335 |
| 48952883 | GT-AG | 0 | 1.000000099473604e-05 | 182 | rna-XM_036566096.1 9059367 | 56 | 26736964 | 26737145 | Colossoma macropomum 42526 | GTG|GTTAGTACTG...ATGTCTTTTGTT/AATATATTAATG...TGAAG|TTT | 1 | 1 | 89.184 |
| 48952884 | GT-AG | 0 | 1.000000099473604e-05 | 157 | rna-XM_036566096.1 9059367 | 57 | 26737225 | 26737381 | Colossoma macropomum 42526 | AAG|GTGATGTCTT...TATATCTTGCTG/TATATATATATC...TTCAG|GGG | 2 | 1 | 90.069 |
| 48952885 | GT-AG | 0 | 3.859348231638894e-05 | 275 | rna-XM_036566096.1 9059367 | 58 | 26737583 | 26737857 | Colossoma macropomum 42526 | GCT|GTAAGTCTGA...ATTGCTTTTCCT/TCCTGTTTTATG...TACAG|GCT | 2 | 1 | 92.322 |
| 48952886 | GT-AG | 0 | 1.000000099473604e-05 | 135 | rna-XM_036566096.1 9059367 | 59 | 26737997 | 26738131 | Colossoma macropomum 42526 | AAG|GTGAACTTTT...TAACCCTCAATA/TACTTTTTCATC...TACAG|CCT | 0 | 1 | 93.88 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL
,PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);