introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
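To make the relationship between a few of these columns concrete, here is a minimal Python sketch using values copied from one row of this page (id 126230436). It rests on two assumptions that the column descriptions do not state outright: that `start` and `end` are both inclusive coordinates, and that the text between the two `|` separators in `scored_motifs` is the (abbreviated) intron sequence. The `IntronRow` class and both helper functions are hypothetical names introduced only for illustration.

```python
from dataclasses import dataclass


@dataclass
class IntronRow:
    """Subset of the introns columns; hypothetical container for one row."""
    id: int
    dinucleotide_pair: str
    length: int
    start: int
    end: int
    scored_motifs: str


def inclusive_length(start: int, end: int) -> int:
    # Assumption: start and end are both inclusive genomic coordinates.
    return end - start + 1


def terminal_dinucleotides(scored_motifs: str) -> str:
    # Assumption: the middle '|'-delimited field is the abbreviated intron,
    # flanked by a few bases of exon context on either side.
    upstream, intron, downstream = scored_motifs.split("|")
    return f"{intron[:2]}-{intron[-2:]}"


# Values copied from the row with id 126230436 on this page.
row = IntronRow(
    id=126230436,
    dinucleotide_pair="GT-AG",
    length=118,
    start=1787888,
    end=1788005,
    scored_motifs="AAA|GTAAGTGAAT...TATATTTTAATT/TATATTTTAATT...TATAG|CAT",
)

# Both checks hold for this row under the assumptions above.
print(inclusive_length(row.start, row.end) == row.length)
print(terminal_dinucleotides(row.scored_motifs) == row.dinucleotide_pair)
```

That both checks pass for this row is an observation about the sample, not a documented guarantee for the whole table.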
47 rows where transcript_id = 23199961
id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
126230436 | GT-AG | 0 | 1.000000099473604e-05 | 118 | rna-gnl|WGS:MCOG|LY90DRAFT_mRNA663081 23199961 | 1 | 1787888 | 1788005 | Neocallimastix californiae 1754190 | AAA|GTAAGTGAAT...TATATTTTAATT/TATATTTTAATT...TATAG|CAT | 2 | 1 | 29.203 |
126230437 | GT-AG | 0 | 0.0015916899213555 | 130 | rna-gnl|WGS:MCOG|LY90DRAFT_mRNA663081 23199961 | 2 | 1788039 | 1788168 | Neocallimastix californiae 1754190 | GAT|GTAAGTTTAA...TTTTTTTTAATA/TTTTTTTTAATA...TATAG|TGA | 2 | 1 | 29.892 |
126230438 | GT-AG | 0 | 1.000000099473604e-05 | 104 | rna-gnl|WGS:MCOG|LY90DRAFT_mRNA663081 23199961 | 3 | 1788268 | 1788371 | Neocallimastix californiae 1754190 | AAT|GTAAAAAAAA...TATTTTTTATAT/AAATTTCTAAAT...TATAG|CTC | 2 | 1 | 31.957 |
126230439 | GT-AG | 0 | 0.0003894958468346 | 92 | rna-gnl|WGS:MCOG|LY90DRAFT_mRNA663081 23199961 | 4 | 1788444 | 1788535 | Neocallimastix californiae 1754190 | ATT|GTAATATTAA...AATTTTTTAAAA/ATTATTCTAAAA...AATAG|ATA | 2 | 1 | 33.458 |
126230440 | GT-AG | 0 | 0.0006670648724093 | 104 | rna-gnl|WGS:MCOG|LY90DRAFT_mRNA663081 23199961 | 5 | 1788608 | 1788711 | Neocallimastix californiae 1754190 | TCT|GTAAATTAAT...ATTTTTTTATAC/TATTTTTTTATA...TTTAG|TCA | 2 | 1 | 34.96 |
126230441 | GT-AG | 0 | 0.0517100110206443 | 95 | rna-gnl|WGS:MCOG|LY90DRAFT_mRNA663081 23199961 | 6 | 1788784 | 1788878 | Neocallimastix californiae 1754190 | ATT|GTATAATTTA...CAATTATTAATT/CAATTATTAATT...TATAG|ACA | 2 | 1 | 36.462 |
126230442 | GT-AG | 0 | 0.0030357246890462 | 133 | rna-gnl|WGS:MCOG|LY90DRAFT_mRNA663081 23199961 | 7 | 1788951 | 1789083 | Neocallimastix californiae 1754190 | ACT|GTAAATTTAC...TATATTTTAAAA/AAGTTATTAATT...TTTAG|ACA | 2 | 1 | 37.964 |
126230443 | GT-AG | 0 | 2.145556087948931e-05 | 91 | rna-gnl|WGS:MCOG|LY90DRAFT_mRNA663081 23199961 | 8 | 1789156 | 1789246 | Neocallimastix californiae 1754190 | ATT|GTAAAAATAT...ATGATTTTAAAT/ATGATTTTAAAT...AATAG|ATA | 2 | 1 | 39.466 |
126230444 | GT-AG | 0 | 0.9012329502900353 | 109 | rna-gnl|WGS:MCOG|LY90DRAFT_mRNA663081 23199961 | 9 | 1789319 | 1789427 | Neocallimastix californiae 1754190 | TTT|GTATGTTTAA...TTTTTTTTAAAA/AATATATTAATT...TAAAG|AAA | 2 | 1 | 40.968 |
126230445 | GT-AG | 0 | 7.255412320339255e-05 | 108 | rna-gnl|WGS:MCOG|LY90DRAFT_mRNA663081 23199961 | 10 | 1789572 | 1789679 | Neocallimastix californiae 1754190 | ATT|GTAATATATA...TATATTTTATTA/TTATATTTTATT...TACAG|ATG | 2 | 1 | 43.972 |
126230446 | GT-AG | 0 | 1.7320470016174534e-05 | 217 | rna-gnl|WGS:MCOG|LY90DRAFT_mRNA663081 23199961 | 11 | 1789752 | 1789968 | Neocallimastix californiae 1754190 | ACT|GTAATATAAA...TAAATTTTATAT/TTAAATTTTATA...TATAG|ACG | 2 | 1 | 45.474 |
126230447 | GT-AG | 0 | 0.0018939573251076 | 107 | rna-gnl|WGS:MCOG|LY90DRAFT_mRNA663081 23199961 | 12 | 1790041 | 1790147 | Neocallimastix californiae 1754190 | CAT|GTAATTTTTT...AAAATTTTAATT/ATTATATTCATT...TACAG|TGG | 2 | 1 | 46.975 |
126230448 | GT-AG | 0 | 0.0066898975009663 | 97 | rna-gnl|WGS:MCOG|LY90DRAFT_mRNA663081 23199961 | 13 | 1790220 | 1790316 | Neocallimastix californiae 1754190 | AAT|GTATTTAATA...ATAATTTTAAAA/TAATTATTAATA...TATAG|GCA | 2 | 1 | 48.477 |
126230449 | GT-AG | 0 | 1.000000099473604e-05 | 110 | rna-gnl|WGS:MCOG|LY90DRAFT_mRNA663081 23199961 | 14 | 1790389 | 1790498 | Neocallimastix californiae 1754190 | ATT|GTAATAAAAA...ATATCTTTATAT/TTTATATTTAAT...TTAAG|ATA | 2 | 1 | 49.979 |
126230450 | GT-AG | 0 | 0.1010811529841304 | 106 | rna-gnl|WGS:MCOG|LY90DRAFT_mRNA663081 23199961 | 15 | 1790571 | 1790676 | Neocallimastix californiae 1754190 | ATT|GTATATATTA...ATATTATTAAAT/TATATTTTCATA...AATAG|ACG | 2 | 1 | 51.481 |
126230451 | GT-AG | 0 | 0.0071116577453177 | 109 | rna-gnl|WGS:MCOG|LY90DRAFT_mRNA663081 23199961 | 16 | 1790749 | 1790857 | Neocallimastix californiae 1754190 | TTT|GTATGACTAT...ATTATCTTAATT/ATTATCTTAATT...TTTAG|AAT | 2 | 1 | 52.983 |
126230452 | GT-AG | 0 | 3.164727138254149e-05 | 81 | rna-gnl|WGS:MCOG|LY90DRAFT_mRNA663081 23199961 | 17 | 1790930 | 1791010 | Neocallimastix californiae 1754190 | ATT|GTAAATAAAA...TTATTATTAATT/TTATTATTAATT...ATTAG|ATG | 2 | 1 | 54.485 |
126230453 | GT-AG | 0 | 4.309670098485577e-05 | 116 | rna-gnl|WGS:MCOG|LY90DRAFT_mRNA663081 23199961 | 18 | 1791083 | 1791198 | Neocallimastix californiae 1754190 | TAT|GTAAGATATT...TTATTTTTAAAT/TTATTTTTAAAT...AAAAG|TCA | 2 | 1 | 55.987 |
126230454 | GT-AG | 0 | 1.000000099473604e-05 | 114 | rna-gnl|WGS:MCOG|LY90DRAFT_mRNA663081 23199961 | 19 | 1791271 | 1791384 | Neocallimastix californiae 1754190 | ATT|GTAAGAATAA...TTATTCTAATTA/ATTATTCTAATT...TATAG|ATA | 2 | 1 | 57.489 |
126230455 | GT-AG | 0 | 8.300712586277914e-05 | 93 | rna-gnl|WGS:MCOG|LY90DRAFT_mRNA663081 23199961 | 20 | 1791457 | 1791549 | Neocallimastix californiae 1754190 | ATT|GTAATATTAT...TTATCTATAATT/TTATATTTAATA...TAAAG|ATG | 2 | 1 | 58.99 |
126230456 | GT-AG | 0 | 3.755622373575205e-05 | 319 | rna-gnl|WGS:MCOG|LY90DRAFT_mRNA663081 23199961 | 21 | 1791622 | 1791940 | Neocallimastix californiae 1754190 | ATT|GTAATAATAA...TTTTTTTTATTA/TTTTTTTTTATT...ATTAG|ACA | 2 | 1 | 60.492 |
126230457 | GT-AG | 0 | 0.00011742418679 | 129 | rna-gnl|WGS:MCOG|LY90DRAFT_mRNA663081 23199961 | 22 | 1792013 | 1792141 | Neocallimastix californiae 1754190 | ATT|GTAAATAATT...ATATTATTAAAT/ATATTATTAAAT...AAAAG|ATA | 2 | 1 | 61.994 |
126230458 | GT-AG | 0 | 4.04751025760581e-05 | 129 | rna-gnl|WGS:MCOG|LY90DRAFT_mRNA663081 23199961 | 23 | 1792214 | 1792342 | Neocallimastix californiae 1754190 | ATT|GTAAATATAT...TATATATTAAAA/AAAATATTTATA...TTTAG|ACG | 2 | 1 | 63.496 |
126230459 | GT-AG | 0 | 8.270298159305701e-05 | 102 | rna-gnl|WGS:MCOG|LY90DRAFT_mRNA663081 23199961 | 24 | 1792415 | 1792516 | Neocallimastix californiae 1754190 | ATT|GTAATATATA...AATATTTTAAAT/AATATTTTAAAT...AATAG|ATG | 2 | 1 | 64.998 |
126230460 | GT-AG | 0 | 0.0214948476575035 | 93 | rna-gnl|WGS:MCOG|LY90DRAFT_mRNA663081 23199961 | 25 | 1792589 | 1792681 | Neocallimastix californiae 1754190 | ATT|GTATAATATA...TATTTTTTATAA/TTATTTTTTATA...ATTAG|ATG | 2 | 1 | 66.5 |
126230461 | GT-AG | 0 | 0.0011558542878441 | 352 | rna-gnl|WGS:MCOG|LY90DRAFT_mRNA663081 23199961 | 26 | 1792754 | 1793105 | Neocallimastix californiae 1754190 | ACT|GTAATATTTA...TTTTTTTTACCA/TTTTTTTTTACC...TATAG|TCG | 2 | 1 | 68.002 |
126230462 | GT-AG | 0 | 3.039092493118843e-05 | 281 | rna-gnl|WGS:MCOG|LY90DRAFT_mRNA663081 23199961 | 27 | 1793178 | 1793458 | Neocallimastix californiae 1754190 | ATT|GTAAGTATTA...TAATTGTTATTT/TTGTTATTTAAA...TATAG|GTT | 2 | 1 | 69.504 |
126230463 | GT-AG | 0 | 3.41610670209062e-05 | 86 | rna-gnl|WGS:MCOG|LY90DRAFT_mRNA663081 23199961 | 28 | 1793531 | 1793616 | Neocallimastix californiae 1754190 | ATT|GTAAAATTAA...TATTTCATATTC/AAATATTTCATA...TATAG|ATG | 2 | 1 | 71.005 |
126230464 | GT-AG | 0 | 0.0037737358719893 | 103 | rna-gnl|WGS:MCOG|LY90DRAFT_mRNA663081 23199961 | 29 | 1793689 | 1793791 | Neocallimastix californiae 1754190 | ATT|GTAGTGTTTA...TATGCTTTGATA/AATTTATTCAAT...TTTAG|ACA | 2 | 1 | 72.507 |
126230465 | GT-AG | 0 | 1.000000099473604e-05 | 121 | rna-gnl|WGS:MCOG|LY90DRAFT_mRNA663081 23199961 | 30 | 1793864 | 1793984 | Neocallimastix californiae 1754190 | ATT|GTAATAAATT...TATATATTAATA/TATATATTAATA...TAAAG|ATA | 2 | 1 | 74.009 |
126230466 | GT-AG | 0 | 1.000000099473604e-05 | 133 | rna-gnl|WGS:MCOG|LY90DRAFT_mRNA663081 23199961 | 31 | 1794057 | 1794189 | Neocallimastix californiae 1754190 | ATT|GTAAGACATT...TATCTATTAATA/TTTATGTTAATA...TGAAG|TCG | 2 | 1 | 75.511 |
126230467 | GT-AG | 0 | 0.349489820940526 | 114 | rna-gnl|WGS:MCOG|LY90DRAFT_mRNA663081 23199961 | 32 | 1794262 | 1794375 | Neocallimastix californiae 1754190 | ATT|GTATATATCT...AATTTTTTAACG/ATTATATTAACT...TCTAG|ATT | 2 | 1 | 77.013 |
126230468 | GT-AG | 0 | 0.5361286176797276 | 104 | rna-gnl|WGS:MCOG|LY90DRAFT_mRNA663081 23199961 | 33 | 1794448 | 1794551 | Neocallimastix californiae 1754190 | ATT|GTATATATTA...TATATTTTAATA/TATATTTTAATA...AATAG|ATG | 2 | 1 | 78.515 |
126230469 | GT-AG | 0 | 2.770098799411196e-05 | 99 | rna-gnl|WGS:MCOG|LY90DRAFT_mRNA663081 23199961 | 34 | 1794624 | 1794722 | Neocallimastix californiae 1754190 | ATT|GTAATAATTT...AATTTATTAAAT/AATTTATTAAAT...TCTAG|ACA | 2 | 1 | 80.017 |
126230470 | GT-AG | 0 | 0.0005338437591006 | 108 | rna-gnl|WGS:MCOG|LY90DRAFT_mRNA663081 23199961 | 35 | 1794795 | 1794902 | Neocallimastix californiae 1754190 | ATT|GTAAATATTA...ATATTATTAATA/ATATTATTAATA...TATAG|ATA | 2 | 1 | 81.519 |
126230471 | GT-AG | 0 | 1.000000099473604e-05 | 133 | rna-gnl|WGS:MCOG|LY90DRAFT_mRNA663081 23199961 | 36 | 1794975 | 1795107 | Neocallimastix californiae 1754190 | AAT|GTAAGAATAA...TATACTTCAAAA/ATATACTTCAAA...AAAAG|TTC | 2 | 1 | 83.02 |
126230472 | GT-AG | 0 | 0.0007974489505486 | 104 | rna-gnl|WGS:MCOG|LY90DRAFT_mRNA663081 23199961 | 37 | 1795180 | 1795283 | Neocallimastix californiae 1754190 | ATT|GTAAATAATT...TTTTTTTTAATA/TTTTTTTTAATA...ACAAG|ACG | 2 | 1 | 84.522 |
126230473 | GT-AG | 0 | 0.0016722516357591 | 152 | rna-gnl|WGS:MCOG|LY90DRAFT_mRNA663081 23199961 | 38 | 1795356 | 1795507 | Neocallimastix californiae 1754190 | ATT|GTAAGTTTTA...TATTTATTAATT/TATTTATTAATT...TTTAG|ATT | 2 | 1 | 86.024 |
126230474 | GT-AG | 0 | 0.0014648419216789 | 229 | rna-gnl|WGS:MCOG|LY90DRAFT_mRNA663081 23199961 | 39 | 1795580 | 1795808 | Neocallimastix californiae 1754190 | ACT|GTAAGTATTA...ATTTTTTTAATC/ATTTTTTTAATC...AAAAG|ATG | 2 | 1 | 87.526 |
126230475 | GT-AG | 0 | 6.632448186432543e-05 | 104 | rna-gnl|WGS:MCOG|LY90DRAFT_mRNA663081 23199961 | 40 | 1795881 | 1795984 | Neocallimastix californiae 1754190 | AAT|GTAAATAAAA...TTTTTTTTAAAA/TTATATCTTATA...AAAAG|TCG | 2 | 1 | 89.028 |
126230476 | GT-AG | 0 | 1.000000099473604e-05 | 131 | rna-gnl|WGS:MCOG|LY90DRAFT_mRNA663081 23199961 | 41 | 1796057 | 1796187 | Neocallimastix californiae 1754190 | TTT|GTAAGATAGT...TTTTTCATAAAT/TTATTTTTCATA...ACTAG|GTT | 2 | 1 | 90.53 |
126230477 | GT-AG | 0 | 0.0001211528375586 | 86 | rna-gnl|WGS:MCOG|LY90DRAFT_mRNA663081 23199961 | 42 | 1796260 | 1796345 | Neocallimastix californiae 1754190 | ATT|GTAAGCATAT...TAATTGTTATAT/ATAATATTAAAT...AATAG|ATG | 2 | 1 | 92.032 |
126230478 | GT-AG | 0 | 0.0003295645255003 | 157 | rna-gnl|WGS:MCOG|LY90DRAFT_mRNA663081 23199961 | 43 | 1796418 | 1796574 | Neocallimastix californiae 1754190 | ATT|GTAATTAATC...CTTTTTTTATTT/TTTTATTTCATT...ATAAG|ACA | 2 | 1 | 93.534 |
126230479 | GT-AG | 0 | 0.0071631472313055 | 92 | rna-gnl|WGS:MCOG|LY90DRAFT_mRNA663081 23199961 | 44 | 1796647 | 1796738 | Neocallimastix californiae 1754190 | ATT|GTAAACTATT...TTTAATTTAATA/TTAAATTTAATT...CATAG|ATA | 2 | 1 | 95.035 |
126230480 | GT-AG | 0 | 0.0010776845265073 | 138 | rna-gnl|WGS:MCOG|LY90DRAFT_mRNA663081 23199961 | 45 | 1796811 | 1796948 | Neocallimastix californiae 1754190 | TTT|GTAAGTTTAT...TTTTTTTTATTA/TTTTTTTTTATT...TATAG|ACG | 2 | 1 | 96.537 |
126230481 | GT-AG | 0 | 0.0293531894075013 | 125 | rna-gnl|WGS:MCOG|LY90DRAFT_mRNA663081 23199961 | 46 | 1797021 | 1797145 | Neocallimastix californiae 1754190 | ATT|GTATATAATC...TGATTTTTATAA/TATATATTGATT...ACTAG|ATT | 2 | 1 | 98.039 |
126230482 | GT-AG | 0 | 0.0024948856601423 | 175 | rna-gnl|WGS:MCOG|LY90DRAFT_mRNA663081 23199961 | 47 | 1797218 | 1797392 | Neocallimastix californiae 1754190 | AAT|GTAATTTTTA...TTTCTTTTATAT/ATTTCTTTTATA...AATAG|TAA | 2 | 1 | 99.541 |
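The filtered view above corresponds to a simple lookup on `transcript_id`. The sketch below reproduces that lookup with Python's standard `sqlite3` module against an in-memory stand-in table. The stand-in keeps only a subset of columns and three rows copied from this page; the function name `introns_for_transcript` is hypothetical, and a real session would connect to a local copy of the database file instead.

```python
import sqlite3

# In-memory stand-in for the real database; only a subset of columns is kept.
conn = sqlite3.connect(":memory:")
conn.execute(
    """
    CREATE TABLE introns (
        id INTEGER PRIMARY KEY,
        is_minor INTEGER,
        score REAL,
        length INTEGER,
        transcript_id INTEGER,
        ordinal_index INTEGER
    )
    """
)

# Three rows copied from the table on this page (ordinal_index 1, 9, 33).
sample_rows = [
    (126230436, 0, 1.000000099473604e-05, 118, 23199961, 1),
    (126230444, 0, 0.9012329502900353, 109, 23199961, 9),
    (126230468, 0, 0.5361286176797276, 104, 23199961, 33),
]
conn.executemany("INSERT INTO introns VALUES (?, ?, ?, ?, ?, ?)", sample_rows)


def introns_for_transcript(connection, transcript_id):
    """Return (id, ordinal_index, score) tuples for one transcript, sorted by id."""
    cursor = connection.execute(
        "SELECT id, ordinal_index, score FROM introns "
        "WHERE transcript_id = ? ORDER BY id",
        (transcript_id,),
    )
    return cursor.fetchall()


rows = introns_for_transcript(conn, 23199961)
# Pick the sample row with the highest minor-intron score.
highest = max(rows, key=lambda r: r[2])
print(len(rows), highest[0])
```

Using a bound parameter (`?`) rather than string formatting keeps the query safe if the transcript id comes from user input.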
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id] ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase] ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor] ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair] ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score] ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds] ON [introns] ([in_cds]);
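As a rough illustration of why the schema indexes `transcript_id`, the following Python sketch loads the table definition and one of its indexes into an empty in-memory SQLite database and asks the planner how it would run a lookup by transcript. It assumes the database is SQLite (consistent with the bracket-quoted identifiers), and the exact wording of the plan varies between SQLite versions, so the sketch only checks for the index name. The referenced `transcripts` and `genomes` tables are not created here; SQLite accepts the foreign key declarations without them.

```python
import sqlite3

# Table definition and one index, copied from the schema on this page.
SCHEMA = """
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
"""

conn = sqlite3.connect(":memory:")
conn.executescript(SCHEMA)

# Ask SQLite how it would execute a lookup by transcript_id.
plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT id FROM introns WHERE transcript_id = ?",
    (23199961,),
).fetchall()

# The human-readable detail is the last column of each plan row.
plan_text = " ".join(str(row[-1]) for row in plan)
print(plan_text)
```

If the index name appears in the printed plan, the lookup is an index search rather than a full scan of the table.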