introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
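As a rough illustration of how these columns might be queried, the sketch below builds a small in-memory SQLite table holding a subset of the columns and three rows copied from this page, then lists the introns of one transcript in order. The in-memory database and the helper name `introns_for_transcript` are assumptions made for the example; they are not part of the published database.

```python
import sqlite3

# In-memory stand-in for the real database (assumption: only a subset of
# the documented columns is created, to keep the example short).
conn = sqlite3.connect(":memory:")
conn.execute(
    """CREATE TABLE introns (
        id INTEGER PRIMARY KEY,
        dinucleotide_pair TEXT,
        is_minor INTEGER,
        score REAL,
        length INTEGER,
        transcript_id INTEGER,
        ordinal_index INTEGER,
        phase INTEGER,
        in_cds INTEGER
    )"""
)

# Three rows taken from the table on this page, inserted out of order on purpose.
rows = [
    (3822083, "GT-AG", 0, 1.000000099473604e-05, 74, 720770, 3, 1, 1),
    (3822081, "GT-AG", 0, 0.0010746549608455, 60, 720770, 1, 0, 1),
    (3822082, "GT-AG", 0, 0.001587961449283, 61, 720770, 2, 0, 1),
]
conn.executemany("INSERT INTO introns VALUES (?, ?, ?, ?, ?, ?, ?, ?, ?)", rows)


def introns_for_transcript(conn, transcript_id):
    """Hypothetical helper: (ordinal_index, length, phase, is_minor) in transcript order."""
    cur = conn.execute(
        "SELECT ordinal_index, length, phase, is_minor FROM introns "
        "WHERE transcript_id = ? ORDER BY ordinal_index",
        (transcript_id,),
    )
    return cur.fetchall()


result = introns_for_transcript(conn, 720770)
for row in result:
    print(row)
```

Sorting by ordinal_index rather than id makes the intended order explicit instead of relying on insertion order.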
35 rows where transcript_id = 720770
id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
3822081 | GT-AG | 0 | 0.0010746549608455 | 60 | rna-gnl|I4U23|003361-T1 720770 | 1 | 10558936 | 10558995 | Adineta vaga 104782 | AAA|GTAATTTTAT...GACATCTTATTT/ATTTTTTTCAAT...ATTAG|AAA | 0 | 1 | 6.342 |
3822082 | GT-AG | 0 | 0.001587961449283 | 61 | rna-gnl|I4U23|003361-T1 720770 | 2 | 10559374 | 10559434 | Adineta vaga 104782 | AAT|GTAAGTTTTC...TTGTTTTTAAAA/TATCTTTTCATT...ATCAG|TCA | 0 | 1 | 12.305 |
3822083 | GT-AG | 0 | 1.000000099473604e-05 | 74 | rna-gnl|I4U23|003361-T1 720770 | 3 | 10559574 | 10559647 | Adineta vaga 104782 | AAG|GTAAGAATAT...TGATTCTTATCA/ATGATTCTTATC...CTTAG|ATG | 1 | 1 | 14.498 |
3822084 | GT-AG | 0 | 1.1903737969010134e-05 | 60 | rna-gnl|I4U23|003361-T1 720770 | 4 | 10559850 | 10559909 | Adineta vaga 104782 | CAG|GTTTGTACGA...AATTTCTTGTCT/CAATATATAACT...AATAG|ATT | 2 | 1 | 17.684 |
3822085 | GT-AG | 0 | 0.0046448551799807 | 60 | rna-gnl|I4U23|003361-T1 720770 | 5 | 10560069 | 10560128 | Adineta vaga 104782 | AAC|GTATATGATA...AATATTTTGAAT/CTTGTTTTAAGT...TTTAG|ATT | 2 | 1 | 20.192 |
3822086 | GT-AG | 0 | 0.0009566009682866 | 57 | rna-gnl|I4U23|003361-T1 720770 | 6 | 10560311 | 10560367 | Adineta vaga 104782 | ATA|GTACGTTAAT...ATACTTTTATAT/CGTTGTTTGACT...TTAAG|TGA | 1 | 1 | 23.064 |
3822087 | GT-AG | 0 | 3.099362800001375e-05 | 51 | rna-gnl|I4U23|003361-T1 720770 | 7 | 10560467 | 10560517 | Adineta vaga 104782 | CAT|GTAGGTATCA...TGATTGTTGAAA/TTGAAAATAATT...TTTAG|CTG | 1 | 1 | 24.625 |
3822088 | GT-AG | 0 | 1.000000099473604e-05 | 62 | rna-gnl|I4U23|003361-T1 720770 | 8 | 10560773 | 10560834 | Adineta vaga 104782 | TTG|GTAGGATAAC...AAAATATTAATT/AAAATATTAATT...TTCAG|CAA | 1 | 1 | 28.648 |
3822089 | GT-AG | 0 | 1.8827285600262967e-05 | 66 | rna-gnl|I4U23|003361-T1 720770 | 9 | 10561194 | 10561259 | Adineta vaga 104782 | CAA|GTAAAGATTC...TCTTCTTTGAAA/TCTTCTTTGAAA...TGTAG|AAA | 0 | 1 | 34.311 |
3822090 | GT-AG | 0 | 1.000000099473604e-05 | 51 | rna-gnl|I4U23|003361-T1 720770 | 10 | 10561352 | 10561402 | Adineta vaga 104782 | TGG|GTAAAAACTT...TTGTTCTTCATA/TTGTTCTTCATA...GTTAG|ATA | 2 | 1 | 35.763 |
3822091 | GT-AG | 0 | 0.001295167354008 | 155 | rna-gnl|I4U23|003361-T1 720770 | 11 | 10561590 | 10561744 | Adineta vaga 104782 | CAA|GTAAGCTATT...CTATTTTTATCG/CCTATTTTTATC...ATTAG|ATT | 0 | 1 | 38.713 |
3822092 | GT-AG | 0 | 1.000000099473604e-05 | 67 | rna-gnl|I4U23|003361-T1 720770 | 12 | 10561851 | 10561917 | Adineta vaga 104782 | CAA|GTAAAAAACA...GAATTCTTGAAT/ATTACATTCATA...TATAG|ATA | 1 | 1 | 40.385 |
3822093 | GT-AG | 0 | 0.0020223438038586 | 67 | rna-gnl|I4U23|003361-T1 720770 | 13 | 10562012 | 10562078 | Adineta vaga 104782 | AAG|GTATAGTTTT...CTGTTTTTGTTT/TCTAGAGTTAGT...ACTAG|AGG | 2 | 1 | 41.868 |
3822094 | GT-AG | 0 | 1.000000099473604e-05 | 52 | rna-gnl|I4U23|003361-T1 720770 | 14 | 10562398 | 10562449 | Adineta vaga 104782 | AAA|GTTAGTCATT...ACTTTCTTATTA/CACTTTCTTATT...TATAG|GGT | 0 | 1 | 46.9 |
3822095 | GT-AG | 0 | 1.000000099473604e-05 | 47 | rna-gnl|I4U23|003361-T1 720770 | 15 | 10562562 | 10562608 | Adineta vaga 104782 | GAG|GTTGGTTGTT...TCATCTTCAGAT/GATATGATCATC...TCTAG|GTC | 1 | 1 | 48.667 |
3822096 | GT-AG | 0 | 1.000000099473604e-05 | 58 | rna-gnl|I4U23|003361-T1 720770 | 16 | 10562744 | 10562801 | Adineta vaga 104782 | AAG|GTTTGATTTC...TTGTTTCTATTT/GTTAAAATAATT...TTTAG|GTC | 1 | 1 | 50.797 |
3822097 | GT-AG | 0 | 1.000000099473604e-05 | 64 | rna-gnl|I4U23|003361-T1 720770 | 17 | 10563000 | 10563063 | Adineta vaga 104782 | ATG|GTTTGAAAGA...ATTTCTTCAACA/AATTTCTTCAAC...TGTAG|GAC | 1 | 1 | 53.92 |
3822098 | GT-AG | 0 | 0.0001142955056092 | 57 | rna-gnl|I4U23|003361-T1 720770 | 18 | 10563157 | 10563213 | Adineta vaga 104782 | CAG|GTATAAACTT...AAATCTTTATTT/TTAGTTTTCATA...TCTAG|ATG | 1 | 1 | 55.387 |
3822099 | GT-AG | 0 | 0.0057961818961238 | 61 | rna-gnl|I4U23|003361-T1 720770 | 19 | 10563283 | 10563343 | Adineta vaga 104782 | AAC|GTATGATTAC...TTAGTTTTGATT/TTAGTTTTGATT...CCTAG|GCG | 1 | 1 | 56.476 |
3822100 | GT-AG | 0 | 1.7176240515815292e-05 | 51 | rna-gnl|I4U23|003361-T1 720770 | 20 | 10563404 | 10563454 | Adineta vaga 104782 | ACA|GTTAGTTGAT...AATTTCTTAATA/TATAGTTTAATT...TTTAG|GCA | 1 | 1 | 57.422 |
3822101 | GT-AG | 0 | 1.1681657196151542e-05 | 61 | rna-gnl|I4U23|003361-T1 720770 | 21 | 10563724 | 10563784 | Adineta vaga 104782 | AAA|GTAAATGCTG...ATCTTTTTAGTG/AATCTTTTTAGT...TTTAG|GGT | 0 | 1 | 61.666 |
3822102 | GT-AG | 0 | 0.000188059542448 | 48 | rna-gnl|I4U23|003361-T1 720770 | 22 | 10564198 | 10564245 | Adineta vaga 104782 | ACT|GTAATGTATA...TATTTTTTGAAT/TATTTTTTGAAT...TCTAG|TCA | 2 | 1 | 68.181 |
3822103 | GT-AG | 0 | 0.0001680730095073 | 56 | rna-gnl|I4U23|003361-T1 720770 | 23 | 10564315 | 10564370 | Adineta vaga 104782 | AAA|GTACGTCATT...TTTGCCTTTTTC/ATCTTCGTCATT...ATTAG|ATC | 2 | 1 | 69.27 |
3822104 | GT-AG | 0 | 0.0783117796422497 | 66 | rna-gnl|I4U23|003361-T1 720770 | 24 | 10564468 | 10564533 | Adineta vaga 104782 | GTT|GTATGTTTAT...GAAATTTTAGAA/CGAAATTTTAGA...TCTAG|GAT | 0 | 1 | 70.8 |
3822105 | GT-AG | 0 | 1.000000099473604e-05 | 425 | rna-gnl|I4U23|003361-T1 720770 | 25 | 10564841 | 10565265 | Adineta vaga 104782 | GAG|GTAATTCGTT...ATATTTTTGATT/ATATTTTTGATT...TTTAG|GTG | 1 | 1 | 75.643 |
3822106 | GT-AG | 0 | 0.0002161105399019 | 55 | rna-gnl|I4U23|003361-T1 720770 | 26 | 10565362 | 10565416 | Adineta vaga 104782 | AAG|GTATGATTGA...ATCTCATTACTT/CTGTATCTCATT...CTCAG|GTA | 1 | 1 | 77.157 |
3822107 | GT-AG | 0 | 0.0002475922484262 | 68 | rna-gnl|I4U23|003361-T1 720770 | 27 | 10565638 | 10565705 | Adineta vaga 104782 | TCT|GTATGAAAAA...TGTTTCTTCGCT/ATCATTTTTATA...TTCAG|GAA | 0 | 1 | 80.644 |
3822108 | GT-AG | 0 | 0.0627069891972979 | 56 | rna-gnl|I4U23|003361-T1 720770 | 28 | 10565852 | 10565907 | Adineta vaga 104782 | GAA|GTATATTGTT...TGGTACTTAAAA/CTGGTACTTAAA...TCTAG|ACG | 2 | 1 | 82.947 |
3822109 | GT-AG | 0 | 1.000000099473604e-05 | 60 | rna-gnl|I4U23|003361-T1 720770 | 29 | 10566044 | 10566103 | Adineta vaga 104782 | CAA|GTGAATAAAA...TGCCTTTTGAGA/CTAAAACTAAAA...ATTAG|GGA | 0 | 1 | 85.092 |
3822110 | GT-AG | 0 | 4.660850935951689e-05 | 217 | rna-gnl|I4U23|003361-T1 720770 | 30 | 10566299 | 10566515 | Adineta vaga 104782 | AAA|GTAAATATCC...GTCCTTTTATTG/TGTCCTTTTATT...TTTAG|GGT | 0 | 1 | 88.168 |
3822111 | GT-AG | 0 | 0.0457805885683197 | 59 | rna-gnl|I4U23|003361-T1 720770 | 31 | 10566701 | 10566759 | Adineta vaga 104782 | AAG|GTATCTGATC...CAATCATTGATT/AAAATATTTATT...CATAG|TTT | 2 | 1 | 91.087 |
3822112 | GT-AG | 0 | 2.287241078989187e-05 | 59 | rna-gnl|I4U23|003361-T1 720770 | 32 | 10566920 | 10566978 | Adineta vaga 104782 | CGA|GTAAAACTTG...TGTTTTTTAAAA/CTGTTTTTTAAA...TAAAG|GCA | 0 | 1 | 93.611 |
3822113 | GT-AG | 0 | 1.000000099473604e-05 | 99 | rna-gnl|I4U23|003361-T1 720770 | 33 | 10567083 | 10567181 | Adineta vaga 104782 | AAG|GTACTAGAAA...CTCCTATTGATC/CTATTGATCATT...TTTAG|TAC | 2 | 1 | 95.252 |
3822114 | GT-AG | 0 | 2.67663845566774e-05 | 55 | rna-gnl|I4U23|003361-T1 720770 | 34 | 10567330 | 10567384 | Adineta vaga 104782 | AAT|GTACGTAAAC...TCTTTCTGAAAT/CATATTATAATT...TCTAG|GCA | 0 | 1 | 97.586 |
3822115 | GT-AG | 0 | 1.000000099473604e-05 | 61 | rna-gnl|I4U23|003361-T1 720770 | 35 | 10567486 | 10567546 | Adineta vaga 104782 | TTG|GTAAGTTAAA...TTTTTCTTTTCA/TTTCTTTTCACA...TTTAG|TAT | 2 | 1 | 99.18 |
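In the rows above, length appears to equal end - start + 1, which would mean start and end are inclusive coordinates. That is an inference from the displayed values, not something the column descriptions state. A minimal sketch of the check, using three rows copied from the table (the helper name `inclusive_length` is hypothetical):

```python
# (id, start, end, length) copied from three rows of the table above.
sample = [
    (3822081, 10558936, 10558995, 60),
    (3822091, 10561590, 10561744, 155),
    (3822105, 10564841, 10565265, 425),
]


def inclusive_length(start, end):
    """Length of an interval whose start and end are both included."""
    return end - start + 1


# Collect any row whose stored length disagrees with the inclusive formula.
mismatches = [
    intron_id
    for intron_id, start, end, length in sample
    if inclusive_length(start, end) != length
]
print(mismatches)
```

An empty list means the sampled rows are consistent with inclusive coordinates; it does not prove the convention holds for every row in the database.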
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id] ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase] ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor] ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair] ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score] ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds] ON [introns] ([in_cds]);
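The indexes in this schema suggest that filters such as transcript_id = 720770 can be served without a full table scan. The sketch below is one way to check that locally with SQLite's EXPLAIN QUERY PLAN. It assumes a simplified copy of the table (fewer columns, no foreign key clauses) in an empty in-memory database, so it shows the planner's choice, not the behaviour of the hosted instance.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Simplified copy of the schema (assumption: only the columns needed here),
# with the transcript_id index named as in the DDL above.
conn.executescript(
    """
    CREATE TABLE "introns" (
        "id" INTEGER,
        "length" INTEGER,
        "transcript_id" INTEGER,
        "ordinal_index" INTEGER,
        PRIMARY KEY ([id])
    );
    CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
    """
)

# EXPLAIN QUERY PLAN returns one row per plan step; the human-readable
# description is the last column of each row.
plan_rows = conn.execute(
    "EXPLAIN QUERY PLAN "
    "SELECT id, length FROM introns WHERE transcript_id = ?",
    (720770,),
).fetchall()
plan_details = [row[-1] for row in plan_rows]

for detail in plan_details:
    print(detail)
```

The exact wording of the plan text differs between SQLite versions, so it is safer to look for the index name in the output than to compare whole strings.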