home / WtMTA

introns

Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)

id
INTEGER (primary key), globally unique identifier for each intron
dinucleotide_pair
TEXT, terminal dinucleotide sequences of the intron
is_minor
INTEGER, indicates if the intron is a minor intron (1) or not (0)
score
REAL, score representing the probability (0-100%) of the intron being minor
length
INTEGER, length of the intron in base pairs
transcript_id
INTEGER (foreign key referencing transcripts(id)), parent transcript
ordinal_index
INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
start
INTEGER, start position of the intron in the genome
end
INTEGER, end position of the intron in the genome
taxonomy_id
INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
scored_motifs
TEXT, motifs scored for the intron
phase
INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
in_cds
INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
relative_position
REAL, relative position of the intron within the transcript (as a percentage of coding length)

16 rows where transcript_id = 15550478

✎ View and edit SQL

This data as json, CSV (advanced)

Suggested facets: dinucleotide_pair, score, phase, in_cds

id ▼ dinucleotide_pair is_minor score length transcript_id ordinal_index start end taxonomy_id scored_motifs phase in_cds relative_position
84003949 GT-AG 0 1.000000099473604e-05 270 rna-XM_028391186.1 15550478 3 55116227 55116496 Glycine soja 3848 ATA|GTTAAGTATA...TTATTTTTACAT/ATTATTTTTACA...TACAG|GAG 0 1 9.423
84003950 GT-AG 0 1.000000099473604e-05 102 rna-XM_028391186.1 15550478 4 55116004 55116105 Glycine soja 3848 AAG|GTGTGAATTG...TTAATTGTAAAT/TGTAAATTTATT...TTCAG|GTC 1 1 11.98
84003951 GT-AG 0 1.000000099473604e-05 365 rna-XM_028391186.1 15550478 5 55115520 55115884 Glycine soja 3848 AAG|GTCATATATC...TGCTTCTTATTT/CTGCTTCTTATT...TACAG|GGT 0 1 14.494
84003952 GT-AG 0 0.0017953269348022 100 rna-XM_028391186.1 15550478 6 55115366 55115465 Glycine soja 3848 TCT|GTAAGTTTGT...CTTTTTTTATTA/CCTTTTTTTATT...TGTAG|GTG 0 1 15.635
84003953 GT-AG 0 0.0002143385106449 308 rna-XM_028391186.1 15550478 7 55114869 55115176 Glycine soja 3848 CAG|GTAGCACTAC...ATCTTTTTGATT/ATCTTTTTGATT...TTCAG|GTT 0 1 19.628
84003954 GT-AG 0 1.000000099473604e-05 116 rna-XM_028391186.1 15550478 8 55114597 55114712 Glycine soja 3848 TTT|GTTAGTATAT...ATTTTTTTAATA/ATTTTTTTAATA...TATAG|GAT 0 1 22.924
84003955 GC-AG 0 1.000000099473604e-05 92 rna-XM_028391186.1 15550478 9 55114409 55114500 Glycine soja 3848 AAG|GCAAGATTCT...TTTCTTTTGATG/CTTTTGCTTATG...TGCAG|CTA 0 1 24.952
84003956 GT-AG 0 2.6584149898679537e-05 1289 rna-XM_028391186.1 15550478 10 55110835 55112123 Glycine soja 3848 CAG|GTAAGTTTCT...GGAGCTTTGATG/TCCCTACTAACT...AGCAG|GAA 2 1 73.231
84003957 GT-AG 0 1.000000099473604e-05 440 rna-XM_028391186.1 15550478 11 55110225 55110664 Glycine soja 3848 AAG|GTTGGTAGTA...ATTCTCATATTC/AATATTCTCATA...TTTAG|GAT 1 1 76.822
84003958 GT-AG 0 9.70953277944502e-05 98 rna-XM_028391186.1 15550478 12 55109867 55109964 Glycine soja 3848 AAT|GTGTGTATGA...ATAAACTTAATT/ACTTAATTTACA...TGTAG|AGT 0 1 82.316
84003959 GT-AG 0 1.000000099473604e-05 105 rna-XM_028391186.1 15550478 13 55109588 55109692 Glycine soja 3848 GCG|GTCAGTTCAC...AAGATTTTGATC/AAGATTTTGATC...TGCAG|AAT 0 1 85.992
84003960 GT-AG 0 1.068695156671848e-05 1444 rna-XM_028391186.1 15550478 14 55108006 55109449 Glycine soja 3848 AAG|GTATTGGTTT...ATTGTCTAACAT/AATTGTCTAACA...ATCAG|GGA 0 1 88.908
84003961 GT-AG 0 1.000000099473604e-05 1652 rna-XM_028391186.1 15550478 15 55106112 55107763 Glycine soja 3848 CAA|GTAAGATAAT...GGATCTATGATT/ATAATTGTGAAT...TATAG|GTA 2 1 94.021
84003962 GT-AG 0 0.0003015412263482 871 rna-XM_028391186.1 15550478 16 55105129 55105999 Glycine soja 3848 AAG|GTATGTGTGC...TTTCCCTTTTCT/CCCTTTTCTATT...CGCAG|GTT 0 1 96.387
84012264 GT-AG 0 1.000000099473604e-05 420 rna-XM_028391186.1 15550478 1 55117715 55118134 Glycine soja 3848 CAA|GTGAGTAATA...TATATTTTATTC/ATATATTTTATT...TTCAG|GTT   0 5.409
84012265 GT-AG 0 0.0025405539665968 986 rna-XM_028391186.1 15550478 2 55116626 55117611 Glycine soja 3848 TTG|GTATTGGTTT...GTTTTTTTAATC/TTTAATCTCATT...AACAG|ATT   0 7.585

Advanced export

JSON shape: default, array, newline-delimited, object

CSV options:

CREATE TABLE "introns" (
"id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL
  ,PRIMARY KEY ([id]),
   FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
   FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
    ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
    ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
    ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
    ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
    ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
    ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
    ON [introns] ([in_cds]);
Powered by Datasette · Queries took 26.75ms · Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)