Projects

Miscellaneous

Genome Comparison - Pgenerosa_v074 vs Pgenerosa_v070 with MUMmer Promer on Mox

  • 3 min read

In continuing to further improve our geoduck genome annotation, I’m attempting to figure out why Scaffold 1 of our assembly doesn’t have any annotations. As part of that I’ve decided to perform a series of genome comparisons and see how they match up, with an emphasis on Scaffold 1, using MUMmer 3.23 (specifically, promer for protein level comparisons). This software is specifically designed to do this type of comparison.

Read More

Genome Comparison - Pgenerosa_v074 vs S.glomerata NCBI with MUMmer Promer on Mox

  • 3 min read

In continuing to further improve our geoduck genome annotation, I’m attempting to figure out why Scaffold 1 of our assembly doesn’t have any annotations. As part of that I’ve decided to perform a series of genome comparisons and see how they match up, with an emphasis on Scaffold 1, using MUMmer 3.23 (specifically, promer for protein level comparisons). This software is specifically designed to do this type of comparison.

Read More

Genome Comparison - Pgenerosa_v074 vs M.yessoensis NCBI with MUMmer Promer on Mox

  • 3 min read

In continuing to further improve our geoduck genome annotation, I’m attempting to figure out why Scaffold 1 of our assembly doesn’t have any annotations. As part of that I’ve decided to perform a series of genome comparisons and see how they match up, with an emphasis on Scaffold 1, using MUMmer 3.23 (specifically, promer for protein level comparisons). This software is specifically designed to do this type of comparison.

Read More

Genome Comparison - Pgenerosa_v074 vs H.sapiens NCBI with MUMmer Promer on Mox

  • 3 min read

In continuing to further improve our geoduck genome annotation, I’m attempting to figure out why Scaffold 1 of our assembly doesn’t have any annotations. As part of that I’ve decided to perform a series of genome comparisons and see how they match up, with an emphasis on Scaffold 1, using MUMmer 3.23 (specifically, promer for protein level comparisons). This software is specifically designed to do this type of comparison.

Read More

Genome Comparison - Pgenerosa_v074 vs C.virginica NCBI with MUMmer Promer on Mox

  • 3 min read

In continuing to further improve our geoduck genome annotation, I’m attempting to figure out why Scaffold 1 of our assembly doesn’t have any annotations. As part of that I’ve decided to perform a series of genome comparisons and see how they match up, with an emphasis on Scaffold 1, using MUMmer 3.23 (specifically, promer for protein level comparisons). This software is specifically designed to do this type of comparison.

Read More

Genome Comparison - Pgenerosa_v074 vs C.gigas NCBI with MUMmer Promer on Mox

  • 3 min read

In continuing to further improve our geoduck genome annotation, I’m attempting to figure out why Scaffold 1 of our assembly doesn’t have any annotations. As part of that I’ve decided to perform a series of genome comparisons and see how they match up, with an emphasis on Scaffold 1, using MUMmer 3.23 (specifically, promer for protein level comparisons). This software is specifically designed to do this type of comparison.

Read More

Genome Comparison - Pgenerosa_v074 vs Pgenerosa_v070 with MUMmer on Mox

  • 3 min read

In continuing to further improve our geoduck genome annotation, I’m attempting to figure out why Scaffold 1 of our assembly doesn’t have any annotations. As part of that I’ve decided to perform a series of genome comparisons and see how they match up, with an emphasis on Scaffold 1, using MUMmer (v4) (specifically, nucmer for nucleotide comparisons). This software is specifically designed to do this type of comparison.

Read More

Genome Comparison - Pgenerosa_v074 vs Pgenerosa_v074 with MUMmer on Mox

  • 3 min read

In continuing to further improve our geoduck genome annotation, I’m attempting to figure out why Scaffold 1 of our assembly doesn’t have any annotations. As part of that I’ve decided to perform a series of genome comparisons and see how they match up, with an emphasis on Scaffold 1, using MUMmer (v4) (specifically, nucmer for nucleotide comparisons). This software is specifically designed to do this type of comparison.

Read More

Genome Comparison - Pgenerosa_v074 vs S.glomerata NCBI with MUMmer on Mox

  • 3 min read

In continuing to further improve our geoduck genome annotation, I’m attempting to figure out why Scaffold 1 of our assembly doesn’t have any annotations. As part of that I’ve decided to perform a series of genome comparisons and see how they match up, with an emphasis on Scaffold 1, using MUMmer (v4) (specifically, nucmer for nucleotide comparisons). This software is specifically designed to do this type of comparison.

Read More

Genome Comparison - Pgenerosa_v074 vs M.yessoensis NCBI with MUMmer on Mox

  • 3 min read

In continuing to further improve our geoduck genome annotation, I’m attempting to figure out why Scaffold 1 of our assembly doesn’t have any annotations. As part of that I’ve decided to perform a series of genome comparisons and see how they match up, with an emphasis on Scaffold 1, using MUMmer (v4) (specifically, nucmer for nucleotide comparisons). This software is specifically designed to do this type of comparison.

Read More

Genome Comparison - Pgenerosa_v074 vs H.sapiens NCBI with MUMmer on Mox

  • 3 min read

In continuing to further improve our geoduck genome annotation, I’m attempting to figure out why Scaffold 1 of our assembly doesn’t have any annotations. As part of that I’ve decided to perform a series of genome comparisons and see how they match up, with an emphasis on Scaffold 1, using MUMmer (v4) (specifically, nucmer for nucleotide comparisons). This software is specifically designed to do this type of comparison.

Read More

Genome Comparison - Pgenerosa_v074 vs C.gigas NCBI with MUMmer on Mox

  • 3 min read

In continuing to further improve our geoduck genome annotation, I’m attempting to figure out why Scaffold 1 of our assembly doesn’t have any annotations. As part of that I’ve decided to perform a series of genome comparisons and see how they match up, with an emphasis on Scaffold 1, using MUMmer (v4) (specifically, nucmer for nucleotide comparisons). This software is specifically designed to do this type of comparison.

Read More

Genome Comparison - Pgenerosa_v074 vs C.virginica NCBI with MUMmer on Mox

  • 3 min read

In continuing to further improve our geoduck genome annotation, I’m attempting to figure out why Scaffold 1 of our assembly doesn’t have any annotations. As part of that I’ve decided to perform a series of genome comparisons and see how they match up, with an emphasis on Scaffold 1, using MUMmer (v4) (specifically, nucmer for nucleotide comparisons). This software is specifically designed to do this type of comparison.

Read More

Data Summary - P.generosa Transcriptome Assemblies Stats

  • 1 min read

In our continuing quest to wrangle the geoduck transcriptome assemblies we have, I was tasked with compiling assembly stats for our various assemblies. The table below provides an overview of some stats for each of our assemblies. Links within the table go to the the notebook entries for the various methods from which the data was gathered. In general:

Read More

FastQC-MultiQC - Additional C.gigas WGBS Sequencing Data from Genewiz Received 20190501

  • ~1 min read

Earlier today, we received the additional G.gigas sequencing data from Genewiz. Wanted to run through FastQC again and get an updated report for each data set. Admittedly, it probably won’t look much different from the initial FastQC run on 20190415, due to the fact that the additional sequencing was simply appended to the previous data. Since FastQC examines a subset of the data in each file, I’d fully expect the FastQC report to look the same. However, we’ll have a greater number of sequences in each file. This should, in turn, increase the number of reads retained after quality trimming.

Read More

Data Analysis - C.virginica MBD Sequencing Coverage

  • 2 min read

A while ago, Steven tasked me with assessing some questions related to the sequencing coverage we get doing MBD-BSseq in this GitHub issue. At the heart of it all was really to try to get an idea of how much usable data we actually get when we do an MBD-BSseq project. Yaamini happened to have done an MBD-BSseq project relatively recently, and it’s one she’s actively working on analyzing, so we went with that data set.

Read More

Metagenomics Gene Prediction - P.generosa Water Samples Using MetaGeneMark on Mox to Compare pH Treatments

  • 2 min read

Continuing with a relatively quick comparison of pH treatments (pH=7.1 vs. pH=8.2), I wanted to run gene prediction on the MEGAHIT assemblies I made yesterday. I ran MetaGeneMark on the two pH-specific assemblies on Mox. This should be a very fast process (I’m talking, like a couple of minutes fast), so it enhances the annotation with very little effort and time.

Read More

Metagenome Assemblies - P.generosa Water Samples Trimmed HiSeqX Data Using Megahit on Mox to Compare pH Treatments

  • 2 min read

A report involving our work on the geoduck water metagenomics is due later this week and our in-depth analysis for this project using Anvi’o is likely to require at least another week to complete. Even though we have a broad overview of the metagenomic taxa present in these water samples, we don’t have data in a format for comparing across samples/treatments. So, I initiated our simplified pipeline (MEGAHIT > MetaGeneMark > BLASTn > KronaTools) for examining our metagenomic data of the two treatments:

Read More

RNA Isolation and Quantification - Crab Hemolypmh Using Quick-DNA-RNA Microprep Plus Kit

  • 1 min read

In the continuing struggle to isolate RNA from the Chionoecetes bairdi hemolymph preserved in RNAlater, Pam managed to find the Quick-DNA-RNA Microprep Plus Kit (ZymoResearch) as a potential option. We received a free sample of the kit and so I gave it a shot. Grace pulled 10 samples she’d previously used to isolate RNA (and was unable to get anything out of virtually all of them using the RNeasy Plus Micro Kit (Qiagen)) for me to try out this new kit:

Read More

Transcriptome Assembly - Geoduck Tissue-specific Assembly Larvae Day5 EPI99 with HiSeq and NovaSeq Data on Mox

  • 2 min read

I previously assembled and annotated P.generosa larval Day 5 transcriptome (20190318 - mislabeled as Juvenile Day 5 in my previous notebook entries) using just our HiSeq data from our Illumina collaboration. This was a an oversight, as I didn’t realize that we also had NovaSeq RNAseq data. So, I’ve initiated another de novo assembly using Trinity incorporating both sets of data.

Read More

Data Management - Data Migration and Drive Expansion on Gannet

  • 1 min read

A little while ago, we installed some additional hard drives in Gannet (Synology RS3618XS) with the intention of expanding the total storage space. However, the original set of HDDs were set up as RAID10. As it turns out, RAID10 configurations cannot be expanded! So, the new set of HDDs were configured as a separate volume (Volume 2) in a RAID6 configuration. After backing up the /volume1/web directory (via rsync) to our UW Google Drive, I begane the data migration.

Read More

Genome Assessment - BUSCO Metazoa on P.generosa v071 on Mox

  • 6 min read

Ran BUSCO on our completed annotation of the P.generosa v071 genome (GFF) (subset of sequences >10kbp). See this notebook entry for genome annotation info. This provides a nice metric on how “complete” a genome assembly (or transcriptome) is. Additionally, BUSCO is tied in with Augustus for gene prediction and generates ab initio gene models. With that said, since I just want to evaluate the completeness of this particular genome assembly, I’ll be using the annotated genome generated through two rounds of SNAP gene prediction. Otherwise, I’d use the initial MAKER annotations to generate an Augustus gene model that could be used in conjuction with the SNAP models (I’ll likely do this at a later date).

Read More

Data Wrangling - CpG OE Calculations on C.virginica Genes

  • 2 min read

Steven tasked me with processing ~90 FastA files containing gene sequences from C.virginica in this GitHub Issue. He needed to determine the Observed/Expected (O/E) ratio of CpGs in each FastA. He provided this example code and this link to all the files. Additionally, today, he tasked Kaitlyn with merging all of the output CpG O/E values for each sample in to a single file, but I decided to tackle it anyway.

Read More

Methylation Analysis - C.virginica

  • 4 min read

This is a quick and dirty (i.e. no adaptor/quality trimming) assessment of all of our Crassostrea irginica bisulfite sequencing efforts to date in order to get a rough idea of the methylation mapping, per this GitHub issue. Ran Bismark on Mox on a series of subset of the reads:

Read More

DNA Isolation and Quantification - Ronit’s C.gigas Ploidy Ctenidia

  • ~1 min read

Last week, I isolated DNA from all of Ronti’s ctenidia samples, however one sample (D18-C) didn’t isolate properly. So, I performed another isolation procedure with another section of frozen tissue. Tissue was excised from frozen tissue block via razor blade (weight not recorded) and pulverized under liquid nitrogen. Samples were incubated O/N @ 37oC (heating block) in 350uL of MB1 Buffer + 25uL Proteinase K, per the E.Z.N.A. Mollusc DNA Kit (Omega) instructions.

Read More

qPCR - Relative mitochondrial abundance in C.gigas diploids and triploids subjected to acute heat stress via COX1

  • 2 min read

Using the C.gigas cytochrome c oxidase (COX1) primers (SR IDs: 1713, 1714)I designed the other day, I ran a qPCR on a subset of Ronit’s diploid/triploid control/heat shocked oyster DNA that Shelly had previously isolated and performed global DNA methylation assay. The goal is to get a rough assessment of whether or not there appear to be differences in relative mitochondrial abundances between these samples.

Read More

FastQC and Trimming - Metagenomics (Geoduck) HiSeqX Reads from 20180809

  • 1 min read

Steven tasked me with assembling our geoduck metagenomics HiSeqX data. The first part of the process is examining the quality of the sequencing reads, performing quality trimming, and then checking the quality of the trimmed reads. It’s also possible (likely) that I’ll need to run another round of trimming. The process is documented in the Jupyter Notebook linked below. After these reads are cleaned up, I’ll transfer them over to our HPC nodes (Mox) and try assembling them.

Read More

Installation - Microsoft Machine Learning Server (Microsoft R Open) on Emu/Roadrunner R Studio Server

  • ~1 min read

Steven recently saw an announcement that Microsoft R Open now handles multi-threaded processing (default R does not), so we were interested in trying it out. I installed MLR/MRO on Emu/Roadrunner (Apple Xserve; Ubuntu 16.04). Followed the Microsoft installation directions for Ubuntu. In retrospect, I think I could’ve just installed MRO, but this gets the job done as well and won’t hurt anything.

Read More

Mox - Password-less SSH!

  • 1 min read

The high performance computing (HPC) cluster (called Mox) at Univ. of Washington (UW) frustratingly requires a password when SSH-ing, even when SSH keys are in use. I have a lengthy, unintelligible password that I use for my UW account, so having to type this in any time I want to initiate a new SSH session on Mox is a painful process.

Read More

Total Alkalinity Calculations - Yaamini’s Ocean Chemistry Samples

  • 1 min read

I ran a subset of Yaamini’s ocean chemistry samples on our T5 Excellence titrator (Mettler Toledo) at the beginning of April. The subset were samples taken from the beginning, middle, and end of the experiment. The rationale for this was to assess whether or not total alkalinity (TA) varied across the experiment. If there was little variation, then there’d likely be no need to run all of the samples. However, should there be temporal differences, then all samples should be processed.

Read More

FAIL - Missing Data on Owl!

  • 1 min read

Uh oh. There appears to be some data that’s been removed from Owl. I noticed this earlier when trying to look at some of Sean’s data. His data should be in a folder with his name in Owl/scaphapoda

Read More

RNA Isolation - Olympia oyster gonad tissue in paraffin histology blocks

  • 1 min read

My previous go at this was a little premature - I didn’t wait for Laura to fully annotate her slides/blocks. Little did I know, the tissue was mostly visceral mass and, as such, I didn’t hit much in the way of actual gonad tissue. So, I’m redoing this, now that Grace has gone through and annotated the blocks to point out gonad tissue. SN-10-16 was sent to Katherine Silliman on 20170720.

Read More

RNA Isolation - Olympia oyster gonad tissue in paraffin histology blocks

  • 2 min read

UPDATE 20170712: The RNA I isolated below is from incorrect regions of tissue. I misunderstood exactly what this tissue was, and admittedly, jumped the gun. The tissue is actually collected from the visceral mass - which contains gonad (a small amount) and digestive gland (a large amount). The RNA isolated below will be stored in one of the Shellfish RNA boxes and I will isolate RNA from the correct regions indicated by Grace

Read More

Data Received – Jay’s Coral epiRADseq - Not Demultiplexed

  • 1 min read

Previously downloaded Jay’s epiRADseq data that was provided by the Genomic Sequencing Laboratory at UC-Berkeley. It was provided already demultiplexed (which is very nice of them!). To be completionists on our end, we requested the non-demultiplexed data set.

Read More

Oyster Sampling - Olympia Oyster OA Populations at Manchester

  • 1 min read

I helped Katherine Silliman with her oyster sampling today from her ocean acidification experiment with Olympia oysters (Ostrea lurida) at the Kenneth K. Chew Center for Shellfish Research & Restoration, which is housed at the NOAA Northwest Fisheries Science Center at Manchester in a partnership with the [Puget Sound Restoration Fund (PSRF)(http://www.restorationfund.org/). We sampled the following tissues and stored in 1mL RNAlater:

Read More

Computing - Amazon EC2 Cost “Analysis”

  • 3 min read

I recently moved some computing jobs over to Amazon’s Elastic Cloud Computing (EC2) in attempt to avoid some odd computing issues/errors I kept encountering on our lab computers (Apple Xserve 3,1).

Read More

Dissection - Frozen Geoduck & Pacific Oyster

  • 2 min read

We’re working on a project with Washington Department of Natural Resources’ (DNR) Micah Horwith to identify potential proteomic biomarkers in geoduck (Panopea generosa) and  Pacific oyster (Crassostrea gigas). One aspect of the project is how to best conduct sampling of juvenile geoduck (Panopea generosa) and Pacific oyster (Crassostrea gigas) to minimize changes in the proteome of ctenidia tissue during sampling. Generally, live animals are shucked, tissue dissected, and then the tissue is “snap” frozen. However, Micah’s crew will be collecting animals from wild sites around Puget Sound and, because of the remote locations and the means of collection, will have limited tools and time to perform this type of sampling. Time is a significant component that will have great impact on proteomic status in each individual.

Read More

Opticon2 Calibration

  • 2 min read

Jake and Steven recently noticed localized “hot spots” on most of Jake’s recent qPCRs, where higher levels of fluorescence were consistently showing up in interior portions of the plates than the outer portion of the plates.

Read More

Reverse Transcription - O.lurida DNased RNA Controls and 1hr Heat Shock

  • ~1 min read

Performed reverse transcription on the Olympia oyster DNased RNA from the control samples and the 1hr heat shock samples of Jake’s project. To accommodate the large numbers of anticipated genes to be targeted in subsequent qPCRs, I prepared 100μL reactions (normally, 25μL reactions are prepared) using 250ng of each DNased RNA. A 1:10 dilution of the oligo dT primers (Promega) was prepared to improve pipetting accuracy. All incubations were performed in a thermal cycler without using a heated lid.

Read More

Gel Purification - Olympia Oyster and Sea Pen PCRs

  • ~1 min read

Purified DNA from the remaining PCR bands excised by Jake on 20150609 and 20150610, as well as Jonathan’s sea pen PCRs from 20150604, using Ultrafree-DA spin columns (Millipore). Transferred gel pieces from storage tubes (1.5mL snap cap tubes) to spin columns. Spun 10,000g, 5mins @ RT. Transferred purified DNA back to original storage tubes. See the sequence_log (Google Sheet) for a full list of the samples and the sequencing plates layouts. Purified DNA was stored @ 4C O/N. Will prepare and submit plates for Sanger sequencing tomorrow.

Read More

DNA Isolation - Test Sample

  • 1 min read

Due to the recent poor quality gDNA that has been isolated from C.gigas gonad, I decided to do a quick test using TE for DNA pellet resuspension in hopes that old Buffer EB (Qiagen) or old nuclease-free H2O (Promega) are to blame for the apparent, rapid degradation that I’ve experienced.

Read More

DNA Quality Check - Yanouk’s Oyster gDNA

  • ~1 min read

We’ve had some Illumina sequencing issues with Yanouk’s samples, so I ran the samples out on a 0.8% agarose gel to evaluate the levels of degradation. Loaded 2uL of each sample. Did not load equal quantities of gDNA, due to the lack of available gDNA in the samples we submitted for Illumina sequencing. Added 2uL of H2O to samples 37 & 38 in hopes of having sufficient DNA for visualization on the gel.

Read More

DNA Isolation - Geoduck

  • ~1 min read

Isolated additional geoduck gDNA from the two fresh (now frozen) geoduck’s that Brent provided me with on 20140212 so that we can potentially isolate RNA from the same geoducks to tie in with the DNA Illumina sequencing that Axa will be conducting. Isolated DNA using the DNeasy Blood & Tissue Kit (Qiagen) according to the manufacturer’s protocol (incubated minced siphon tissue at 56C for 3hrs). Eluted with 75uL of ddH2O and spec’d on NanoDrop1000.

Read More

DNA Precipitation - Geoduck DNA from 20140213

  • ~1 min read

After speaking with Axa regarding the DNA concentrations, he would like the DNA from the ethanol-fixed tissue to be more concentrated, and he wants them in ddH2O instead of Buffer AE (from the Qiagen DNeasy Kit). So, I preformed a standard ethanol precipitation. Added 0.1 volumes of 3M sodium acetate (pH = 5.2) [15uL], 2.5 volumes of 100% ethanol [412.5uL] and incubated @ -20C over the weekend.

Read More

DNA Isolation - Geoduck

  • ~1 min read

Since yesterday’s DNA isolation failed to yield sufficient quantity of DNA from the ethanol-fixed samples, I isolated additional DNA from the same samples.

Read More

DNA Isolation - Geoduck

  • ~1 min read

Isolated gDNA from 6 geoduck siphon samples provided by Brent using the Qiagen DNeasy Spin Kit. Samples were as follows:

Read More

Gylcogen and Carboyhydrate Assays - Emma’s C.gigas Whole Body Samples

  • ~1 min read

Samples were previously freeze dried overnight and stored @ -20C. To maximize sample homogeneity and, thus, increase accuracy of both assays, all samples were mechanically pulverized in their existing tubes. Approximately half of each sample was weighed and used for the glycogen assay. The remainder of each sample was stored @ -20C.

Read More