Sophia S. Liau

DNA Processing

September 2024 Algorithmic Systems CPEN 221 Labs

This set of labs focused on the design and implementation of various methods for processing DNA sequences, such as protein recognition, splicing and cutting. These labs were incredibly useful for my understanding of data structures, constructors and writing parameters. They were completed in Java. In hindsight, I realized they were much more simple then I had thought, but they were a great introduction to the world of algorithms and data structures. Note: Currently, the code is not available for public viewing as they are part of my course materials.

DNA Processing Project Image

Technical Overview

The DNA Processing labs involved building a DNA datatype from scratch to model and analyze sequences. Given a string, I created a method to parse through the codons, removing junk and grouping the nucleotides into groups of three. Using this method, I could then take a clean set of strings, to then use for codon mutations, such as splicing and cutting with complimentary enzymes. Furthermore, we were able to identify if it were a protein based on biological criteria such as start and stop codon, size and CG mass. Finally, there were other various smaller methods working with individual nucleotides, such as mass, specific nucleotide content, and nucleotide counts.