UGA Bulletin Logo

Environmental Data Science in Genomics


Course Description

An introduction to data science tools used in analyses of Environmental-Omics datasets, covering conceptual and practical aspects. This course will establish foundational scientific programming skills (Unix command line, R, Python, Git), and introduce common bioinformatic tools used in metabarcoding, metagenomic, and metatranscriptomic studies of marine, terrestrial, and host-associated ecosystems.


Athena Title

Environmental Genomics


Semester Course Offered

Offered spring


Grading System

A - F (Traditional)


Student learning Outcomes

  • Upon successful completion of this course, the student will understand fundamental principles and approaches in Environmental-Omics and be able to apply these to their own graduate research.
  • Upon successful completion of this course, the student will be able to design customized bioinformatics workflows to test ecological and evolutionary hypotheses in metabarcoding, metagenomic, and transcriptomic studies.
  • Upon successful completion of this course, the student will be able to explain the utility and limitations of public sequence database resources for environmental studies of largely undescribed organisms.
  • Upon successful completion of this course, the student will be able to evaluate different software tools and be able to choose the most appropriate one for their research needs.
  • Upon successful completion of this course, the student will have developed a working knowledge of foundational programming skills (Unix, R, Python, Git), and be able to apply these skills in a practical data analysis exercise
  • Upon successful completion of this course, the student will be able to compare and contrast analytical tools and sequencing approaches used in different types of Environmental-Omics studies.
  • Upon successful completion of this course, the student will be able to design a rigorous Environmental-Omics study, including fieldwork, hypothesis testing, sequencing strategy, and bioinformatic data analysis approaches.

Topical Outline

  • The Unix Shell
  • Version Control with Git
  • Programming with Python
  • Programming with R and R Studio
  • HPCC computing and introduction to the GACRC
  • Intro to Environmental-Omics workflows, and compare and contrast methodological approaches
  • Current landscape of sequencing platforms and technologies (long-read vs. short-read)
  • Metabarcoding study design and computational workflows
  • Metagenomic study design and computational workflows
  • Metatranscriptomic study design and computational workflows
  • Machine Learning and Artificial Intelligence software tools
  • Environmental-Omics Capstone Project – design a data analysis workflow and apply it to a problem of relevance related to the student’s graduate research; students will present their approach and analysis results to the entire class

Syllabus


Public CV