Introduction to high-throughput Sequencing Data Analysis
Overview of next-generation sequencing data analysis including hands-on practice using computational tools to solve a variety of biological problems. Topics include: Quality Control, contamination analysis, Genome assembly, alignments, genome annotation, RNASeq, Atac-Seq, ChIP-Seq and SNP analysis. Students will be able to work at their own pace and choose topics they are most interested in learning for the majority of the course.
Prerequisite for this course is EEOB546X “Computational Skills for Biological Data” Students will be provided access to high performance computing clusters to work on the data analysis.
Provide a hands-on experience in analyzing next-gen sequencing data. Students will understand various sequencing technologies and how to apply appropriate tools to analyze big data.
Graduate students interested in gaining experience in a variety of high-throughput sequencing data analyses tailored to their needs.
Instructor: | Andrew Severin |
severin@iastate.edu | |
Office Hours: | Tue 10-11 in 206 Science I |
Class Meetings: | TR 8:00 – 9:20 AM in MBB 1340 |
Textbook: No required textbook. Course materials and research articles will be provided and taken mainly from the Bioinformatics Workbook at https://github.com/ISUgenomics/bioinformatics-workbook/blob/master/index.md
|Points | Description|
|-|-|
|20| Slack, XSEDE, Github Assignment|
|20| Regular expressions Assignment|
|40| Project plans |
|20|Alignment project|
|100|RNA-Seq project|
|400|Projects 1-4 (100 points each)|
|600 |Total|
Projects 1, 2, 3 and 4 are determined by the student and can be chosen based on their interests. The following topics are currently available to choose from:
Project Type | Description | ||
---|---|---|---|
Genome assembly | Illumina | Pacbio | hybrid |
Genome annotation | Gene models | functional prediction | |
SNP Calling | 90 fish exp | ||
RNA-Seq | with a genome | without a genome | Comparison of Results |
Metagenomics | Qiime | ||
Dex-Seq | |||
Atac-Seq | |||
ChIP-Seq |
Example 1 | |
---|---|
Genome assembly Pacbio | (hybrid, illumina, comparison) |
Genome annotation (gene models) | (functional annotation) |
RNA-Seq without a Genome | (comparison to with a genome) |
Dex-Seq an extension to the RNA-Seq analysis |
Project plans are required for each of the four projects chosen by the students and are to be completed prior to the start of the next project time start period.
Multiple projects can be completed in each time period. However, projects must be designated as one of the four main class projects or a bonus project prior to start and cannot be changed. Class projects are out of 100 points as described above. Bonus projects are out of 25 points but have a a maximum possible points of 50, meaning that 25 extra credit points could be achieved per bonus project.
Students are expected to gain an understanding of the importance of bioinformatic approaches in modern biological research. For each topic covered, they will understand the fundamental biological questions and be able to apply appropriate computational tools. Students will understand basic principles underlying computational approaches, realize the limitations of available tools, and be able to critically interpret results.
Assignments turned in late will receive a 10% reduction off of the total possible points for every day it is turned in late until 10 days after the assignment is due at which time no points will be given even if the assignment is turned in late.
Students are required to complete projects and assignments independently but can work together to help each other by asking questions as needed while going through the provided tutorials. Projects will be recorded using markdown in a GitHub private repositories provided by the instructor to each student. Projects are expected to include an Introduction summarizing the project goals and species, Methods that include each command used to generate output used in the results, Results describing the major findings, a Discussion section that describes the meaning behind several results discovered and a Future Directions section that describes where the student would take the project from here. Figures and tables should be included in the appropriate sections. The report should reflect the students understanding beyond just getting an answer.
If you have a disability and require accommodations, please contact one of the instructors early in the semester so that your learning needs may be appropriately met. You will need to provide documentation of your disability to the Disability Resources (DR) office, located on the main floor of the Student Services Building, Room 1076, 515-294-6624.
Iowa State University strives to maintain our campus as a place of work and study for faculty, staff, and students that is free of all forms of prohibited discrimination and harassment based upon race, ethnicity, sex (including sexual assault), pregnancy, color, religion, national origin, physical or mental disability, age, marital status, sexual orientation, gender identity, genetic information, or status as a U.S. veteran. Any student who has concerns about such behavior should contact his/her instructor, Student Assistance at 515-294-1020 or email dso-sas@iastate.edu, or the Office of Equal Opportunity and Compliance at 515-294-7612.
If an academic or work requirement conflicts with your religious practices and/or observances, you may request reasonable accommodations. Your request must be in writing, and your instructor or supervisor will review the request. You or your instructor may also seek assistance from the Dean of Students Office or the Office of Equal Opportunity and Compliance.
After Week 7, course topics will depend on the majority interest of the class. Projects turned in by the students will be determined on individual interests based on project plans for each period.
BCB 660: Introduction to NextGen Sequencing Tentative class schedule Spring 2018
Week | Date | Day | Topic | Homework | |
---|---|---|---|---|---|
1 | 9-Jan | T | Introduction and Github/XSEDE/Slack setup/markdown | ||
11-Jan | R | Unix review | |||
2 | 16-Jan | T | No Class | ||
18-Jan | R | File formats | |||
3 | 23-Jan | T | Regular expressions | ||
25-Jan | R | Introduction & Sequencing Technologies | |||
4 | 30-Jan | T | QC and Contamination | Regular Expressions Assignment due | |
1-Feb | R | Alignment | |||
5 | 6-Feb | T | Alignment work day | QC and contamination Assignment due | |
8-Feb | R | RNA-Seq | Individual project plan 1 due | ||
6 | 13-Feb | T | RNA-Seq | Alignment Assignment due | |
15-Feb | R | RNA-Seq | |||
7 | 20-Feb | T | RNA-Seq | ||
22-Feb | R | Project time | RNA-Seq Assignment Due | ||
8 | 27-Feb | T | Project time | ||
1-Mar | R | Project time | Individual project plan 2 due | ||
9 | 6-Mar | T | Project Deadline 1 | 1 Project Report due for Interval 1 | |
8-Mar | R | Project time | |||
10 | 13-Mar | T | Spring Break | ||
15-Mar | R | Spring Break | |||
11 | 20-Mar | T | Project time | ||
22-Mar | R | Project time | Individual project plan 3 due | ||
12 | 27-Mar | T | Project Deadline 2 | 1 Project due for Interval 2 | |
29-Mar | R | Project time | |||
13 | 3-Apr | T | Project time | ||
5-Apr | R | Project time | Individual project plan 4 due | ||
14 | 10-Apr | T | Project Deadline 3 | 1 Projects due for Interval 3 | |
12-Apr | R | Project time | |||
15 | 17-Apr | T | Project time | ||
19-Apr | R | Project time | |||
16 | 24-Apr | T | Project Deadline 4 | 1 Projects due for Interval 4 | |
26-Apr | R | Project time | Final Project reports Due Finals Week | ||
17 | 1-May | T | Finals Week | ||
3-May | R |