BIOE591: Preliminary Assessment
This course is aimed at graduate students who plan to work with high-throughput, short-read DNA sequence data. I therefore assume a solid background in genetics as it relates to your chosen field. I further assume some proficiency with computers and prior exposure (even if limited) to the basics of scientific programming in R or Python. To help me adjust my instruction to your needs and current skill set, please answer the following questions in a separate document and email me your answers. (Your answers will not be graded for correctness, but completing the assessment will count towards your participation grade.)
Genetics
What is the Central Dogma?
What is DNA? What are the four nucleotides?
What is RNA? What are the four ribonucleotides?
Where do you find DNA in an organism?
What is an allele, and what is an allele frequency? How would you calculate an allele frequency with DNA sequence data?
What is a genetic marker?
Where does genetic variation come from?
What forces impact patterns of genetic variation?
What is a genome?
How big is a mammal genome (to an order of magnitude)?
What is ploidy? Are humans diploid, haploid, or tetraploid?
What is recombination?
What is linkage disequilibrium?
What are three types of mutations?
What is population genetic structure / population subdivision?
Computer Skills
What kind of computer do you have, and what operating system does it run?
What is memory? How much memory does your computer have?
What is storage? How much storage space does your computer have?
Create a folder (or directory) for this course and provide its full path in your answers.
Pick a recent research project, class assignment, or work task that required you to create a directory. How did you structure its contents?
Have you ever used the command line?
What is a computing cluster, and have you ever used one?
What programming languages have you used, if any?
What is a package (or library)?
Have you ever written a function?
What is version control?
Have you used Git or GitHub?
What does it mean for an analysis to be reproducible?
Have you ever archived data?
What is open source software?
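If you have not created a folder from code before, the folder-creation item in this section can be done in a few lines of Python. This is only a minimal sketch: the function name `make_course_dir` and the default folder name `BIOE591` are illustrative assumptions, not course requirements, and creating the folder through your file browser or the command line is just as acceptable.

```python
from pathlib import Path


def make_course_dir(parent: Path, name: str = "BIOE591") -> Path:
    """Create a folder called `name` inside `parent` and return its absolute path.

    Both the function name and the default folder name are hypothetical;
    choose whatever fits your own file organization.
    """
    course_dir = parent / name
    # parents=True creates any missing intermediate folders;
    # exist_ok=True makes the call safe to repeat if the folder already exists.
    course_dir.mkdir(parents=True, exist_ok=True)
    # resolve() returns the full (absolute) path, which is what to report.
    return course_dir.resolve()
```

For example, `print(make_course_dir(Path.home()))` would create the folder in your home directory and print the path to include in your answers.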
Open
What are you hoping to get out of this class?
Do you have your own dataset, or will you be working with an example?
What kind of questions do you want to address with data?
What are you worried about in this class, if anything?
Is there anything else I should know to help you have a good semester?