Skip to main content

 Genomics_command_line_quiz1

For all projects, you may use your own Unix-based system and, where applicable, ensure that you are running the version of the software specified in the assignments. Alternatively, you may use the VMBox virtual machine environment provided with the course materials. Instructions on how to download and use the environment can be found on the course web site.


For the following questions, refer to the class workflow and use the data in the Online materials (‘gencommand_proj1_data.tar.gz’) to answer the questions. Assume you sequenced and assembled the genome of Malus domestica (apple), and performed gene annotation. You then collected samples and ran RNA-seq experiments to determine sets of genes that are expressed in the various tissues. This information was stored, respectively, in the following files: “apple.genome”, “apple.genes”, “apple.condition{A,B,C}”.


NOTE: The apple genome and the apple gene annotations for this project were extracted from the Rosaceae Genome Database (RGD). Actual data have then been modified, and hence may not directly reflect the information in the original RGD records.


1. How many chromosomes are there in the genome?

grep -c ">" apple.genome

## 3

2. How many genes?

cut -f1 apple.genes | sort -u | wc -l

##     5453

3. How many transcript variants?

cut -f2 apple.genes | sort -u | wc -l

##     5456

4. How many genes have a single splice variant?

cut -f1 apple.genes | uniq -c | grep " 1 " | wc -l

##     5450

5. How may genes have 2 or more splice variants?

cut -f1 apple.genes | uniq -c | grep -v " 1 " | wc -l

##        3

6. How many genes are there on the ‘+’ strand?

cut -f1,4 apple.genes | sort | uniq -c | grep "+" | wc -l

##     2662

7. How many genes are there on the ‘-’ strand?

cut -f1,4 apple.genes | sort | uniq -c | grep "-" | wc -l

##     2791

8. How many genes are there on chromosome chr1?

9. How many genes are there on each chromosome chr2?

10. How many genes are there on each chromosome chr3?

cut -f1,3 apple.genes | sort -u | cut -f2 | sort | uniq -c

## 1624 chr1

## 2058 chr2

## 1771 chr3

11. How many transcripts are there on chr1?

12. How many transcripts are there on chr2?

13. How many transcripts are there on chr3?

cut -f2,3 apple.genes | sort -u | cut -f2 | sort | uniq -c

## 1625 chr1

## 2059 chr2

## 1772 chr3

14. How many genes are in common between condition A and condition B?

cut -f1 apple.conditionA | sort -u > sortA

cut -f1 apple.conditionB | sort -u > sortB

comm -1 -2 sortA sortB | wc -l

##     2410

15. How many genes are specific to condition A?

comm -2 -3 sortA sortB | wc -l

##     1205

16. How many genes are specific to condition B?

comm -1 -3 sortA sortB | wc -l

##     1243

17. How many genes are in common to all three conditions?

cut -f1 apple.conditionC | sort -u > sortC

comm -1 -2 sortA sortB > AB_common

comm -1 -2 AB_common sortC | wc -l

##     1608

Comments

Popular posts from this blog

Immunotherapy

 

Genetic Mutations

  GENETIC MUTATIONS Genetic mutations  are spontaneous changes occurring in the genetic material, (DNA) of an individual. Mutations can be passed to the next generations or may not, depending upon the type of cell in which they occur.       They can be harmful, beneficial or even neutral. Usually these are caused naturally but they can be caused by harmful radiations or chemicals or drugs etc. Somatic cell’s mutations do not pass to the offspring. While the mutations in germ line cells are hereditary. The term mutation was given by Devries in 1909. Types of Mutations Two most basic categories of mutations are: Point mutations These mutations are those occurring only at certain points in chromosomes or the genes if it. For example, insertion, deletion etc. Chromosomal mutation More sever class of mutations is chromosomal mutations. It includes deletion of one or set of chromosome, or may be more number of chromosomes is present in genome than normal numbers....