This material may consist of step-by-step explanations on how to solve a problem or examples of proper writing, including the use of citations, references, bibliographies, and formatting. This material is made available for the sole purpose of studying and learning - misuse is strictly forbidden.
# Problem 1
# On the Golub et al. (1999) data set, we consider the correlation between the Zyxin gene
# expression values and each of the gene in the data set
#  3051 38
golub <- data.frame(golub)
gol.fac <- factor( golub.cl, levels=0:1, labels=c("ALL","AML"))
# (a)How many of the genes have correlation values less than negative 0.5? (Those genes are
# highly negatively correlated with Zyxin gene).
#  "4847" "Zyxin" "X95735_at"
correlations <- apply(golub,1,cor, as.numeric( golub[2124,] ))
correlations.less.than.05 <- correlations < 0.5
#  2941
# 2941 gnes
# (b)Find the gene names for the top five genes that are most negatively correlated with
# Zyxin gene.
o <- order(correlations)
#  "Macmarcks"
#  "Inducible protein mRNA"
#  "C-myb gene extracted from Human (c-myb) gene, complete primary cds, and five complete alternatively spliced cds"
#  "Oncoprotein 18 (Op18) gene"
#  "54 kDa protein mRNA"...
This is only a preview of the solution. Please use the purchase button to see the entire solution