Need Help with this Question or something similar to this? We got you! Just fill out the order form (follow the link below), and your paper will be assigned to an expert to help you ASAP.
Assignment Task :
Introduction
This assignment is due on Thursday, December 5, 2019, at 11:59 pm. Each student is assigned to an individual database, with a single file containing the data. Each file contains one dependent variable and twenty independent variables. The values of the dependent variable are in the Y column (first column on the left). The values of the twenty-four independent variables are in the columns with names of El to E4 and G1 to G20. There are no missing values; that is, the data file is complete and needs no further processing. This project is worth up to 150 points. Failure to use the correct dataset will lead to a grade of zero. The data sets are named by the last five digits of your Stony Brook University ID as a paid file. The datasets will be posted in a zip format on the class blackboard.
Background
The class blackboard has a pdf file of a paper by Coapi et al. that reports a finding of gene-environment interaction. This paper used multiple regression techniques as the methodology for its findings. You should read it for background, as it is the genesis of the models that you will be given. The data that you are analyzing is synthetic. That is, the TA used a model to generate the data. Your task is to find the model that the TA used for your data. For example, one possible model is Yi = (500 + 5E1, + 25G2, + 50E31G41 +100G5,G6, + 2Z, )2
The class blackboard also contains a paper by Risch et al. that uses a larger collection of data to assess the findings in Caul et al. These researchers confirmed that gm’ et al. calculated their results correctly but that no other dataset had the relation reported in gaspi et al. That is, gm’ et al. seem to have reported a false positive.
Report
The report that you submit should be no more than 2500 words with no more than 3 tables and 2 figures. It should include references (which do not count in the 2500 words). The report may have a technical appendix. The appendix could include your computer programs or describe your procedures for computation. You should include whatever additional material you feel is necessary to report your results in the technical appendix. There are no length restrictions on the appendix. The submission of only computer output without a report is not sufficient and will receive a grade of zero. Analyses that report an incorrect number of observations will also receive a grade of zero.
Your report should be in a standard scientific report format. It should contain an introduction, methods section, results section, and a section with conclusions and discussions. You may add whatever other material you wish in a technical appendix. The introduction should contain the statement of your problem (namely estimating the function that the TA used to generate your data). It should discuss the context of finding GxE interactions, as given by Campi et al. and others. The methods section should discuss how you performed your statistical calculations, what independent variables that you considered, and other methodological issues, such as how you dealt with interaction variables. The results section should contain an objective statement of your findings. That is, it should contain the statement of the model that your group proposes for the data, the analysis of variance table for this model, and other key summary results. The discussion and conclusion section should include the limitations of your procedures. The class blackboard has an editorial (by Cummings) that discusses reporting statistical information.
Guidelines for analysis
The first task for this problem is to use the statistical package of your choice to find the correlations between the independent variables and the dependent variable. Transformations of variables may be necessary. The Box-Cox transformation may find potentially nonlinear transformations of a dependent variable. After selecting the trans-formations of the dependent variable, you may use stepwise regression methods to select the important independent variables. The Lasso technique was helpful to many groups in the past semesters. The TA will usually use at most two-way interactions of the independent variables (that is, terms like E1 G2 or G 3G 4 ) in generating your data. There may also be E2 E° 5 non-linear environmental variables, such as 3 or 4. The TA may well have used three-factor interactions in the models for a few of the groups.
This Report Writing Assignment has been solved by our Report Writing Experts at TVAssignmentHelp. Our Assignment Writing Experts are efficient to provide a fresh solution to this question. We are serving more than 10000+ Students in Australia, UK & US by helping them to score HD in their academics. Our experts are well trained to follow all marking rubrics & referencing style.
Be it a used or new solution, the quality of the work submitted by our assignment experts remains unhampered. You may continue to expect the same or even better quality with the used and new assignment solution files respectively. There’s one thing to be noticed that you could choose one between the two and acquire an HD either way. You could choose a new assignment solution file to get yourself an exclusive, plagiarism (with free Turnitin file), expert quality assignment or order an old solution file that was considered worthy of the highest distinction.