Begin with a statement of a problem or question to be investigated, then introduce the data.
Following the introduction of the problem or question, try to provide motivation for the analysis to follow:
Describe here the models and methods used to analyse your data. If necessary, explain why the methods are appropriate and how you looked at the data to examine assumptions, justify transformations, etc. Be specific. (``MPG was transformed to -1/MPG, and log(PRICE) was used to achieve a linear relation'' rather than ``some variables were transformed''.) What methods were used to select your final model? Omit description of general aspects of statistics-- assume your reader knows what a multiple regression model is.
This section should also describe the scientific and statistical issues raised by the results described in the preceding sections. Suggestions for further analysis or other data are appropriate. Summarize (again) your conclusions about the scientific questions and back up your assertions with references to your Results: graphs, tables, etc.
All output should be labeled (e.g., Fig. 1, Table 10, or Exhibit 2) and referred to in the text (e.g., "see Exhibit 2"). Output should be summarized or compressed where possible (deleting uninformative material) rather than inserted whole, especially where it deals with subsidiary issues.
Actual level    Quality variable
------------    ----------------
1-2             1
3               0

Actual level    Bedroom# variable
------------    -----------------
0-2             0
3               1
4 or more       2

Actual level    Bedroom# variable
------------    -----------------
1               1
not 1           0
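The recoding in the tables above can be sketched as three small functions. This is only an illustration: the function names are hypothetical, and the third function is named generically because the source labels the third table "Bedroom#" as well.

```python
def recode_quality(level):
    """Actual quality levels 1-2 map to 1; level 3 maps to 0."""
    if level in (1, 2):
        return 1
    if level == 3:
        return 0
    raise ValueError(f"unexpected quality level: {level!r}")


def recode_bedrooms(count):
    """0-2 bedrooms map to 0; 3 bedrooms to 1; 4 or more to 2."""
    if count < 0:
        raise ValueError(f"unexpected bedroom count: {count!r}")
    if count <= 2:
        return 0
    if count == 3:
        return 1
    return 2


def recode_indicator(level):
    """Third table: actual level 1 maps to 1; anything else maps to 0."""
    return 1 if level == 1 else 0
```

Applying all three recodings gives the 2 by 3 by 2 = 12 group combinations used in the runs below.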
RUN=0 : One (or more) pilot studies with NOBS <= 4 observations for each group in your design. That is, randomly sample 4 observations for each group combination (2 by 3 by 2 = 12 group combinations in total) from the given real estate sales data and carry out a complete balanced analysis of variance. Also consider whether a transformation of the response variable is necessary. Use the pilot study to examine your choice of the sample size required for each group combination.
Note: The MSE from an analysis of variance of your pilot data and the maximum and minimum factor level means for a given factor can be used for the power and sample size analysis of that design.
RUN=1 : The main study, with a design determined from your pilot data. The number of observations per group should be chosen based on your power and sample size analyses of the pilot data. We would like to achieve a power of at least .80 for the three main effects and for all possible interactions. Randomly sample the determined number of observations from the given real estate sales data and carry out a complete balanced three-way analysis of variance. Also consider whether a transformation of the response variable is necessary.
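As a rough sketch of how the pilot MSE and the extreme factor level means feed a power calculation, the code below computes the noncentrality parameter lambda = sum of n_i (mu_i - mu_bar)^2 / sigma^2 for the F test of one main effect. It assumes the least favorable case, in which all other level means sit midway between the maximum and the minimum, so the sum of squared deviations equals (max - min)^2 / 2. The function names are illustrative, and the power itself still has to be read from noncentral F tables or statistical software.

```python
def obs_per_level(n_per_cell, levels_per_factor, factor):
    """Observations at each level of one factor in a balanced design."""
    total_cells = 1
    for k in levels_per_factor:
        total_cells *= k
    return n_per_cell * total_cells // levels_per_factor[factor]


def noncentrality(mse, max_mean, min_mean, n_per_level):
    """Noncentrality parameter for a main-effect F test.

    Assumption: the remaining level means lie at the midpoint of the
    range, which gives the smallest noncentrality for that range.
    """
    if mse <= 0:
        raise ValueError("MSE must be positive")
    delta = max_mean - min_mean
    return n_per_level * delta ** 2 / (2 * mse)


# Example: 2 by 3 by 2 design, 4 observations per cell, first factor.
n_level = obs_per_level(4, (2, 3, 2), factor=0)
lam = noncentrality(mse=4.0, max_mean=10.0, min_mean=6.0, n_per_level=n_level)
```

Increasing the per-cell sample size until the resulting power reaches the target is one way to choose the design for the main study.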
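For readers who want to check their software output, here is a minimal sketch of the sums of squares and F ratios in a balanced three-way analysis of variance. It assumes the data are held in a dictionary keyed by (factor 1, factor 2, factor 3) level tuples, which is a layout chosen only for this example. It does not compute p-values; those come from the F distribution with the reported degrees of freedom.

```python
from itertools import combinations
from statistics import mean


def balanced_three_way_anova(cells):
    """cells maps (a, b, c) level tuples to equal-length lists of responses."""
    sizes = {len(ys) for ys in cells.values()}
    if len(sizes) != 1:
        raise ValueError("design is not balanced")
    n = sizes.pop()
    if n < 2:
        raise ValueError("need at least 2 observations per cell")
    levels = [sorted({key[i] for key in cells}) for i in range(3)]
    n_cells = len(levels[0]) * len(levels[1]) * len(levels[2])
    if len(cells) != n_cells:
        raise ValueError("every factor-level combination needs data")
    total_n = n * n_cells
    grand = mean(y for ys in cells.values() for y in ys)

    # Between-group sum of squares for every subset of the three factors.
    between = {}
    for r in (1, 2, 3):
        for factors in combinations(range(3), r):
            groups = {}
            for key, ys in cells.items():
                groups.setdefault(tuple(key[i] for i in factors), []).extend(ys)
            per_group = total_n // len(groups)
            between[factors] = per_group * sum(
                (mean(ys) - grand) ** 2 for ys in groups.values()
            )

    # Within-cell (error) sum of squares and mean square.
    sse = sum((y - mean(ys)) ** 2 for ys in cells.values() for y in ys)
    df_error = total_n - n_cells
    mse = sse / df_error
    if mse == 0:
        raise ValueError("no within-cell variation; F ratios are undefined")

    # In a balanced design each effect SS is its between-group SS minus
    # the SS of every lower-order effect contained in it.
    table = {}
    for r in (1, 2, 3):
        for factors in combinations(range(3), r):
            ss = between[factors] - sum(
                table[sub]["ss"]
                for k in range(1, r)
                for sub in combinations(factors, k)
            )
            df = 1
            for i in factors:
                df *= len(levels[i]) - 1
            table[factors] = {"ss": ss, "df": df, "F": (ss / df) / mse}
    return {"effects": table, "mse": mse, "df_error": df_error}
```

Effects are keyed by factor positions, so `(0,)` is the first main effect and `(0, 1, 2)` is the three-way interaction.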
RUN=2 : A replication study - must include the same overall design as the main study, but you may use different sample sizes. The data should be sampled from the given real estate sales data excluding the data used in RUN=1. The purpose of the replication study is simply to determine if your findings from RUN=1 hold up when the study is conducted again. Ideally, the significant findings from your main study should appear again in the replication.
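The balanced sampling for the runs above, including the exclusion of RUN=1 observations from the replication, might be sketched as follows. The record keys `id`, `quality`, `bedrooms`, and `third` are hypothetical names for a row identifier and the three recoded factors; they do not come from the data set itself.

```python
import random


def draw_balanced_sample(records, n_per_cell, exclude_ids=(), seed=None):
    """Draw n_per_cell records at random from every factor-level cell.

    records: list of dicts with hypothetical keys "id", "quality",
    "bedrooms", and "third". Records whose id is in exclude_ids are
    never drawn, which keeps a replication sample disjoint from the
    main-study sample.
    """
    rng = random.Random(seed)
    excluded = set(exclude_ids)
    by_cell = {}
    for rec in records:
        if rec["id"] in excluded:
            continue
        cell = (rec["quality"], rec["bedrooms"], rec["third"])
        by_cell.setdefault(cell, []).append(rec)
    sample = []
    for cell in sorted(by_cell):
        pool = by_cell[cell]
        if len(pool) < n_per_cell:
            raise ValueError(f"cell {cell} has only {len(pool)} records")
        sample.extend(rng.sample(pool, n_per_cell))
    return sample
```

A replication draw would pass the ids of the main-study sample as `exclude_ids`. If a cell runs short after exclusion, the function raises an error, which signals that a smaller per-cell sample size is needed for that run.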
RUN=3: As a comparison, assume that the sample sizes do not reflect the importance of the treatment means. Carry out a complete unbalanced three-way analysis of variance of the entire real estate sales data using the same response variable and three predictor variables discussed above. The analysis should consider transformations of the response variable.
RUN=4: As another comparison, assume that the sample sizes reflect the importance of the treatment means. Carry out a complete unbalanced three-way analysis of variance of the entire real estate sales data using the same response variable and three predictor variables discussed above. The analysis should consider transformations of the response variable.
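One common reading of the distinction between RUN=3 and RUN=4 is whether factor level means are formed as unweighted averages of the cell means or as averages weighted by the cell sample sizes. The sketch below illustrates only that difference, under this assumed interpretation. It is not a full unbalanced analysis of variance, and the dictionary layout keyed by (factor 1, factor 2, factor 3) levels is chosen for illustration.

```python
from statistics import mean


def factor_level_means(cells, factor, weighted):
    """Average the cell means within each level of one factor.

    weighted=False: every non-empty cell counts equally (sample sizes
    do not reflect importance).
    weighted=True: each cell counts in proportion to its sample size
    (sample sizes reflect importance).
    """
    sums = {}
    for key, ys in cells.items():
        if not ys:
            continue  # an empty cell has no mean to contribute
        weight = len(ys) if weighted else 1
        total, total_weight = sums.get(key[factor], (0.0, 0))
        sums[key[factor]] = (total + weight * mean(ys), total_weight + weight)
    return {level: total / w for level, (total, w) in sums.items()}
```

When the design is balanced the two versions agree, so any difference between them in the full data reflects the unequal cell sizes.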
The main objective is to document how you designed the study and teased out the results. The most important aspect of the project write up is your description of what you did (and why) and your summary of the results. Remember: PLEASE DO NOT HAND IN MORE THAN 15 PAGES OF COMPUTER OUTPUT.
The following outline suggests the topics to focus on in your writeup.
Discuss consistencies and inconsistencies with run 1.
Discuss consistencies and inconsistencies with run 1.