Stats4us     

 

Exercises
   

Home   │   Services      Feedback      Links   │  Contact      Your Queries

 

 

 

Tutorial sessions may at the student's request include practice at analysing data from research projects, or answering typical applied statistics exam questions and set exercises.

 

This page provides a small sample of the style of practice exam or exercise questions, data analysis and applications which Joan Osborne has prepared and compiled for student learning  purposes (tutoring and short courses), primarily in the field of Biostatistics (e.g. Quantitative Biology).  Many of the exercises below refer to MINITAB printouts, but are also available for students using SPSS.

 

IMPORTANT NOTE:  The questions below are designed for educational and instructional purposes only. With the exceptions of Question 7 (see acknowledgement) and Question 11, the practical research studies and data presented are entirely fictional and of Joan's invention. - They simulate hypothetical 'real world' cases, but are NOT based on actual studies and results. They have NO research validity. 

 

 

    18 EXAMPLE QUESTIONS: Q.1 Q.4 Q.7 Q.10 Q.13 Q.16  
       (on this page) Q.2 Q.5 Q.8 Q.11 Q.14 Q.17

Bottom of page

  Q.3 Q.6 Q.9 Q.12 Q.15 Q.18

 

 

 

Example Question 1 

                                           

 

Pogona vitticeps 

( photo from: http://www.angelfire.com/pq/beardeddragon/Photos )

The following data (see Table) give the abundance over four seasons of the Inland Bearded Dragon Pogona vitticeps.  This species of bearded dragon lives in semi-arid woodland and desert shrubland of the eastern interior of Australia.  It is  omnivorous, feeding on insects, smaller lizards, vegetable matter and flowers.   It is a "sun-loving" species and breeding follows good winter rains and associated abundant food supplies, followed by warmer spring months. 

The data were collected over a 12-month period, with a 10-day trapping period in each season.  The Table provides the number of individuals trapped in each season.  To determine male and female individuals, scale markings were used. Juveniles were identified by a shorter body length (snout to vent).

Test the null hypothesis that the ratio of the numbers of males, females and juveniles remains constant over all four seasons (use a = 0.05).  You must include the alternative hypothesis, and comment on the outcome from the statistical testing.               

 

TABLE:  Numbers of Bearded Dragon Pogona vitticeps over seasons;

 male female and juvenile individuals.

 

Season

Category

Spring

Summer Autumn

Winter

Male 64 66 75 45
Female 86 89 85 50
Juveniles 80 95 20 5

 

Return to Top of page

 

 

Example Question 2 

 

 

Freshwater crab, Austrothelphusa transversa

( photo from: http://www.amonline.net.au/sand/news/freshwater_crabs.htm)

The data in the table below come from a study of water loss in the fresh water crab, Austrothelphusa transversaThis species is widespread in the arid and semi-arid areas of Australia. To escape the dry weather, these crabs construct burrows in clay soils which they sometimes seal with mud. The burrows are usually around 60 cm long and are found in areas which experience annual flooding. The humid air trapped inside the burrows gives the crabs enough moisture to survive until wet weather returns.  Following heavy rain, the crabs often appear in large numbers in the claypans of river floodplains.

For this study the crabs were weighed and then kept at different relative humidities, and weighed again after six days of starvation.  Weight loss in milligrams was computed for each crab. 

 

Weight

loss in mg

(Y)

Percent

relative humidity

(X)

8.98

0.0

8.14

12.0

6.67

29.5

6.08

43.0

5.90

53.0

5.83

62.5

4.69

75.5

4.20

85.0

3.72

93.0

The purpose of the statistical analysis of these data is to establish whether a linear relationship exists between relative humidity and weight loss. 

(a)     Plot these data points on the provided graph paper.  Do not join the points.

(b)     Refer to the MINITAB printout below.  Write out the regression equation and then accurately draw this

         regression line onto the graph paper.

 

(c)     Is the null hypothesis Ho: b = 0 rejected, testing at a = 0.05?  Using information from the MINITAB printout

         below, elaborate on this testing procedure.

(d)        Formulate a conclusion for the study.

 

The following MINITAB printout was obtained:

 

MTB > PRINT C1 C2                       Name c1 = 'WTmg'  c2 = 'PERC'

 ROW  WTmg  PERC

   1   8.98    0.0

   2   8.14   12.0

   3   6.67   29.5

   4   6.08   43.0

   5   5.90   53.0

   6   5.83   62.5

   7   4.69   75.5

   8   4.20   85.0

   9   3.72   93.0

 

MTB > MEAN C1

   MEAN    =      6.0233

MTB > MEAN C2

   MEAN    =      50.389

MTB > DESCRIBE C1 C2

                N     MEAN   MEDIAN   TRMEAN    STDEV   SEMEAN

WTMG            9    6.023    5.900    6.023    1.736    0.579

PERC            9     50.4     53.0     50.4     32.2     10.7

 

              MIN      MAX       Q1       Q3

WTMG        3.720    8.980    4.445    7.405

PERC          0.0     93.0     20.8     80.3

 

MTB > NOBRIEF

MTB > REGRESS C1 1 C2

 

The regression equation is

WTmg = 8.70 - 0.0532 PERC

 

Predictor       Coef       Stdev    t-ratio        P

Constant      8.7036      0.1916      45.44    <0.001

PERC       -0.053192    0.003256     -16.34    <0.001

 

s = 0.2967      R-sq = 97.4%     R-sq(adj) = 97.1%

 

Analysis of Variance

SOURCE       DF          SS          MS         F        P

Regression    1      23.488      23.488    266.89    <0.001

Error         7       0.616       0.088

Total         8      24.104

 

Obs.    PERC      WTMG       Fit Stdev.Fit  Residual   St.Resid

  1      0.0    8.9800    8.7036    0.1916    0.2764      1.22 

  2     12.0    8.1400    8.0653    0.1594    0.0747      0.30 

  3     29.5    6.6700    7.1345    0.1200   -0.4645     -1.71 

  4     43.0    6.0800    6.4164    0.1018   -0.3364     -1.21 

  5     53.0    5.9000    5.8844    0.0992    0.0156      0.06 

  6     62.5    5.8300    5.3791    0.1065    0.4509      1.63 

  7     75.5    4.6900    4.6876    0.1283    0.0024      0.01 

  8     85.0    4.2000    4.1823    0.1499    0.0177      0.07 

  9     93.0    3.7200    3.7568    0.1704   -0.0368     -0.15 

 

MTB > CORR C1 C2

Correlation of WTmg and PERC = -0.987
 

Return to Top of page

 

 

Example Question 3 

 

 

Zebra Finch (male and female)

( photo from: http://www.birdsinbackyards.net )

Birds exhibit adaptations which may be related to the type of food they eat.  Birds from different taxonomic families, and even birds within the same family, exhibit different food preferences.

For a University third-year Biology project, a student studied the zebra finch, Taeniopygia guttata.  The student was to determine whether birds fed preferentially on particular seed types.  For one recording period, six (6) individuals were placed in an enclosure and observed for a 90-minute period.  Six seed types, each in excess of all the birds’ requirements, were placed randomly in separate trays in the enclosure.

The frequencies with which the birds chose different seed types were recorded, and are listed below:

Seed Type

Observed
Frequency
(O)

hulled oats

12

grey millet

28

panicum

32

sunflower

14

white millet

31

sorghum

15

TOTAL

132

(a)     State appropriate hypotheses and then, at a = 0.05, test whether seed preferences exist for the zebra finch. 

(b)     Complete the appropriate statistical test.

(c)     Formulate a conclusion for this preliminary investigation.

(d)     For additional marks, complete appropriate subdivisions on the analysed data.  Comment on the findings.

 

Return to Top of page

 

 

Example Question 4 

 

 

Muja Coal Mine at Collie

( photo from: http://www.griffincoal.com.au/Mining.html )

A collaborative research team of microbiologists and geochemists investigated the potential for sulphate-reducing bacteria to mediate acid mine drainage in the Collie (WA) coal basin.  One laboratory experiment was conducted to test the effects of pH on the growth of one strain of sulphate-reducing bacteria.  The amount of bacterial growth was assessed visually and the culture tubes were ranked from:

   1                 to               7         

least growth                 most growth

 

 pH

Amount of bacteria growth

RANK VALUE

2.0 2
2.5 1
3.0 3
3.5 4
4.0 5
4.5 7
5.0 6

 

(a)     Using the MOST APPROPRIATE method to establish whether there was a correlation between pH

         and bacteria growth (test at a = 0.05), list the null and alternative hypotheses.

(b)        Formulate a conclusion for this experiment.

 

Return to Top of page

 

 

Example Question 5 

                                           

 

The spread of 'die-back disease' through jarrah forest

( photo from:  http://www.naturebase.net/content/view/902/963 )

The spread of Phytophthora cinnamomi infection, commonly known as  'die-back' or 'Phytophthora root-rot disease', is having a devastating impact on native plant species in the southwest of Western Australia.

In Eucalyptus species the symptoms of the infection resemble those of plant water stress.  Increased leaf diffusive resistance is a characteristic symptom of a water-stressed plant.  An increase in leaf diffusive resistance indicates that stomatal pores are closing, in an attempt to conserve water.

         Little is known about the effect of P. cinnamomi on hybrid Eucalyptus species.  A plant physiologist obtained sixteen (16) E. marginata x E. patens seedlings and decided to investigate the effect of P. cinnamomi inoculation on these plants.  The plant physiologist also planned to investigate whether increased lime levels in the soil led to reduced P. cinnamomi infection of these hybrids. 

Thus, the design for this investigation consisted of two independent variables:

FACTOR A:        P. cinnamomi inoculum condition, with two levels of this variable.

         FACTOR B:        soil lime condition, with two levels of this variable.

Each treatment condition was replicated FOUR (4) times.  The dependent variable was leaf diffusive resistance, measured in seconds per square centimetre (s cm-2).

The results from this investigation are recorded below:

FACTOR B:

Soil lime condition

 

 

FACTOR A:

P. cinnamomi  inoculum

No inoculum

 

Inoculum introduced

to soil

No lime added to soil

 

 

23.9 (s cm-2)

88.5 (s cm-2)

25.3

77.5

24.1

74.3

15.9

97.9

Lime added to soil

 

 

26.2 (s cm-2)

31.2 (s cm-2)

14.3

29.3

18.9

18.8

22.2

38.8

 

A two-way ANOVA with replication was used to analyse these data, and a summary table of this analysis is provided below, with a tabulation of the main effects and interaction means.

 

(a)            Formulate null and alternative hypotheses for testing for normality and homogeneity of variances.  Using the first of the MINITAB printouts below, indicate acceptance or rejection at a = 0.05. Be sure you indicate what information from the printout gave acceptance or rejection of H0.

(b)           Complete all appropriate null and alternative hypotheses for testing for differences between means (Factor A and factor B) and the interaction effect, then indicate acceptance or rejection at a = 0.05 using the second of the MINITAB printouts below.  Be sure you indicate what information from the printout gave acceptance or rejection of H0.

(c)            Provide a plot of the ANOVA interaction between the two FACTORS (A and B) – graph paper is provided.

 

(d)                 Using information from the analysis of variance summary, the interaction plot and relevant mean s cm-2, interpret these results. 

(e)            Scheffé tests are NOT required. Explain why such testing is not completed here?

 

 

MINITAB printout for Phytophthora cinnamomi trial follows :

 

Test for Equal Variances: s/cm-2 versus INOCULUM, LIME

 

95% Bonferroni confidence intervals for standard deviations

 

INOCULUM  LIME  N    Lower    StDev    Upper

       0     0  4  2.12409   4.3112  25.8539

       0     2  4  2.48536   5.0445  30.2511

       1     0  4  5.31092  10.7795  64.6432

       1     2  4  4.06190   8.2443  49.4404

  

Bartlett's Test (normal distribution)

Test statistic = 2.75, p-value = 0.433

  

Levene's Test (any continuous distribution)

Test statistic = 1.67, p-value = 0.226

 

 
 

MTB > ANOVA 'S/CM' = INOCULUM ! LIME;

SUBC>   Means INOCULUM ! LIME.

 

Analysis of Variance (Balanced Designs)

 

Factor     Type Levels Values

INOCULUM  fixed      2     0     1

LIME      fixed      2     0     2

 

Analysis of Variance for s/cm-2

 

Source            DF         SS         MS       F      P

INOCULUM           1     5094.4     5094.4   89.30  <0.001

LIME               1     3240.5     3240.5   56.80  <0.001

INOCULUM*LIME      1     2822.3     2822.3   49.47  <0.001

Error             12      684.6       57.0

Total             15    11841.7

 

    MEANS

o = no inoculum (i.e. no P. cinnamomi),
1 = inoculum (P. cinnamomi present)

 INOCULUM   N     s/cm-2

        0   8    21.350

        1   8    57.037

 

o = no lime,
2 = lime present

 LIME    N     s/cm-2

     0   8    53.425

     2   8    24.962

 

Interaction Means

 INOCULUM LIME    N      s/cm-2

        0     0   4    22.300

        0     2   4    20.400

        1     0   4    84.550

        1     2   4    29.525

 

MTB > Table 'P. cinn' 'Lime';

SUBC>   Means 's/cm-2';

SUBC>   N 's/cm-2'.

 

Tabulated Statistics

ROWS: P. cinn    COLUMNS: Lime

 

           0        2      ALL

 

  0   22.300   20.400   21.350

           4        4        8

 

  1   84.550   29.525   57.038

           4        4        8

 

 ALL  53.425   24.962   39.194

           8        8       16

   CELL CONTENTS --

            s/cm-2:MEAN

                  N

Return to Top of page

 

 

Example Question 6 

                                           

 

( photo from:  http://www.co.rockland.ny.us/health/FCH/hhc.htm  )

 

In a study of the length of time spent on individual home visits by public health nurses, data were reported on length of home visit, in minutes, by a sample of 80 nurses (see Table below).  A record was made of each nurse’s age and the type of illness of each patient visited.  The researchers wished to obtain from their investigation answers to the following questions, using a two-way ANOVA at a = 0.05,

  1. Does the mean length of a home visit differ among the different age groups of nurses?

  2. Does type of patient affect the mean length of home visit?

  3. Is there an interaction between nurse’s age and type of patient?

TABLE:    Length of home visit in minutes by Public Health Nurses of different age groups,
for four different Patient Types.

FACTOR B:
Type of Patient

FACTOR A:
Nurse’s Age Group in Years

20-29 30-39 40-49 50 and over

            Cardiac

            (Code 1)

 

20 23 24 28
25 27 28 31
22 26 24 26

27

25

25

29

21

27

30

32

 

            Cancer

 

            (Code 2)

 

30

36

39

40

45

35

42

45

30

37

36

50

35

26

42

45

36

36

40

60

 

            HIV

 

            (Code 3)

 

31

33

41

42

30

36

45

50

40

31

40

40

35

41

40

55

30

31

35

45

 

            Tuberculosis

 

            (Code 4)

 

20

19

24

29

21

21

25

30

20

24

30

28

20

26

26

27

19

27

23

30

 

A two-way ANOVA with replication was used to analyse these data, and provided below are the summary table of this analysis, plus the table of means, and the interaction means plot.

 

(a)   Testing for Homogeneity of Variance was completed for this data set, prior to any ANOVA analysis. 

       Use the information below to determine whether the null hypothesis of variance equality is

       accepted or rejected.  State the test statistic and indicate the basis of acceptance or rejection of

       the null hypothesis.

 

(b)   Now complete all appropriate null and alternative hypotheses for testing of main effects difference

       between means and interaction, and indicate acceptance or rejection at a = 0.05 (see below)

       – stating how you arrived at these conclusions.

 

(c)   Complete Scheffé (a = 0.05) testing where appropriate.  You should show the two tables associated

       with each set of Scheffé calculations, the first set will show “calculations”, the second table will

       appropriately tabulate means providing underscoring and (or) like letters.

 

(d)   Using information from the analysis of variance summary, the Scheffé testing, and the interaction

       graph, interpret the results.

 

Tabulated Statistics

 ROWS: AGE     COLUMNS: ILLNESS

 

        Cardiac    Cancer     HIV    Tuberculosis      ALL

 

 20-29    23.000   35.200   33.200      20.000        27.850

           5        5        5            5            20

 

 30-39    25.600   34.000   34.400      23.400        29.350

           5        5        5            5            20

 

 40-49    26.200   39.800   40.200      25.600        32.950

           5        5        5            5            20

 

 50 and   29.200   48.000   46.400      28.800        38.100

  over       5        5        5         5             20

 

 ALL      26.000   39.250   38.550   24.450           32.063

             20       20       20         20           80

 

  CELL CONTENTS --

             TIME (minutes):MEAN

                             N

 

 

Worksheet was saved on  26/3/2006

General Linear Model

 

Factor    Levels    Values

AGE           4    20    30    40    50

ILLNESS       4     1     2     3     4

 

Analysis of Variance for TIME (minutes)  

 

Source        DF     Seq SS     Adj SS     Adj MS       F      P

AGE            3    1246.84    1246.84     415.61   26.24  < 0.001

ILLNESS        3    3769.04    3769.04    1256.35   79.33  < 0.001

AGE*ILLNESS    9     211.21     211.21      23.47    1.48    0.174

Error (Within) 64    1013.60    1013.60      15.84

Total         79    6240.69 

 

Unusual Observations for TIME   

 

Obs.     TIME       Fit Stdev.Fit  Residual   St.Resid

  7   45.0000   35.2000    1.7797    9.8000      2.75R

 29   26.0000   34.0000    1.7797   -8.0000     -2.25R

 66   40.0000   48.0000    1.7797   -8.0000     -2.25R

 70   60.0000   48.0000    1.7797   12.0000      3.37R

 74   55.0000   46.4000    1.7797    8.6000      2.42R

 

R denotes an obs. with a large st. resid.

Means for TIME   

 

AGE                     Mean     Stdev

 20-29                 27.85    0.8899

 30-39                 29.35    0.8899

 40-49                 32.95    0.8899

 50 and over           38.10    0.8899

ILLNESS

 1  cardiac            26.00    0.8899

 2  cancer             39.25    0.8899

 3  HIV                38.55    0.8899

 4  tuberculosis       24.45    0.8899 

AGE*ILLNESS

 20       1     23.00    1.7797

 20       2     35.20    1.7797

 20       3     33.20    1.7797

 20       4     20.00    1.7797

 30       1     25.60    1.7797

 30       2     34.00    1.7797

 30       3     34.40    1.7797

 30       4     23.40    1.7797

 40       1     26.20    1.7797

 40       2     39.80    1.7797

 40       3     40.20    1.7797

 40       4     25.60    1.7797

 50       1     29.20    1.7797

 50       2     48.00    1.7797

 50       3     46.40    1.7797

 50       4     28.80    1.7797

 

 

FIGURE 1:   Two interaction representations for the four different age groups and  four different

Patient Types - Length of home visit in minutes by Public Health Nurses.

 

MTB > %Vartest 'TIME' 'AGE' 'ILLNESS';

SUBC>   Confidence 95.0.

Executing from file: Vartest.MAC

 

Macro is running ... please wait

 

Homogeneity of Variance

 

Response TIME

Factors  AGE  ILLNESS

ConfLvl  95.0000

 

Bonferroni confidence intervals for standard deviations

 

  Lower     Sigma     Upper   n  Factor Levels

 

 1.39486   2.91548   17.2742  5  20  1

 2.93760   6.14003   36.3797  5  20  2

 2.06892   4.32435   25.6218  5  20  3

 0.33830   0.70711    4.1896  5  20  4

 0.80057   1.67332    9.9144  5  30  1

 2.16620   4.52769   26.8266  5  30  2

 2.01852   4.21900   24.9976  5  30  3

 1.60828   3.36155   19.9172  5  30  4

 1.28377   2.68328   15.8985  5  40  1

 1.19129   2.48998   14.7531  5  40  2

 1.70500   3.56371   21.1150  5  40  3

 1.29266   2.70185   16.0085  5  40  4

 1.14225   2.38747   14.1458  5  50  1

 3.62791   7.58288   44.9286  5  50  2

 2.92198   6.10737   36.1862  5  50  3

 0.62380   1.30384    7.7253  5  50  4

 

Bartlett's Test (normal distribution) 

Test Statistic: 31.488

P value       :  0.008

  

Levene's Test (any continuous distribution)  

Test Statistic: 0.962

P value       : 0.504

 

Return to Top of page

 

 

Example Question 7 

                                           

 

Tour de France

( photo from: http://www.velonews.com/images/int/5740.7135.f.jpg )

 

Acknowledgement  - The following text and data (excluding the exercises 'a' and 'b') are extracted from

"STATISTICS" by R.S. & J.S.Witte (Sixth Edition, Copyright © 2001 Harcourt, Inc.,  ISBN 0470001828;

pages 308, 326 & 337; and are used here by permission of John Wiley & Sons, Inc., Hoboken, New Jersey.

 

During the 1998 Tour de France, the world’s best-known bicycle race, some cyclists were expelled for attempting to enhance their performance by 'blood-doping' with the synthetic hormone erythroprotein, or EPO, which stimulates the production of oxygen-bearing (and fatigue-inhibiting) red blood cells.  An investigator wants to determine whether this type of blood-doping increases the endurance of athletes under controlled laboratory conditions. (We’ll ignore the very real medical, ethical and legal issues raised by this type of experimentation.)  Volunteer athletes from the local track team are randomly assigned to one of two groups: a blood-doped group, which receives a prescribed amount of EPO, and a non-blood-doped group, which receives a comparable amount of a harmless neutral substance.  Subsequently, after an appropriate interval of time has elapsed, each athlete runs on a rapid treadmill until exhausted.  Total time on the treadmill is used as the measure of endurance.

Preliminary studies of blood doping with EPO reveal an unexpected phenomenon: Lightweight athletes in both the blood-doped and non-blood–doped groups have better endurance scores than heavier athletes.  This factor complicates the search for the effect of blood-doping on endurance scores.

In subsequent experiments, therefore, it is advantageous to pair athletes with similar body weights.  Before collecting the data, the athletes are matched for body weight, beginning with the two lightest and ending with the two heaviest.  Once a member of a given pair has been randomly assigned to either the blood-doped or the non-blood-doped group, the other member of that pair is automatically assigned to the remaining group.  Thereafter, the athletes are treated in the same way as in the original blood-doping experiment; that is, endurance scores are obtained both for athletes who are blood-doped with EPO and for athletes who receive a harmless neutral substance.  The data presented in the Table below shows an Endurance Score (minutes) for the athlete subjects who have been paired on the basis of weight and randomly allocated to the groups.

       

 

TABLE:   Endurance Scores (minutes) for the athlete subjects running on a rapid tread mill.

 Blood Doped

Athletes

(minutes)

Non-Blood-Doped

Athletes

(minutes)

9

7

4

5

8

4

8

6

10

7

10

8

        

           (a)   Carry out the MOST SUITABLE statistical test (using a = 0.05) to determine whether the synthetic

                  hormone erythroprotein (EPO) has caused any change in the endurance of athletes.

           (b)   Summarise the findings of this experiment.

 

Return to Top of page

 

 

Example Question 8 

                                           

 

Tailings discharge into storage cell.

( photo from: http://www.tailings.info/deposition.htm )

At a tailings storage facility in the Goldfields region of Western Australia, an underground nickel mine was depositing tailings into a new cell of the storage.

The tailings monitoring bores on the western edge of the facility showed a range of readings, at the January 2006 sampling, indicating movement perhaps of materials from the storage into the ground water.  Previous readings from all bores had all been in the 25,000 – 35,000 mg/L range.  Hypersaline water was being used in the nickel extraction process with the residual slurry deposited into a tailings cell.

The five bores had six readings taken from each bore, and the TDS (total dissolved solids) in mg/L are given below in the Table below.  The bore locations are depicted on Figure 1 below.

                 TABLE:    TDS in mg/L from five Tailings Dam Monitoring Bores (TDMB) at the nickel mine,

January 2006 sampling.

BORE 1

BORE 2

BORE 3

BORE 4

BORE 5

56,000

54,000

60,000

63,000

60,000

57,000

46,000

42,000

44,000

49,000

44,000

40,000

41,000

44,000

46,000

40,000

39,000

36,000

39,000

41,000

38,000

37,000

43,000

42,000

28,000

33,000

36,000

34,000

29,000

31,000

 
 

 

 

FIGURE 1:       Location of the tailings monitoring bores, at the nickel mine.

Statistical analyses are required to compare mean TDS (mg/L) of the five bores.

            i)     List the dependent and the independent variables.

           ii)     Formulate the appropriate null and alternative hypotheses for comparison between bores of mean TDS, and you will indicate acceptance or rejection at a = 0.05.

         iii)     Complete an analysis of variance (ANOVA) on these data, testing at a = 0.05, to compare mean TDS (mg/L) at the monitoring bores (TDMB).  Complete the analysis by hand showing all workings then check using MINITAB or SPSS.

Provide your MINITAB or SPPS hard copy printout, with appropriate labelling and description (LIS and MTW files [or SPPS files] will be on disc).

          iv)     If a difference between means exists (at a = 0.05) use Scheffé tests to elucidate where any difference between means are statistically significant.  You will manually complete the Scheffé tests, and provide the required two tables associated with the calculations:  One table gives calculations, the second table presents means.

           v)     Finally, formulate a written summary of this sampling using the information from the analysis of variance (ANOVA) summary, with any explanations you may see as being biologically / environmentally feasible.

 

Return to Top of page

 

 

Example Question 9 

                                           

 

'Yabby', Cherax destructor

( photo from: http://www.deh.gov.au/cgi-bin/species-bank/sbank-treatment2.pl?id=69119 )

 

An aquaculture research unit was invited to complete a study; one aspect was to consider growth (as measured by gm of edible flesh) of the introduced yabby Cherax destructor (Family Parastacidae), under different conditions.

 

Farmers in the York to Kelleberrin area of W.A. were approached, with five farms having appropriate dams for the stock – it was a requirement that each farm had four discrete dams which were “similar” (e.g. drainage, catchment, construction type, dam sealing structure).  However, it appeared likely that conditions (e.g. rainfall, soil type) would vary between farms over this 50-100 km radius, thus each LOCATION (farm) was considered a BLOCK in the experimental design.

 

There was a different experimental set-up in each dam, with the same set-ups on all farms.  The experimental set-ups are listed below:

·        Control, no shelter, no shade and no “salt” addition (“salt” has been documented as on acting as pathogens in the waters).

·        Shelter – here poly-piping has been inserted (the poly-piping has been suggested as providing shelter for the crustaceans, and also a “ready made” facility for protection).

·        Shelter + shade (shade trees planted around the dam were suggested as a environmental amelioration with perhaps lower water temperatures).

·        Shelter + shade + “salt” addition (“salt” has been document as acting as pathogens in the water).

In the study, weight of only one yabby from each location was recorded (statisticians had checked this was a valid procedure by sub-sampling a number of animals, and releasing them back into the dam).  Data are present in the Table  below.  It is noted that the experimental design implemented is a randomised block layout, with the five farm (LOCATIONS) blocks, and there are four experimental treatments which have been randomly allocated to the four dams on each farm.

 
 

 

TABLE:   Dry weight in gm, of yabbies (Cherax destructor; Family Parastacidae)

    under experimental set-ups in dams on wheat-belt W.A. farms.

Experimental Set-up (within One Dam)

 

LOCATION

Control

(no shelter, no shade,

no “salt” addition)

Shelter

(piping)

Shelter and

Shade

Shelter, Shade and “salt” addition

Red knob (York

10

58

63

68

Collins Hill (York)

11

55

68

63

Grass Valley (Northam)

6

45

48

47

Five Mile Hill (Northam)

2

50

41

43

Doodlakine (Kelleberrin)

5

37

39

40

            i)     List all the dependent and independent variables.

           ii)     Complete the appropriate analysis of variance on the data testing at a = 0.05.  The analysis of variance must be completed manually showing all workings.  Show the critical values from the Fcritical table (at a = 0.05) used to ascertain significance.  All appropriate hypotheses will have been presented and then acceptance or rejection is to be stated.

         iii)     You will have checked your calculations with MINITAB, and you will provide the MINITAB or SPSS printout. 

         iv)     Use the Scheffé test (a = 0.05), where appropriate, to compare mean dry weights for the experimental treatments.  You will manually complete the Scheffé tests, and provide the required two summary tables for any set of Scheffé testing.  Hypotheses will have been formulated, presented, and their acceptance or rejection given.

          v)     You are also to Scheffé test (a = 0.05) the “BLOCK” means, and establish where there are location differences. You will manually complete the Scheffé tests, and provide the required two summary tables for any set of Scheffé testing.  Hypotheses will have been formulated, presented, and their acceptance or rejection given.

         vi)     Provide a written summary and conclusion for this research trial.

 

Return to Top of page

 

 

Example Question 10 

                                           

 

Bluefin tuna.

( photo from:  http://members.tripod.com/gill_fisher/fsblue.jpg )

 

Southern Bluefin Tuna, Thunnus maccoyii, are being used for an exploratory aquaculture venture off the southern coast of Australia.  The researchers are asked to show the relative growth curve of Southern Bluefin Tuna to a State Fisheries Department for comment.

 

The relative growth curve of fishes is best represented using a log10\ log10 plot – both the horizontal and vertical axes are of log10 scale. 

Prepare the following two plots for presenting to the Fisheries Department, using either 4 cycle by 2 cycle, or 2 cycle by 2 cycle scales :

 

·        Total length (Y1) against standard length (X), and

·        Body length (Y2) against standard length (X)

 

 
 
 

 

 

TABLE 1:  Southern Bluefin Tuna, Thunnus maccoyii, data for relative growth curves, length (cm) n = 9.

Length in cm

Standard (X)

Body  (Y1)

Total (Y2)

37.6

20.4

43.8

44.8

24.1

52.0

54.1

29.0

62.6

64.5

34.8

74.3

74.1

40.7

84.3

84.0

46.6

95.9

93.1

51.5

105.1

106.0

59.5

121.0

116.0

64.5

132.5

 

Return to Top of page

 

 

Example Question 11 

                                           

 

 

Pseudomys sp.

 ( photo from :  http://www.zoo.latrobe.edu.au/Staff/pbf/marsupials.htm )

 

 

The following small mammal species and numbers of individuals were trapped on a control site of Spinifex grasslands with acacias, and on a rehabilitated iron-ore waste-dump site at Newman, Western Australia.

 

Animals were caught in pit-fall traps which had been dug into the ground, with 100 traps per site.  Trapping was at regular monthly intervals over two years.  The control site was 3 km north of the waste dump rehabilitation site.

i)                 Calculate the Shannon Index of Diversity (H), and J the Evenness or Equitability Index, for the Control Site and the Rehabilitation.

ii)                Use Sorenson’s Index of Similarity (ISs) to compare the two sites.

iii)              Represent these data graphically in a way that clearly shows differences between the two trap locations (the Control Site and the Rehabilitation), and species differences.

iv)              Using both the species listing (see Table) and the calculated diversity studies, comment on these data.

TABLE :   Species and numbers of small mammals trapped on the control and the rehabilitation,

Newman 1996-1998, recaptures are excluded (- not captured).

Scientific Name

Common Name

Control Site

Number of

each species

Rehabilitation

Number of

each species

EUTHERIA

 

 

 

ORDER RODENTIA

 

 

 

Mus musculus

House Mouse

5

56

Pseudomys hermannsbuergensis

Sandy Island Mouse

14

_

 

 

 

 

MARSUPIALIA

 

 

 

FAMILY DASYURIDAE

 

 

 

Antechinus rosamondae

 

21

_

Dasycercus cristicauda

Mulgara

18

4

Ningaui timealeyi

Pilbara Ningaui

105

14

Planigale maculata

Common Planigale

41

4

Pseudantechinus macdonnellensis

Fat-tailed Pseudantechinus

 18

 

 _

 

Source of data:  From original paper jointly authored by Joan Osborne.

 

Return to Top of page

 

 

Example Question 12 

                                           

 

 

Hippocampus sp.

( photo from:  http://www.divegallery.com/seahorse_page3.htm )

 

 

Seahorses are generally found in shallow waters where human disturbance is intensive and extensive.  Seahorse habitats – seagrass meadows, coral reefs and mangroves – are among the most threatened in the world.  Additionally, seahorses are overexploited, as pre-packaged medicines and for the aquarium and curio trade, because demand exceeds supply.

The common species, Hippocampus subelongatus, has been found to successfully grow and reproduce in captivity.  Both growth rates and reproductive success have been influenced by diet.  Long-term studies have identified the calanoid copepod Gladioferens imparipes Thomson to be a potential live food source for Hippocampus subelongatus.  As an adjunct to investigating seahorse growth in captivity and thus fish farming potential – environmental chemists investigated the value of the calanoid copepod Gladioferens imparipes as a food source.  When fed on different algal diets, Gladioferens imparipes produces different unsaturated fatty acids (HUFA’s) and these have the potential to affect growth rate of the seahorses.  Copepods raised on different algal diets were fed to seahorses and the growth rate of the seahorses was established.  Seahorse growth rate was the taken as the length (mm) of the seahorse after six months. Juvenile seahorses, at the commencement of the study were same length (ANOVA testing on lengths, and the Levene Test on variances).

As part of a collaborative research program extending to Asia and co-ordinated from America, the study was repeated in five different laboratories where there had been records of successful ornamental fish farming in captivity.  Two laboratories were located in Western Australia, one in Tasmania, one in Malaysia, and one in Chicago (Illinois USA).  In each laboratory copepods were raised on the following diets, and then fed to seahorses.  The conventional bakers yeast diet was used as a control comparison.  Length (mm) of the seahorse was recorded after six months.

Diet No. &
 Type

Diet Description

1 (control)

Control – “conventional” bakers yeast

2 (algal)

Nannochloropsis oculata

3 (algal)

Dunaliella tertiolecta

4 (algal)

Chaetoceros muelleri

5 (algal)

Isochrysis galbana

 

In this study the length weight of only one seahorse, from each laboratory for one diet, has been recorded (statisticians had checked this was a valid procedure by sub-sampling a number of seahorse and releasing them back into the aquaria).  Data are presented in the Table below.  The experimental design implemented is a randomised block layout, with the five laboratories (LOCATIONS) blocks and there are five experimental treatments, which have been randomly allocated to the five aquaria in each laboratory.

 

        TABLE:   Length in mm, of seahorses (the common species Hippocampus subelongatus) under experimental

aquaria set-ups in different laboratories.

 

Experimental Diet (within one laboratory) FACTOR A

LABORATORY
Location – Blocking (FACTOR B)

Diet 1
Control

Diet 2
N. oculata

Diet 3
D. tertiolecta

Diet 4
C. muelleri

Diet 5
I. galbana

Malaysia

10

9

58

63

68

WA - Site 1

11

10

55

68

63

WA - Site 2

6

7

45

48

47

Tasmania

2

5

50

41

43

Chicago

5

3

37

39

40

 

            i)     List all the dependent and independent variables.

           ii)     Complete the appropriate analysis of variance on these data testing at a = 0.05.  The analysis of variance must be completed manually showing all workings.  Show the critical values from the Fcritical table (at a = 0.05) used to ascertain significance.  All appropriate hypotheses will have been presented and then acceptance or rejection is to be stated.

         iii)     Provide manually completed Scheffé tests for both Factor A and Factor B (a = 0.05).   Include workings, and the two standard Tables presented for each of Factor A and Factor B. 

         iv)     Using GLM check your ANOVA calculations.  Also complete on MINITAB Tukey’s Testing for both Factor A and Factor B (Blocking).  Provide a copy of the MINITAB or SPSS printout.  Ensure you include appropriate annotation on the printout to identify the study, and indicating what the Levels of Factor A and Factor B refer to. 

          v)     For the written summary for this research trial you are to use the ANOVA and the Tukey’s Testing data (a = 0.05) – so there is now the requirement to produce two Tukey’s Test summary tables.  One summary table will relate to Factor A – the comparison of mean (n=5) length in mm of seahorses, from the different treatments (diets), where means are rank ordered and like means are underscored or given the same letter.  The second summary table will consider the “BLOCK” means, and establish where there are location differences (Tukey’s Test a = 0.05).

         vi)     Finally, prepare a written summary and conclusion for this research trial.

For Background Interest only  ( text from:  http://www.divegallery.com/seahorse_page3.htm ) :

The profile view in the above photo of Hippocampus sp. makes gender identification easy - females don't have a brood pouch for carrying the developing young (this one is female). For reproduction, the female deposits a long string of eggs into the male's brood pouch, where the developing brood is nurtured until the baby seahorses emerge, looking like miniatures of their parents, commonly 6-12 millimeters in length.  Once born, the young seahorses receive no care from their parents, and are particularly vulnerable to predators.

The species in the photo (Hippocampus kuda) is commonly known as the spotted seahorse, yellow seahorse, or estuary seahorse.  It is characteristically smoother in appearance than most other species, of medium size, 7-17 cm.

 

Return to Top of page

 

 

Example Question 13 

                                           

 

Marron, Cherax tenuimanus

( photo from:  http://albanymarronfarm.com.au )

An aquatic ecologist was studying the invertebrate fauna of Lake Inspiration, a shallow freshwater lake of southern and inland Western Australia, with physio-chemical attributes having been deemed conducive to intensive crustacean ("yabby" or "marron") farming.  At this stage in the studies the ecologist was specifically addressing the abundance and seasonality of the invertebrate fauna (as a potential food source for the farmed crustaceans).

The data on the following page provide a listing of the summer and winter fauna of this lake (see Table 1) with frequencies.  Sampling was from a 10 cm3 volume.

i)        Is this a quantitative or qualitative frequency distribution (and just very briefly explain why)?

ii)       Represent these data graphically in a way that clearly shows the differences between species (keeping the Phylum, Class or Order listing as given), and the differences over time.

TABLE 1:   Abundance (volume 10 cm3) of invertebrate fauna from a fresh water lake in
                   southern Western Australia, over seasons - frequencies for species is given.

Taxa

Season

Phylum, Class
or Order

Species

Summer

Winter

Porifera

 

Spongilla fragilis

6

2

 

 

 

Turbellaria

 

 

Mesostoma sp.

20

18

Prorhynchus sp.

11

6

 

 

 

Nematomorpha

 

Gordius sp.

127

97

 

 

 

Oligochaeta

 

 

Lunbriculus variegatus

50

73

Tubifex tubifex

1,454

506

 

 

 

Crustacea

 

 

 

Austrochiltonia subtenuis

92

111

Candonopsis tenuis

48

54

Simocephalus latirostris

76

32

 

 

 

Insecta

 

 

 

Aedes vigilax

4,645

1,870

Hyphydrus elegans

1,443

219

Procladius villosimanus

448

21

 

 

 

Mollusca

 

 

Isodorella newcombi

25

21

Physa sp.

57

9

 

 

 

 

 

Return to Top of page

 

 

Example Question 14 

                                           

 

( photo from:  http://www.uow.edu.au/science/frontiers/2003/ )

An environmental scientist was investigating the mercury concentration of a river that had residues from a factory discharged into it.  The scientist was required to determine if there were changes in mercury concentration downstream from the factory outlet.

Five (5) sampling stations were chosen and six readings were made at each station.  The data below are mercury (Hg) concentrations (mg/ml) at the five sampling stations which were situated every 500 metres downstream of the outlet.

                    Station 1                              Station 2                             Station 3

                           factory outlet                 500 m downstream                1,000 m downstream

                        56                                      46                                           41

                        54                                      42                                           44

                        60                                      44                                           46

                        63                                      49                                           40

                        60                                      44                                           39

                        57                                      40                                           36

 

                                                   Station 4                                     Station 5

                                                    1,500 m downstream                 2,000 m downstream

                                                   39                                              28

                                                   41                                              33

                                                   38                                              36

                                                   37                                              34

                                                   43                                              29

                                                   42                                              31

i)       List the dependent and the independent variables.

ii)      Formulate the appropriate null and alternative hypotheses for the environmental study, and indicate

         acceptance or rejection at a = 0.05.

 

iii)      Complete an analysis of variance (ANOVA) on these data, testing at a = 0.05, to compare mean Hg levels

         at the sampling stations.  Complete the analysis by hand showing all workings, then check using MINITAB

         or SPSS.   Provide your MINITAB or SPPS printout, with appropriate labelling and description. 

 

iv)     If a difference between means exists (at a = 0.05) use Scheffé tests to elucidate where any differences

        between means are statistically significant.  Manually complete the Scheffé tests, and provide the required

        two tables associated with the calculations, one table gives calculations, the second table presents means.

 

v)      Finally, formulate a written summary of this sampling using the information from the analysis of variance

        (ANOVA) summary, with any explanations you may see as being biologically / environmentally feasible.

 

 

Return to Top of page

 

 

Example Question 15 

                                           

 

Eucalyptus globulus seedlings.

( photo from:  http://www.forestalnet.com/tempo2000/fotos2000.htm )

As part of a collaborative research program between Universities in Chile (South America) and W.A., a plant species trial was established near Lake Rapel in central Chile where rainfall averages 560 mm and the dry season is long (7-9 months).  Four fertiliser amendments (which included a control - no amendment) were applied to Eucalyptus globulus (Tasmanian blue-gum) seed which had been sown at the beginning of winter.  Four months after fertiliser application seedling survival numbers and the height of the seedlings were recorded.  Heights are given for statistical analysis.

The experimental design implemented for this study was a randomised block lay-out with five block replicates of each of the fertiliser amendments (see Table 1).  At this field study site there had appeared to be a trend for soil 'conditions' to improve in the more southerly blocks.  In the absence of different treatments, it was anticipated growth would be better in the southern blocks.  Thus fertiliser amendments were randomly allocated to each block.

TABLE 1:    Randomised block lay-out for five replicates of the four fertiliser amendments

                 on a study site having a possible soil fertility trend in the direction indicated.

BLOCK 1

3

4

1

2

North

 Soil Improvement Trend

↓South

BLOCK 2

1

3

2

4

BLOCK 3

3

2

4

1

BLOCK 4

4

2

1

3

BLOCK 5

1

1

3

2

 

Key:

Amendment 1

control (no nutrients)

 

Amendment 2

phosphate

 

Amendment 3

phosphate + nitrate

 

Amendment 4

phosphate + nitrate + trace elements

 

After four months the height of seedlings (cm) (one randomly selected seedling per treatment / block combination) was recorded and these data are given in Table 2.  The objective of the trial was to establish whether fertiliser addition had improved plant growth.  Comparisons between fertiliser amendments were required, and block variability with respect to plant growth, was to be investigated.

 

TABLE 2:  Height (cm) after four months of Eucalyptus globulus (Tasmanian blue-gum) seedlings,

grown using one of four different fertiliser amendments.

 

Amendment


BLOCK

control
no application of fertiliser

phosphate

(P)

nitrate

(N)

phosphate +
nitrate +
trace elements

(P + N + trace)

BLOCK 1

11

15

21

27

BLOCK 2

10

14

22

29

BLOCK 3

9

14

24

21

BLOCK 4

15

12

20

30

BLOCK 5

14

14

25

33

i)       List all the dependent and independent variables.

ii)      Complete the appropriate analysis of variance on the data testing at a = 0.05.  The analysis of variance

         must be completed manually showing all workings.  Show the critical values from the Fcritical table

         (at a = 0.05) used to ascertain significance.  All appropriate hypotheses will have been presented and

         then acceptance or rejection are to be stated.

         Check your calculations with MINITAB, and provide a MINITAB or SPSS printout. 

iii)      Would you recommend Scheffé testing for block differences?  Elaborate.

iv)     Now use the Scheffé test (a = 0.05), where appropriate, to compare mean heights of Eucalyptus globulus

         seedlings.  Manually complete the Scheffé tests, and provide the required two summary tables for any set

         of Scheffé testing.  Hypotheses will have been formulated, presented, and their acceptance or rejection

         given.

v)      Provide a written summary and conclusion for this research trial.

 

 

Return to Top of page

 

 

Example Question 16 

                                           

 

School of sharks

( photo from:  http://www.aloha.com/~lifeguards/sharintr.html )

 

A zoologist was studying the physiology of two species of sharks (Family Carcharhinidae) known from the waters off Rottnest Island, Western Australia.  One minor aspect of this study was to look at the phosphorus content of different body organs.  The phosphorus content (measured in mg/100 g) of each of four different organs was determined in two species, and the data are tabulated below.

 

Complete the appropriate ANOVA for these data, ensuring all relevant means are compared.

 

FACTOR B:
Species

FACTOR A:
Body Organ

 

Heart

Gills

Liver

Kidney

Dusky Shark

(Carcharhinus obscurus)

86.7

106.3

204.6

188.0

89.0

102.7

209.8

184.6

91.2

108.1

213.2

186.5

88.4

107.9

213.2

186.5

Long-nosed Grey Shark

(Carcharhinus brevipinna)

 

81.2

102.3

201.1

177.3

80.4

99.8

198.7

175.2

77.6

97.3

195.6

179.0

78.2

92.9

192.3

173.4

i)       List all dependent and independent variables.

ii)      Use MINITAB (or SPPS) to complete the appropriate analysis of variance.  Show the critical values from

         the Fcritical table (at a = 0.05) used to ascertain the significance of all main effects, and the interaction. 

         All appropriate hypotheses will have been presented and then acceptance or rejection are to be stated.

        

iii)      Provide an appropriate summary of means either via MINITAB, SPSS or another package (e.g. Excel),

         or manually.

 

iv)     Complete Scheffé (a = 0.05) testing where appropriate.  You should show two tables associated with

         each set of Scheffé calculations, the first set will show your 'calculations' , the second table will

         appropriately tabulate means.

 

v)      Summarise and interpret the data set.  - An interaction graph (presented as a Figure) is to  be included and

         referred to during the interpretation.

 

 

Return to Top of page

 

 

Example Question 17 

                                           

 

Dusky shark

Long-nosed Grey shark

 

A zoologist was studying the physiology of two species of sharks (Family Carcharhinidae) known from the waters off Rottnest Island, Western Australia.  One minor aspect of this study was to look at the phosphorus content of different body organs.  The phosphorus content (measured in mg/100g) of each of four different organs was determined in two species, and data are tabulated below.

Thus the design of this investigation considered two independent variables.

         FACTOR A:        Body  Organ, with four levels of this variable.

         FACTOR B:        Species, with two levels of this variable.

 

Each treatment condition was replicated FOUR (4) times.  The dependent variable was phosphorus content in mg/100g.

 

The results from this investigation are recorded below:

 

FACTOR B:
Species

FACTOR A:
Body Organ

 

Heart

Gills

Liver

Kidney

Dusky Shark

(Carcharhinus obscurus)

86.7

106.3

204.6

188.0

89.0

102.7

209.8

184.6

91.2

108.1

213.2

186.5

88.4

107.9

213.2

186.5

Long-nosed Grey Shark

(Carcharhinus brevipinna)

 

81.2

102.3

201.1

177.3

80.4

99.8

198.7

175.2

77.6

97.3

195.6

179.0

78.2

92.9

192.3

173.4

 

A two-way ANOVA with replication was used to analyse these data. The summary table of this analysis  and the table of means are provided below.

 

(a)        Testing for Homogeneity of Variance was completed for this data set, prior to any ANOVA analysis.  Use the information below to determine whether the null hypothesis of variance equality is accepted or rejected.  State the test statistic and indicate the basis of acceptance or rejection of the null hypothesis.

(b)             Complete all appropriate null and alternative hypotheses and indicate acceptance or rejection at

            a = 0.05  stating how you arrived at these conclusions.

 

(c)             Provide an interaction graphical summary (see below for table of means).

 

(d)             Complete Scheffé (a = 0.05) testing where appropriate.  You should show two tables associated with each set of Scheffé calculations:  the first  will show your 'calculations', and the second will appropriately tabulate means providing underscoring and (or) like letters.

 

(e)             Interpret the results, using information from the analysis of variance summary, the Scheffé testing, and the interaction graph.

 

 

MTB > Table 'SPECIES' 'ORGAN';

SUBC>   Means 'Pmg/100g';

SUBC>   N 'Pmg/100g'.

Tabulated Statistics

 

 ROWS: SPECIES     COLUMNS: ORGAN

 

                        Heart   Gills  Liver   Kidney      ALL

 

   Dusky                88.82   106.25   210.20   186.40     147.92

                        4        4        4        4          16

 

   Long-nosed           79.35    98.08   196.93   176.23     137.64

                        4        4        4        4          16

 

   ALL                  84.09   102.16  203.56  181.31       142.78

                        8        8        8        8          32

 

  CELL CONTENTS --

         Pmg/100g:MEAN

                  N

  

 

MTB > ANOVA 'Pmg/100g' = SPECIES! ORGAN;

SUBC>   Means SPECIES! ORGAN.

 

Analysis of Variance (Balanced Designs)

 

 

Factor     Type Levels Values

SPECIES   fixed      2     1     2

ORGAN     fixed      4     1     2     3     4

 

 

Analysis of Variance for P mg/100g

 

Source           DF         SS         MS       F      P

SPECIES           1        845        845   99.76  <0.001

ORGAN             3      82191      27397 3236.02  <0.001

SPECIES*ORGAN     3         28          9    1.11   0.366

Error            24        203          8

Total            31      83267

 

    MEANS

 

SPECIES           N     P mg/100g