CEMETERY DATA ANALYSIS USING
MS EXCEL
D. B. Hoagland
For this experiment you will collect dates of birth and death for persons buried in a cemetery. We will use MS Excel to estimate various parameters, such as mean life span for males and females, average life expectancy, mortality rates, and survivorship for our graveyard population. Some of these data will be represented graphically to facilitate our investigation. Inspect these graphs carefully and identify trends in the data. Can you pose any questions after close scrutiny of the data?
The following procedures will assist your analysis.
These instructions were written for use with MS Excel version 5.0. Load MS Excel by following the instructions on the screen. A blank worksheet should appear. Check in the Tools menu under Data Anaylsis. . . , to see if you have the function Histogram loaded into the program. If Histogram is not present, you must load it. To accomplish this feat, pull down the Tools menu and select Add-Ins. . . Choose the Analysis ToolPak. WARNING: MS EXCEL USES LOTS OF RAM, AND THIS OPTION USES EVEN MORE!
- MS Excel basics (extremely basic)
- Data input is controlled by using regular boxes or "cells", which have an address that contains a letter denoting a column and a number indicating a row. The width and height of cells are changeable.
- To place information into a cell it must be highlighted. Use arrow keys or the mouse to move the cursor to a specific cell.
- The arrow cursor can also be used to pull down a menu from the top of the worksheet.
- Cell contents can be deleted by highlighting the cell and choosing delete from the menu or hitting the delete key
- Choose File/Save often
- Data entry and analysis.
- Label five consecutive columns: Marker (optional), Sex, DOD (Date of Death), DOB (Date of Birth), and Age. Start ten rows from the top of the worksheet.
- Enter data: type in the recorded data into the first three or four columns.
- Calculate Age data
- Enter =(first DOD datum address)-(first DOB datum address). Example: =C12-D12
- Copy the formula and paste to all cells in the Age column
- Sort data by sex
- Highlight all data (exclude column labels)
- Choose Data/Sort from the menu
- Scroll to Sex column in the Sort By window
- Select Ascending order
- Click OK
- Description of composite data
- Label seven rows above the data with the following: Count, Mean Age, Max Age, Min DOB, Max DOB, Min DOD, Max DOD
- In the cell adjacent to Count, type =count (range of cells for Age, e.g., E10..E110), alternatively, highlight the range of cells with the cursor, hit return or enter
- In the cell adjacent to Mean Age, type =average (range of cells for Age, e.g., E10..E110), hit enter or return
- Adjacent to Max Age, type=max (range of cells for Age, e.g., E10..E110), hit enter or return
- Continue for Min DOB, Max DOB, Min DOD, and Max DOD
- Description of data by sex.
- Copy male data and female data to new ranges
- Complete above descriptive analysis for each sex
- Calculating a Life Table
- Label the next column following Age as Age Class
- Type age groups (five year intervals) in the Age Class column in ascending order. Begin with 0, 1, 6, 11, 16 and end with two years older than the Max Age ( if the oldest person is 101, add 102 and 103)
- Definitions of variables
- x = age interval or age class
- nx = number of survivors at start of age interval X (raw field data)
- lx = proportion of organisms surviving to start age interval x
- dx = number or proportion dying during age interval x to x + 1
- qx = rate of mortality during the age interval x to x + 1
- Lx = number of individuals alive on the average during the interval x to x + 1
- Tx = total years lived by individuals in the population
- ex = mean expectation of life for individuals alive at the start of age interval x
- Calculation of x and dx
- Choose Tools/Data Analysis/Histogram
- Select Age Data column with the label in the Input Range window
- Select Age Class column with the label in the Bin Range window
- Select labels, output range, and check OK. The number of individuals that died in each age class is computed and placed in the frequency column adjacent to the bin column
- Relabel the bin column to x and the frequency column to dx
- Label consecutive columns following dx: lx, 1000qx, Lx, Tx, and ex
- Calculation of lx
- The first lx value (l0) is equal to the sample size (n)
- The remaining lx values are calculated as follows:
- The second lx value (l1) is equal to l0 - d0
- Enter this formula into the second cell in the lx column, e.g., =V16-W16, where V16 is the cell address for l0 and W16 is the cell address for d0
- Copy this formula to the remaining cells in this column
- Calculation of 1000qx
- Set format in this column to one decimal place
- Highlight all cells in the column
- Select Cells from Format menu
- Select Custom, type 0.0 in Code box
- Enter =1000*(dx/lx) in the first cell in this column, NOTE: USE CELL ADDRESSES FOR dx AND lx
- Copy formula to all remaining cells in this column
- Calculation of Lx
- Set format in this column to one decimal place
- Enter =0.5*(x1-x0)*(l1+l0), NOTE: USE CELL ADDRESSES FOR VARIABLES
- Copy formula to all remaining cells in this column
- Calculation of Tx
- Set column format to one decimal place
- Set column width to 12
- Tx is the cumulative sum of Lx values from the bottom of the column
- Type =sum(current cell address..$last cell column letter$last cell row number), e.g., =sum(X39..$X$39)
- Copy formula to the cells above
- Calculation of ex
- Set column format to one decimal place
- Type =Tx/Lx, NOTE: USE CELL ADDRESSES FOR VARIABLES
- Copy formula to remaining cells
- Graphing results
- Plot Temporal Trends in data using Chart Wizard
- Select the Chart Wizard icon from the menu bar
- Create a space for the chart by placing the cursor in an area of the worksheet containing empty cells, depress the mouse button, and drag to desired chart size. A dialogue box should appear entitled "Chart Wizard - Step 1 of 5" with the Range box displaying a cell(s) address
- Select Year of Death (from the raw data) for Temporal Trends
- Depress the Command key, and select the Age of Death data (from raw data), hit Next button
- Choose Line Chart type, hit Next button
- Choose an appropriate Line Chart (#2), hit Next button
- Choose Data Series in Columns, Use First (1) Column for Category (X) Axis Labels, Use First (1) Row for Legend Text, hit Next button
- Add a Legend - Yes, Type in an appropriate Chart Title, and Axes Titles (X=Year of Death, Y=Age of Death), hit Finish button
- Format axes by doubling clicking on the axis and adjust accordingly
- Plot Mortality, Mortality Rate vs. Age, Survivorship, and Life Expectancy as follows: Choose Age data (from Life Table data) as the X axis and the following data as Y variables:
- dx for Mortality
- 1000qx for Mortality Rate vs. Age
- lx for Survivorship
- ex for Life Expectancy
Any comments or suggestions? If so, please send them to Buzz Hoagland (bhoagland@wsc.mass.edu)
Last updated
Return to the Biology Department Homepage
Return to the WSC Homepage