You are not currently logged in. Please log in to CEUfast to enable the course progress and auto resume features.

View Full Course Library

Introduction to Epidemiology

This Course Has Expired

Sorry, but this course is no longer active. We are keeping the course material here for research puposes. View our full course library
Author:    Julia Tortorice (RN, MBA, MSN, NEA-BC, CPHQ)


Define epidemiology and significant terminology used in that science.

  1. Identify classifications of epidemiological studies.
  2. Discuss analytical study designs.
  3. Discuss the role of mathematical and quantitative tools in epidemiology.
  4. Discuss criteria for cause and effect relationships.

Epidemiology Definition

“An apocryphal story is told around CDC that illustrates the confusion sometimes accompanying the term "epidemiology." It seems that one of our scientists, on first arriving at CDC from a clinical practice, found himself somewhat unsure of what epidemiology was all about, so he sought an answer down the street at Emory University.


The first person he asked was a medical student, who told him that epidemiology was "the worst taught course in medical school." The second, a clinical faculty member, told him epidemiology was "the science of making the obvious obscure." Finally, knowing that statistics are important to epidemiology, he asked a statistician, who told him that epidemiology is "the science of long division" and provided him with a summary equation. Giving up on finding a real answer, he returned to CDC. On the way, however, he decided to try one more time. He stopped a native Atlantian who told him that epidemiology was "the study of skin diseases."


A less entertaining, but more conventional, definition of epidemiology is the study of the distribution and determinants of health-related states in specified populations, and the application of this study to control health problems.”1


A comparison between the practice of public health and the practice of healthcare helps describe epidemiology. Where healthcare practitioners collect data on an individual patient by taking a medical history and conducting a physical exam, epidemiologists collect data about an entire population through surveillance systems or descriptive epidemiologic studies.


The healthcare practitioner uses his or her data to make a differential diagnosis. The epidemiologist's data is used to generate hypotheses about the relationships between exposure and disease.


Each discipline then test the hypotheses; the healthcare practitioner by conducting additional diagnostic studies or tests; and the epidemiologist by conducting analytical studies such as cohort or case-control studies.


The final step is to take action. The healthcare practitioner prescribes medical treatment, and the epidemiologist, some form of community intervention to end the health problem and prevent its recurrence.

Epidemiology deals with groups of people rather than with individual patients. It is the basic science of public health. Epidemiology is a highly quantitative discipline based on principles of statistics and research methodologies.

Epidemiologists study the distribution of frequencies and patterns of health events within groups in a population. To do this, they use descriptive epidemiology, which characterizes health events in terms of time, place, and person.

Epidemiologists attempt to search for causes or factors that are associated with increased risk or probability of disease. Analytical epidemiology moves from questions of "who," "what," "where," and "when" and start trying to answer "how" and "why." 

Epidemiology Focus

Infectious diseases were clearly the focus of much of the early epidemiologic work; however, this is no longer true. Epidemiology as it is practiced today is applied to the whole spectrum of health-related events, which includes chronic disease, environmental problems, behavioral problems, and injuries in addition to infectious disease.

Although epidemiology can be used simply as an analytical tool for studying diseases and their determinants, it serves a more active role. Epidemiologic data steers public health decision making and aids in developing and evaluating interventions to control and prevent health problems. This is the primary function of applied, or field epidemiology.

One succinct way to sum up the task of epidemiologists is to say that they "count things." Basically, epidemiologists count cases of disease or injury, define the affected population, and then compute rates of disease or injury in that population. Then they compare these rates with those found in other populations and make inferences regarding the patterns of disease to determine whether a problem exists.


For example you might ask: Is the rate of disease among people with no known risk factors greater than we would expect? Is the pattern or distribution of the cases suspicious? Once a problem has been identified, the data are used to determine the cause of the health problem; the modes of transmission; any factors that are related to susceptibility, exposure, or risk; and any potential environmental determinants.

Epidemiologic Studies

Epidemiologists used several different types of studies. Simply speaking, these can be classified as either experimental, where the epidemiologists have control over the circumstances from the start, or observational, where they do not. Vaccine efficacy trials are a good example of experimental studies because investigators control who gets the vaccine and who doesn't.


Observational studies can be further subdivided into descriptive and analytical studies. In a descriptive study, the epidemiologist collects information to characterize and summarize the health event or problem. In an analytical study, the epidemiologist relies on comparisons between groups to determine the role of various risk factors in causing the problem. Descriptive epidemiology is the most basic of these categories and is fundamental to the work of epidemiologists.


Another way of comparing descriptive and analytical epidemiology is to say that in the descriptive process, we are concerned with "person" (Who was affected?), "place" (Where were they affected?), and time (When were they affected?). Once we know the answers to these questions, we can enter the realm of analytical epidemiology and ask how and why these people were affected.


Any inferences you make are likely bases for hypotheses, which would then have to be tested using one of three analytical study designs: cross-sectional, cohort, and case-control.In all three types, the epidemiologist is attempting to discover the relationship between an exposure or risk factor and a health outcome. For example: Did the chicken salad at the company picnic cause the salmonella outbreak? Does cigarette smoke cause lung cancer? Are alcohol use and motor vehicle crashes related? Does the supplement L-tryptophan cause EMS?


A cross-sectional study, is basically the same as a survey. In this type of study, the epidemiologist defines the population to be studied and then collects information from members of the group about their disease and exposure status. Since the data represent a point in time, it's like taking a "snapshot" of the population. Cross-sectional studies are good for examining the relationship between a variable and a disease, but not for determining cause and effect, which requires data over time. Cohort studies and case-control studies are much better suited to examining cause-and-effect relationships.


In a cohort study, you select the study population according to their exposure; regardless of whether they have the disease or health outcome you're studying. You then determine the outcomes and compare them on the basis of the individuals' exposures. Cohort studies are often referred to as prospective studies because they follow the study population forward in time, from suspected cause to effect. An example would be dividing a group of people on the basis of their smoking status and following them for 20 years to see if they develop lung cancer.


Cohort studies are also used to investigate outbreaks in small, well-defined populations. For example, you would use a cohort study to answer the question posed earlier regarding the cause of a salmonella outbreak at a company picnic. In this situation, you would ask each attendee about their exposure (e.g., what foods and beverages they consumed at the picnic) and whether they became ill afterward. The relationship between exposure and outcome in a cohort study is quantified by calculating the relative risk for the exposure.


Cohort studies have several advantages:

  • You can examine multiple outcomes for a single exposure. For instance, if you select a group based on their smoking exposure, you can look not only at the incidence of lung cancer, but also at the incidence of cardiovascular disease, emphysema, burns, and other smoking-related outcomes.
  • Cohort studies are very useful in examining rare exposures, such as asbestos exposure and lung cancer.
  • You can directly calculate the incidence of disease for each of the exposure groups.
  • The temporal sequence is logical: you are starting with exposure and following forward in time to the development of disease.

Disadvantages of cohort studies are that they are costly in time and resources and, if the disease is rare, you need a large number of subjects. Also, because you are following forward in time, logistical problems may develop and subjects can be lost to follow-up.


In a case-control study, the epidemiologist is working backward, from the effect to the suspected cause. For this reason, case-control studies are often referred to as retrospective studies. Participants are selected on the basis of the presence or absence of the disease or outcome in question, so that you have one group of people (case-subjects) with the health problem and one without (controls). These groups are then compared to determine the presence of specific exposures or risk factors. For example, you could pick a group of people with lung cancer and a group without and then compare them for their history of exposure to smoking. The relationship between exposure and outcome in a case-control study is quantified by calculating the odds ratio.


Case-control studies have three primary advantages:

  • You can examine multiple exposures for a single outcome.
  • They are well suited for studying rare diseases and those with long latency periods.
  • They require fewer case-subjects and are generally quicker and less expensive to conduct than cohort studies, which make them well suited for the conditions of an outbreak investigation.

The disadvantages of case-control studies are that they aren't suitable for studying rare exposures; they are subject to bias because of the method used to select controls; and they don't allow you to directly measure the incidence of disease. Also, because they look backward, case-control studies may create uncertainty about the temporal relationship between exposure and disease.


One important point must be made about analytical epidemiologic studies: In these studies, we are attempting to answer the how and why questions and we are able to quantify the relationship between an exposure and an outcome.


However, a mathematical, or quantitative, relationship between the two is not enough to establish causation. We can demonstrate increased relative risk of lung cancer for smokers in a cohort study, or we can demonstrate in a case-control study that people with lung cancer are much more likely to have smoked in the past; but that alone doesn't establish cause and effect.


In general, five criteria must be met to establish a cause-and-effect relationship:

  • Strength of association--the relationship must be clear.
  • Consistency--observation of the association must be repeatable in different populations at different times.
  • Temporality--the cause must precede the effect.
  • Plausibility--the explanation must make sense biologically.
  • Biological gradient--there must be a dose-response relationship.

Disease Transmission

In talking about epidemiology, it is important to review how outbreaks occur. First, we'll look at three commonly used, and often misunderstood, terms: "epidemic," "outbreak," and "cluster." An epidemic is the occurrence of more cases of disease than would normally be expected in a specific place or group of people over a given period of time.


To an epidemiologist, "outbreak" means basically the same thing. In the public's mind, however, "epidemic" has a far more serious connotation than "outbreak." For this reason, "outbreak" is often used to avoid sensationalism. The third term, "cluster," is occasionally used, incorrectly, in place of "epidemic" or "outbreak."


A cluster is a group of cases in a specific time and place that may or may not be greater than the expected rate. Often the aim of investigating clusters is to determine the baseline rate of disease for that time and place. Two other terms you will come across are "endemic," meaning a high background rate of disease, and "pandemic," meaning very widespread, often global, disease.


For an outbreak, or epidemic, to occur, the basic elements of disease causation and an adequate chain of transmission must be present. Disease occurs when an outside agent capable of causing the disease meets a host that is vulnerable to the agent in an environment that allows the agent and host to interact. Then, given a chain of transmission from one host to another and a suitable mode of spread, an outbreak can develop.


These basic concepts help guide the selection of public health strategies to prevent health problems. Depending on which approach might be most effective, we might direct efforts at the specific agent (e.g., guinea worm), host (e.g., immunization to prevent measles), or environment (e.g., sanitation improvements to prevent salmonella). We can also target a specific point in the chain of transmission. This was the response in the E. coli outbreak in Washington State in the early 1990s, when health officials called for the thorough cooking of hamburgers to interrupt transmission of the bacterium.


To elaborate further, the agent is the entity necessary to cause disease in a susceptible host. Classically, we think of an agent as being biological: a bacterium, parasite, or virus. But an agent can also be a physical force, as in the example of motor vehicle crashes; a chemical, as in an environmental problem; or a nutritional imbalance, as with rickets. When we talk about agents, several characteristics are important to consider:

  • Infectivity--the capacity to cause infection in a susceptible host.
  • Pathogenicity--the capacity to cause disease in a host.
  • Virulence--the severity of disease that the agent causes in the host.

The host is the person, or in a more generic definition, the organism, that is susceptible to the effect of the agent. The status of the host is quite important and is generally classifiable as susceptible, immune, or infected. Finally, and also quite important, is that the host's response to exposure can vary widely, from showing no effect to manifesting subclinical disease, atypical symptoms, straightforward illness, or severe illness.


The environment is the conditions or influences that are not part of either the agent or the host, but that influence their interaction. A wide variety of factors, including physical, climatologic, biologic, social, and economic conditions, can come into play. For instance, in a study of motor vehicle injuries, the agent (mechanical energy) and the host (driver) could be affected by the topography, the weather, and the actions of other drivers. In many infectious disease outbreaks, social and economic conditions cause overcrowding and lead to high levels of exposure.


Agent, host, and environment alone are not sufficient to cause an epidemic; an adequate chain of transmission must be present. This process requires a source for the agent, a portal of exit, a mode of transmission, and a portal of entry. The first element, the source for the agent, is often the place where the agent originates, where it lives, grows, and multiplies, but this is not always the case. The agent that causes botulism (Clostridium botulinum), for example, originates in soil, but the source of most botulism infections is improperly canned food containing C. botulinum spores.


The second element, a portal of exit, is a pathway by which the agent can leave the source. This pathway is usually related to the place where the agent is localized. For instance, the agents causing tuberculosis and the flu are released through the respiratory tract, whereas agents for many stomach ailments are released through the digestive tract. Agents found in the blood, such as hepatitis B and HIV, can be released through cuts or needles.


Once the agent leaves the source, a mode of transmission, or means of carrying it to the host, is needed. This can happen in a number of ways, some of which are direct and some indirect. Direct transmission includes contact with soil or plants as well as contact between people. In indirect transmission, the agent can be airborne, vector borne, or vehicle borne.


In airborne transmission, the agent is carried from the source to the host suspended in air particles. Vector-borne diseases are transmitted indirectly by a live carrier, usually an arthropod, such as mosquitoes, fleas, or ticks. Vehicle-borne diseases are carried by inanimate objects, such as food or water, blood, or items like handkerchiefs, bedding, and surgical instruments.


Finally, there must be a pathway into the host, a portal of entry, that gives the agent access to tissue where it can multiply or act. Often the agent enters the host in the same way that it left the source. This is the case with the flu virus, which leaves the source through the respiratory tract and enters a new host through the respiratory tract.

Field Epidemiology

Field epidemiology is, in the most general terms, the practice or application of epidemiology to control and prevent health problems. Epidemiologists are mobilized under a variety of circumstances, prime ones being when a problem is acute and unexpected and when quick action is required.

The following incidents illustrate some of the key reasons for needing applied, or field, epidemiology:

  • They were unexpected.
  • They demanded a response.
  • The investigators had to go out into the field to solve the problem.

First Incident: In March 1985, a nurse epidemiologist in a county health department noted, while reviewing surveillance data, three cases in a single month of hepatitis B of unusual origin. Hepatitis B, or serum hepatitis, is transmitted through sexual contact and by exposure to infected bodily fluids, but these three patients did not seem to have the usual risk factors. All three people did, however, indicate having received injections at the same healthcare facility.

The nurse's immediate questions were: Is this a coincidence? Did these three cases occur by chance or is there a link? In this instance, the nurse decided to pursue an investigation.” 1

Second Incident: “At 8:30 in the morning on August 2, 1976, Dr. Robert B. Craven of CDC's Viral Diseases Division received a call from a nurse at a Veterans' Hospital in Philadelphia, Pennsylvania. The nurse reported two cases of severe respiratory illness, one of which had been fatal. Both people had attended the annual American Legion Convention held July 21-24. By the evening of August 2, 71 more of the people attending the convention had the same illness, with symptoms of acute onset of fever, chills, headache, malaise, dry cough, and myalgia. Further conversations with local and state public health officials revealed that between July 26 and August 2, 18 conventioneers had died. Deaths were due primarily to pneumonia.

An intense investigation began immediately. The incident became known as the first outbreak of Legionnaires' disease and led to the discovery of the gram-negative pathogen, Legionnella pneumophila. “ 1

Third Incident: “On October 30, 1989, a New Mexico physician notified the state's health department of three patients with marked peripheral eosinophilia and severe myalgia. All three patients had been taking oral preparations of L-tryptophan, a nonprescription drug sold as a dietary supplement in health food stores. Despite extensive clinical evaluation and testing, the illness could not be identified.

An investigation followed and resulted in the characterization of eosinophilia-myalgia syndrome, EMS. The investigation implicated a vehicle for exposure--L-tryptophan dietary supplements--before a suspected agent was identified, and the product was taken off the market. Eventually, the problem was traced to a contaminant that had been introduced by changes in the production process at a single manufacturing facility.’ 1

Field investigations are action oriented, with the main goal being to solve a pressing public health problem. Uppermost in investigators minds is the need to institute the controls necessary to safeguard health as soon as possible, and often, as in the example of L-tryptophan and EMS, this step is taken before the entire investigation is complete. Limited control over the situation, little time for planning a study, and limited data sources and laboratory samples challenge investigators. However, the obligation remains to do the best science possible under the circumstances.


1. Center for Disease Control Web Page, Nov. 2001