Intro. & Technical Enablers
August 01, 2024
Materials:
Brightspace
“Without data, you’re just another person with an opinion.” – W. Edwards Deming
Observational
In observational studies, no attempt is made to control or influence the variables of interest. A survey is a good example.
An example of an observational study is researchers observing a randomly selected group of customers that enter a Walmart Supercenter to collect data on variables such as time spent in the store, gender of the customer, and the amount spent.
Experimental
An experimental study involves the active manipulation of one or more independent variables to observe their effect on a dependent variable, while controlling for confounding factors. Participants are typically randomly assigned to groups (e.g., treatment vs. control), and outcomes are compared to determine causal relationships. This design provides strong evidence for cause-and-effect due to the controlled environment and random assignment.
The largest experimental study ever conducted is believed to be the 1954 Public Health Service experiment for the Salk polio vaccine. Nearly two million U.S. children (grades 1 through 3) were selected.
Population: the set of all elements of interest in a particular study.
Sample: a subset of the population.
Descriptive Statistics: Tabular, graphical, and numerical summaries of data.
Inferential Statistics: The process of using data from the sample to make estimates or test hypotheses about the characteristics of a population
Estimation: Using sample data to approximate population parameters.
Hypotheses Testing: Determining if there is enough evidence in a sample to support a claim about a population.
Prediction: Forecasting future events based on historical data.
Modern Data Driven Decision Making (M3DM):
Technical Pillars:
Practical Application:
Data Mining Lab