BIOLOGY FOR LIFE
  • IB Bio Syllabus
    • Unity and Diversity (A) >
      • A1 molecules >
        • A1.1: Water
        • A1.2: Nucleic Acids
      • A2 Cells >
        • A2.1: Origins of Cells
        • A2.2: Cell Structure
        • A2.3: Viruses
      • A3 Organisms >
        • A3.1: Diversity of Organisms
        • A3.2: Classification and Cladistics
      • A4 Ecosystems >
        • A4.1: Evolution and Speciation
        • A4.2: Conservation of Biodiversity
    • Form and Function (B) >
      • B1 Molecules >
        • B1.1: Carbohydrates and Lipids
        • B1.2: Proteins
      • B2 Cells >
        • B2.1 Membranes and Membrane Transport
        • B2.2 Organelles and Compartmentalization
        • B2.3 Cell Specialization
      • B3 Organisms >
        • B3.1 Gas Exchange
        • B3.2 Transport
        • B3.3 Muscle and Motility
      • B4 Ecosystems >
        • B4.1 Adaptation to Environment
        • B4.2 Ecological Niches
    • Interaction and Interdependence (C) >
      • C1 Molecules >
        • C1.1: Enzymes and Metabolism
        • C1.2: Cell Respiration
        • C1.3: Photosynthesis
      • C2 Cells >
        • C2.1: Chemical Signaling
        • C2.2: Neural Signaling
      • C3 Organisms >
        • C3.1: Integration of Body Systems
        • C3.2: Defense Against Disease
      • C4 Ecosystems >
        • C4.1 Populations and Communities
        • C4.2 Transfers of Energy and Matter
    • Continuity and Change (D) >
      • D1 Molecules >
        • D1.1: DNA Replication
        • D1.2: Protein Synthesis
        • D1.3: Mutation and Gene Editing
      • D2 Cells >
        • D2.1: Cell and Nuclear Division
        • D2.2: Gene Expression
        • D2.3: Water Potential
      • D3 Organisms >
        • D3.1: Reproduction
        • D3.2: Inheritance
        • D3.3: Homeostasis
      • D4 Ecosystems >
        • D4.1: Natural Selection
        • D4.2: Stability and Change
        • D4.3: Climate Change
  • IB Requirements
    • Internal Assessment >
      • Research Design
      • Analysis
      • Conclusion
      • Evaluation
    • External Assessment >
      • Exam Revision
    • Extended Essay
    • Reflective Project
    • Collaborative Sciences Project
    • Learner Profile
  • Skills for Biology
    • Tools >
      • Experimental Techniques >
        • Addressing Safety
        • Measuring Variables >
          • Measurement Uncertainties
          • Observations
          • Biological Drawings
        • Applying Techniques >
          • Microscopy
      • Technology >
        • Tech to Collect Data
        • Tech to Process Data
      • Mathematics >
        • General Math
        • Units and Symbols
        • Processing Uncertainties
        • Graphing >
          • Types of Graphs
          • How to Graph
          • Graph Error Bars
    • Inquiry Processes >
      • Exploring >
        • Research Questions
        • Hypotheses and Predictions
      • Designing >
        • Variables
        • Sampling
      • Control of Variables
      • Collecting Data >
        • Data Tables
      • Processing Data
      • Interpreting Results
      • Concluding
      • Evaluating >
        • Error Analysis
  • Statistics
    • Descriptive Statistics >
      • Skew and the Normal Distribution
      • Outliers
      • Measures of Central Tendancy
      • Measures of Dispersion
      • Correlation Coefficients
      • Coefficient of Determination
    • Inferential Statistics >
      • Standard Error
      • T-Test
      • ANOVA
      • Kruskal-Wallis
      • X2 Test of Independence
      • X2 Goodness of Fit
    • Glossary of Statistic Terms and Equations
  • SHS Course Info
    • Above & Beyond >
      • Biology Club
      • Pumpkin Carving
      • Scavenger Hunt
      • Science News
      • Wood Duck Project (legacy)
      • Invasive Crayfish Project (legacy)
    • Assessment >
      • Class Grading IB Bio I
      • Class Grading IB Bio II
      • Daily Quizzes (legacy)
      • Lab Practicals (legacy)
    • Class Photos
    • Recommendations
  • Contact
    • About >
      • Philosophy
      • Resume
      • Reflection
      • Favorite Quotes
      • AEF Blog
  • Expeditions
    • Bahamas (2009)
    • Trinidad (2010)
    • Trinidad (2011)
    • Ecuador (2012)
    • Trinidad (2013)
    • Peru (2014)
    • Bahamas (2015)
    • Peru (2016)
    • Costa Rica (2017)
    • Costa Rica (2018)
    • Arizona (2022)
    • Florida (2023)
    • Belize (2024)
    • Costa Rica (2025)
  • Summer Ecology Research
  • Teacher Resources

Correlation Coefficients

Correlation coefficients are essential descriptive statistical tools that quantify the strength and direction of the relationship between two variables of data.  When evaluating the relationship between two variables, it is important to determine how the variables are related.
  • Linear relationship:  two variables change together at a constant rate in the same direction, either both increasing or both decreasing.  A linear relationship is a trend in the data that can be modeled by a straight line.
  • Nonlinear monotonic relationship:  one where the variables consistently move in the same general direction (always increasing or always decreasing) but not at a constant, steady rate.  
  • No relationship:  knowing the value of one variable provides no information about the value of the other variable​

Scatter Plot

One way to get a general idea about whether or not two variables are related is to plot them on a “scatter plot”. 
Picture
Strong, positive linear relationship.
Picture
Weak, negative linear relationship.
Picture
Nonlinear, monotonic relationship.

Selecting the Correlation Coefficient

Examining the distribution and pattern of data points in scatter plots is crucial for choosing the appropriate correlation test. 

  • When a scatter plot shows data points that roughly follow a straight line pattern with relatively even distribution around that line, Pearson correlation is appropriate. Look for data that appears to cluster around an imaginary straight line, whether it slopes upward (positive correlation) or downward (negative correlation).  There are online calculators for the Pearson correlation.  Spreadsheet software can calculate the Pearson correlation using the formula =PEARSON(array1, array2).
 
  • If a scatter plot reveals a curved relationship, such as an exponential growth pattern, Spearman rank correlation would be more suitable since it can detect these monotonic but non-linear patterns. There are online calculators for the Spearman rank correlation.  Spreadsheet software can calculate the Spearman rank correlation.  First the data sets need to be individually ranked.  To rank values in Excel, use the formular =RANK.EQ(number, ref, [order]). Number is the value being ranked, ref is the range of values to rank against, and order specifies ascending (1).  Once each data set has been ranked, the use the formula =PEARSON(array1, array2) with the ranked data.

Interpreting Correlation Coefficients

Correlation Coefficients Communicate the Direction of a Relationship: 
  • If a correlation coefficient is a negative number, there is an indirect, negative relationship between the two variables.  A negative relationship means that as values on one variable increase (go up) the values on the other variable tend to decrease (go down) in a predictable manner. 
​
  • If a correlation coefficient is a positive number, there is a direct, positive relationship between the two variables. A positive relationship means that as one variable increases (or decreases) the values of the other variable tend to go in the same direction. If one increases, so does the other. If one decreases, so does the other in a predictable manner.

Correlation Coefficients Always Fall Between -1.00 and  +1.00:  
  • A correlation coefficient of -1.00 indicates that there is a perfect negative relationship between the two variables. This means that as values on one variable increase there is a perfectly predictable decrease in values on the other variable. In other words, as one variable goes up, the other goes in the opposite direction (it goes down).  
 
  • A correlation coefficient of +1.00 indicates that there is a perfect positive relationship between the two variables. This means that as values on one variable increase there is a perfectly predictable increase in values on the other variable. In other words, as one variable goes up so does the other. 
 
  • ​The closer a correlation coefficient approaches plus or minus 1.00, the stronger the relationship is and the more accurately one can predict what happens to one variable based on the knowledge of the other variable. 
 
  • Generally, values between 0.7-1.0 (or -0.7 to -1.0) indicate strong correlations, 0.3-0.7 (or -0.3 to -0.7) suggest moderate correlations, and 0.0-0.3 (or -0.3 to 0.0) represent weak correlations.  In biological systems, even moderate correlations can be biologically meaningful due to the complexity of living organisms and environmental factors. 
 
  • A correlation coefficient of 0.00 indicates that there is a zero correlation, or no relationship, between the two variables. In other words, as one variable changes (goes up or down), it is impossible to know anything about what happens to the other variable.
Picture

Making Statistical Inferences from Correlation Coefficients:

The correlation coefficient is both a descriptive and inferential statistic, depending on its use. It is a descriptive statistic when it simply summarizes the strength and direction of a linear relationship between two variables within a specific dataset. It is an inferential statistic when it is used to determine whether or not a correlation is simply a chance occurrence or if it really is true of the population. Just like other inferential statistical tests, the significance of a correlation tests two hypotheses:
Null Hypothesis (Ho):
"There is not a significant correlation between the two variables; any observed trend or relationship may be due to chance and sampling error."

For example:  
  • There is no significant correlation between the number of students missing class and the outside weather; (if there is a relationship) the relationship we see in the data may be due to chance and sampling error.
Alternative Hypothesis (H1):
"There is a significant correlation between the two variables; the observed trend or relationship is most likely not due to chance or sampling error."

For example:  
  • There is a significant correlation between the number of years spent smoking and the incidence of lung cancer; the positive relationship we see in the data is mostly likely not due to chance or sampling error.
Using the calculated Correlation Coefficient and the number of pairs of data being correlated, online calculators can be used to determine whether the correlation is statistically significant (in other words, “is generalizable” in the larger population).  The calculator returns a "p-value," which represented the probability of calculating the given correlation by chance (assuming there is actually no true relationship between the variables.  
  • A low p-value (typically below 0.05) suggests the correlation is statistically significant, meaning it is unlikely to have occurred randomly and there is a real relationship between the variables.  Reject the null hypothesis. ​
  • A  high p-value (typically above 0.05) suggests the correlation is statistically insignificant, meaning it could have occurred  randomly by chance and there may not be a real relationship between the variables.  Fail to reject the null hypothesis.

A weak correlation coefficient (such as r = 0.3) might still be statistically significant if you have a large sample size, while a stronger correlation (such as r = 0.7) might not reach statistical significance with a very small sample. This is why both the magnitude of the correlation coefficient and its associated p-value matter when interpreting results.

Remember that statistical significance doesn't automatically mean biological significance.  Findings may not have meaningful implications for understanding biological processes even if they meet the mathematical criteria for significance.
I give many of my IB Biology resources away, for the benefit of students and teachers around the world. 
If you've found the materials helpful, please consider sending a gift from our classroom wish list.


​Did I forget something?  Know of a mistake? Have a suggestion?  Let me know by emailing me here.

Before using any of the files available on this site,
​please familiarize yourself with the 
Creative Commons Attribution License. 
​​​It prohibits the use of any material on this site for commercial  purposes of any kind.  ​
​
Picture
  • IB Bio Syllabus
    • Unity and Diversity (A) >
      • A1 molecules >
        • A1.1: Water
        • A1.2: Nucleic Acids
      • A2 Cells >
        • A2.1: Origins of Cells
        • A2.2: Cell Structure
        • A2.3: Viruses
      • A3 Organisms >
        • A3.1: Diversity of Organisms
        • A3.2: Classification and Cladistics
      • A4 Ecosystems >
        • A4.1: Evolution and Speciation
        • A4.2: Conservation of Biodiversity
    • Form and Function (B) >
      • B1 Molecules >
        • B1.1: Carbohydrates and Lipids
        • B1.2: Proteins
      • B2 Cells >
        • B2.1 Membranes and Membrane Transport
        • B2.2 Organelles and Compartmentalization
        • B2.3 Cell Specialization
      • B3 Organisms >
        • B3.1 Gas Exchange
        • B3.2 Transport
        • B3.3 Muscle and Motility
      • B4 Ecosystems >
        • B4.1 Adaptation to Environment
        • B4.2 Ecological Niches
    • Interaction and Interdependence (C) >
      • C1 Molecules >
        • C1.1: Enzymes and Metabolism
        • C1.2: Cell Respiration
        • C1.3: Photosynthesis
      • C2 Cells >
        • C2.1: Chemical Signaling
        • C2.2: Neural Signaling
      • C3 Organisms >
        • C3.1: Integration of Body Systems
        • C3.2: Defense Against Disease
      • C4 Ecosystems >
        • C4.1 Populations and Communities
        • C4.2 Transfers of Energy and Matter
    • Continuity and Change (D) >
      • D1 Molecules >
        • D1.1: DNA Replication
        • D1.2: Protein Synthesis
        • D1.3: Mutation and Gene Editing
      • D2 Cells >
        • D2.1: Cell and Nuclear Division
        • D2.2: Gene Expression
        • D2.3: Water Potential
      • D3 Organisms >
        • D3.1: Reproduction
        • D3.2: Inheritance
        • D3.3: Homeostasis
      • D4 Ecosystems >
        • D4.1: Natural Selection
        • D4.2: Stability and Change
        • D4.3: Climate Change
  • IB Requirements
    • Internal Assessment >
      • Research Design
      • Analysis
      • Conclusion
      • Evaluation
    • External Assessment >
      • Exam Revision
    • Extended Essay
    • Reflective Project
    • Collaborative Sciences Project
    • Learner Profile
  • Skills for Biology
    • Tools >
      • Experimental Techniques >
        • Addressing Safety
        • Measuring Variables >
          • Measurement Uncertainties
          • Observations
          • Biological Drawings
        • Applying Techniques >
          • Microscopy
      • Technology >
        • Tech to Collect Data
        • Tech to Process Data
      • Mathematics >
        • General Math
        • Units and Symbols
        • Processing Uncertainties
        • Graphing >
          • Types of Graphs
          • How to Graph
          • Graph Error Bars
    • Inquiry Processes >
      • Exploring >
        • Research Questions
        • Hypotheses and Predictions
      • Designing >
        • Variables
        • Sampling
      • Control of Variables
      • Collecting Data >
        • Data Tables
      • Processing Data
      • Interpreting Results
      • Concluding
      • Evaluating >
        • Error Analysis
  • Statistics
    • Descriptive Statistics >
      • Skew and the Normal Distribution
      • Outliers
      • Measures of Central Tendancy
      • Measures of Dispersion
      • Correlation Coefficients
      • Coefficient of Determination
    • Inferential Statistics >
      • Standard Error
      • T-Test
      • ANOVA
      • Kruskal-Wallis
      • X2 Test of Independence
      • X2 Goodness of Fit
    • Glossary of Statistic Terms and Equations
  • SHS Course Info
    • Above & Beyond >
      • Biology Club
      • Pumpkin Carving
      • Scavenger Hunt
      • Science News
      • Wood Duck Project (legacy)
      • Invasive Crayfish Project (legacy)
    • Assessment >
      • Class Grading IB Bio I
      • Class Grading IB Bio II
      • Daily Quizzes (legacy)
      • Lab Practicals (legacy)
    • Class Photos
    • Recommendations
  • Contact
    • About >
      • Philosophy
      • Resume
      • Reflection
      • Favorite Quotes
      • AEF Blog
  • Expeditions
    • Bahamas (2009)
    • Trinidad (2010)
    • Trinidad (2011)
    • Ecuador (2012)
    • Trinidad (2013)
    • Peru (2014)
    • Bahamas (2015)
    • Peru (2016)
    • Costa Rica (2017)
    • Costa Rica (2018)
    • Arizona (2022)
    • Florida (2023)
    • Belize (2024)
    • Costa Rica (2025)
  • Summer Ecology Research
  • Teacher Resources