A line of best fit (regression line) on a scatter plot shows the overall trend in your data. If the data points cluster around an upward-sloping line, there is a positive correlation. Downward slope means negative correlation. No clear line means no linear relationship. The R-value tells you how strong the correlation is.
Scatter plots with regression lines are the standard way to visualize relationships between two variables — from "does more ad spend lead to more revenue?" to "does study time predict test scores?" Here is how to create one, read it, and avoid common mistakes.
| Pattern | What It Means | R-value Range | Example |
|---|---|---|---|
| Points cluster tightly around upward line | Strong positive correlation | 0.7 to 1.0 | Height vs weight, experience vs salary |
| Points loosely follow upward trend | Moderate positive correlation | 0.4 to 0.7 | Ad spend vs leads, temperature vs ice cream sales |
| Points scattered with slight upward trend | Weak positive correlation | 0.1 to 0.4 | Sleep hours vs productivity (many confounding factors) |
| Points cluster around downward line | Strong negative correlation | -0.7 to -1.0 | Price vs quantity demanded, altitude vs temperature |
| Points form a random cloud | No linear correlation | -0.1 to 0.1 | Shoe size vs IQ, birth month vs income |
| Points follow a curve, not a line | Nonlinear relationship | R may be low | Diminishing returns (ad spend has decreasing impact at high levels) |
The R-value quantifies the strength and direction of the linear relationship:
| Method | When to Use | Assumptions |
|---|---|---|
| Linear regression (line of best fit) | Two continuous variables with an approximately linear relationship | Data is roughly linear, residuals are normally distributed |
| Polynomial regression | Data follows a curve, not a straight line | Careful with overfitting — higher degree does not always mean better |
| Logarithmic trend | Rapid initial change that levels off | Diminishing returns patterns |
| Exponential trend | Growth that accelerates over time | Population growth, compound interest |
| No regression line | Data has no clear pattern | Sometimes the answer is "no relationship exists" |
A scatter plot with a regression line only tests for linear relationships. Many real-world relationships are nonlinear, seasonal, or multivariate (affected by multiple variables simultaneously). For complex analysis, use statistical software (R, Python with statsmodels, SPSS). A simple scatter plot is a starting point for exploring relationships — not the final word on whether a relationship exists or how strong it is.
Create a scatter plot with regression line right now — paste your data and see the correlation.
Open Scatter Plot Maker