The Science of Online Reviews & Sales

Academic research shows that online reviews have measurable causal effects on business revenue. A one-star increase on Yelp is associated with a 5-9% revenue lift for independent businesses. An extra half-star causes restaurants to sell out 19 percentage points more often during peak hours. For UK trades and home-service businesses, these findings carry direct implications for local visibility, lead conversion, and reputation management on Google and Checkatrade.
Online reviews are no longer a supplement to a business's reputation. For most local trades and home-service providers in the UK, they are the primary mechanism by which that reputation is formed and tested by prospective customers. Beyond intuition and marketing benchmarks, a body of rigorous academic research measures the causal effect of reviews on revenue and purchasing decisions with statistical precision.
This article examines four real, verified studies, explains what each found and where its limits lie, and draws practical implications for trades businesses operating in UK local markets.
Why Academic Research Matters Here
Any marketing agency can claim that "reviews increase sales." What separates that claim from a useful conclusion is methodology: how were causal effects isolated, what biases were controlled for, and how generalisable are the findings?
The four studies presented here share one important methodological characteristic: none of them relies on simple correlation. Each uses a design that supports stronger causal claims than an opinion survey or a cross-sectional dataset alone. That rigour matters when you are deciding how to allocate time and budget in a competitive local market.
Study 1: The Revenue Effect of One Extra Star
Luca, M. (2011, revised 2016). "Reviews, Reputation, and Revenue: The Case of Yelp.com." Harvard Business School Working Paper, No. 12-016.
HBS Faculty Page | Full paper on SSRN
What He Did
Michael Luca at Harvard Business School merged Yelp review data with restaurant revenue records from the Washington State Department of Revenue. To solve the causality problem — good restaurants attract both more reviews and more customers, contaminating a simple correlation — he used a regression discontinuity design.
The mechanism: Yelp rounds each restaurant's average rating to the nearest half-star. A restaurant with a true average of 3.24 displays 3 stars; one with 3.26 displays 3.5 stars. That 0.02-point difference in the underlying rating reflects no actual change in quality, but it creates a visible discontinuity in the public-facing score. Luca exploits this threshold to identify the causal effect of the displayed rating on revenue, rather than simply observing that higher-rated establishments tend to be busier.
Key Finding
A one-star increase in a Yelp rating produces a 5-9% increase in revenue for independent restaurants.
The effect is driven entirely by independent businesses. Chain restaurants show no statistically significant effect, which suggests that reviews substitute for brand reputation when no established brand exists. This is the most directly relevant finding for UK independent tradespeople.
Limits to Keep in Mind
The study focuses on restaurants in the United States during the early years of Yelp. Transfer to other sectors and countries is plausible but not directly demonstrated. Effects in today's higher-saturation review environment — and on platforms such as Google rather than Yelp — may differ in magnitude, though the direction of effects is well-supported by the underlying mechanism.
Study 2: Negative Reviews Hurt More Than Positive Ones Help
Chevalier, J. and Mayzlin, D. (2006). "The Effect of Word of Mouth on Sales: Online Book Reviews." Journal of Marketing Research, 43(3), 345-354.
Journal of Marketing Research | NBER Working Paper 10148
What They Did
Chevalier and Mayzlin collected review and relative sales data for books on Amazon.com and BarnesandNoble.com at three points in time. They used a differences-in-differences approach: if a book receives a one-star review on one platform but not the other, its relative sales at that platform should fall. This design controls for factors that affect both platforms equally — actual product quality, national advertising — and isolates the specific effect of reviews on each site.
Key Findings
- An improvement in a book's average rating on one platform increases its relative sales share at that platform.
- The sales impact of one-star reviews is greater in magnitude than the impact of five-star reviews on relative sales — a finding consistent with well-established negativity bias in consumer psychology.
- Consumers read review text, not just the aggregate score: review length has a statistically significant effect on Amazon sales, suggesting people engage with the content itself, not only with the summary number.
Limits to Keep in Mind
Books are an experience good with high homogeneity across platforms. In home services — where the "product" differs substantially across providers and cannot be evaluated before purchase — effects may be larger because uncertainty about service quality before hiring is much higher than before purchasing a book.

Study 3: Half a Star More Fills Appointment Slots at Peak Times
Anderson, M. and Magruder, J. (2012). "Learning from the Crowd: Regression Discontinuity Estimates of the Effects of an Online Review Database." The Economic Journal, 122(563), 957-989.
Full PDF (UC Berkeley) | EconPapers record
What They Did
Anderson and Magruder analysed 148,000 Yelp reviews for 328 restaurants in the San Francisco area and matched that data to real table reservation availability during peak dining hours (Thursday, Friday, and Saturday evenings), queried 36 hours in advance. They applied the same regression discontinuity design as Luca, exploiting Yelp's half-star rounding threshold to create quasi-random variation in the displayed rating.
Key Findings
An extra half-star on Yelp reduces peak-hour table availability by 19 percentage points — meaning restaurants sell out more frequently during high-demand slots.
The effect amplifies significantly when alternative information is scarce:
- Restaurants without external accreditation (press rankings, recognised accolades): 27 percentage point increase in peak-hour sell-out rate.
- Restaurants with established external recognition: effect is statistically insignificant.
The jump from 3 to 3.5 stars raises the probability of selling out during peak hours from 13% to 34%. The jump from 3.5 to 4 stars adds another 19 percentage points on top.
Limits to Keep in Mind
The study is set in San Francisco, a market with unusually high Yelp user density. In UK markets with lower platform penetration — or where Google dominates over Yelp — effects may be smaller in magnitude but are unlikely to reverse in direction. The core mechanism (rating thresholds driving discrete behavioural shifts) is not platform-specific.
Study 4: A Meta-Analysis Across 26 Studies and 443 Elasticities
Floyd, K., Freling, R., Alhoqail, S., Cho, H.Y., and Freling, T. (2014). "How Online Product Reviews Affect Retail Sales: A Meta-analysis." Journal of Retailing, 90(2), 217-232.
ScienceDirect | DOI: 10.1016/j.jretai.2014.04.004
What They Did
Floyd et al. synthesised 26 prior empirical studies that together produced 443 sales elasticity measurements, using established meta-analytic methods to identify patterns that hold across different markets, products, and review platforms. This is the most generalisable source of the four because it does not depend on a single context or country.
Key Findings
| Dimension | Mean Elasticity | Interpretation |
|---|---|---|
| Review valence (star rating quality) | Es = 0.69 | High: the average score meaningfully moves sales |
| Review volume (number of reviews) | Es = 0.35 | Moderate: more reviews correlates with more sales |
| Third-party platforms vs. own website | Higher for third-party | External platforms are perceived as more credible |
| High-involvement services | Higher than low-involvement | Trades and home services fall firmly here |
Review valence has roughly double the effect on sales compared to review volume, but both are statistically significant and practically relevant across the studies surveyed.
Sales elasticities are consistently higher for independent platforms (Google, Checkatrade, Trustpilot) than for reviews hosted on a business's own website — because consumers perceive less opportunity for editorial manipulation on a third-party platform.
Limits to Keep in Mind
The 26 base studies were published primarily between 2005 and 2013 and focus mainly on e-commerce of tangible goods. Generalisation to high-trust local services such as boiler installations, kitchen renovations, or electrical rewires requires methodological caution. The direction of effects is well-supported; the exact magnitude should not be transferred without adjusting for sector and local UK market context.

Comparative Summary: The Four Studies
| Study | Authors | Year | Key Finding | Practical Implication for UK Trades |
|---|---|---|---|---|
| HBS Working Paper 12-016 | Luca | 2016 | +1 star = +5-9% revenue (independents) | Every rating point has direct economic value for independent tradespeople |
| Journal of Marketing Research | Chevalier & Mayzlin | 2006 | 1-star reviews outweigh 5-star reviews in sales impact | Managing negatives is as urgent as collecting positives |
| The Economic Journal | Anderson & Magruder | 2012 | +0.5 stars = +19 pp peak-time demand shift | Rating thresholds matter as much as the average |
| Journal of Retailing | Floyd et al. | 2014 | Valence Es = 0.69; Volume Es = 0.35 | Review quality has double the effect of review count |
What Research Says About Biases in the System
Understanding that reviews affect sales is not enough. The research also documents systematic biases that distort the information available to consumers and to business owners interpreting their own profiles.
Positivity Bias in Review Distribution
Chevalier and Mayzlin found that reviews tend to be positive on average. Consumers who leave reviews are disproportionately those who had an extreme experience — very good or very bad — not those who had a middling one. This produces a bimodal distribution, not a normal bell curve.
Implication: a business with many five-star reviews and few three- or four-star reviews may appear more polarising than it actually is. Sophisticated consumers adjust for this; many do not.
Consumers Do Not Process All Available Information
Luca found that consumers are more sensitive to rating changes that are more visible and respond more when a profile has more reviews, but they do not use all available information — for example, they do not adjust for the reviewer's network size or account age, even when that information is displayed. In practice, the average star rating and total review count are the two signals that carry the most decision weight.
The Effect Is Asymmetric Across Business Types
Both Luca and Anderson and Magruder document that the review effect is larger for businesses without pre-established brand reputation. For UK trades businesses, this is directly relevant: an independent electrician or local building firm is structurally dependent on its reviews in a way that a national franchise is not. Reviews fill the trust gap that brand recognition would otherwise occupy.

Practical Applications for UK Trades Businesses
The research does not speak directly to plumbers or builders in British local markets, but the underlying mechanisms apply with few adaptations.
1. Rating Thresholds Matter More Than Marginal Incremental Gains
Anderson and Magruder demonstrate that the jumps from 3 to 3.5 stars and from 3.5 to 4 stars produce large, discrete effects on customer demand. Moving from 4.1 to 4.3 may matter less than moving from 3.7 to 4.0. Focus energy on crossing the next visible threshold rather than optimising at the margins.
See how this plays into your overall local visibility strategy.
2. Reviews Do Not Just Attract Leads — They Convert Them
Lead-to-customer conversion is strongly correlated with perceived trust before the first call is made. Reviews are the primary trust signal for a service the consumer cannot evaluate until after the job is done. A strong review profile reduces decision friction at exactly the moment when a potential customer is comparing providers.
3. A One-Star Review Without a Response Is the Worst-Case Scenario
Chevalier and Mayzlin document that the negative sales impact of a one-star review exceeds the positive impact of a five-star review. A public business response moderates that effect by demonstrating professionalism and willingness to resolve problems. Silence amplifies the damage.
For a step-by-step guide to building and managing your review profile on Google and Checkatrade, see Google Reviews for Trades Businesses.
4. Automate Post-Job Review Requests
If the revenue effect of reviews is economically significant, it makes sense to systematise how you generate them. A consistent post-job follow-up sequence — SMS, email, or WhatsApp — removes the friction of asking manually after every completed appointment and ensures the request reaches the customer while the experience is still fresh.
For UK-specific guidance on the full marketing approach for plumbers, see Marketing for Plumbers: UK Complete Guide.
5. Prioritise Third-Party Platforms Over On-Site Testimonials
Floyd et al. document that third-party platforms carry higher sales elasticities than reviews on a business's own website. For UK trades businesses, Google Business Profile is the primary channel because it also determines visibility in local search results and Google Maps — the starting point for the majority of local service searches in the UK. Checkatrade and similar platforms serve as a secondary trust layer with sector-specific credibility.
Explore industry-specific strategies across all trades we serve, or browse the glossary to build fluency with the underlying concepts. The blog covers additional guides across visibility and conversion topics.
What This Research Does Not Say
Honesty about limits matters here:
- None of the four studies covers UK trades or construction businesses directly. The transfer relies on shared mechanisms — information asymmetry, trust signalling, social proof — not direct measurement of plumbers, heating engineers, or builders.
- The Yelp-based studies are US-specific and from a period when Yelp was dominant in local search. Google dominates local services in the UK today, and the distribution of effects across platforms may differ in magnitude.
- All four studies predate 2016. The review ecosystem has changed substantially: higher overall review volumes, AI-mediated discovery through Google AI Overviews and tools such as Perplexity, and a continued shift towards Google as the dominant local review platform in the UK.
- The exact magnitude of effects should not be applied to a specific business without accounting for local competitive context, service category, and current platform penetration in the relevant UK market.
What is generalisable is the direction of effects and the logic of the mechanisms: reviews reduce consumer uncertainty, substitute for brand reputation when none exists, and negative signals consistently outweigh positive ones in their influence on decisions made under uncertainty.
Conclusion
The academic literature on online reviews is not large compared to other marketing topics, but what exists is methodologically strong. The four studies examined here converge on a coherent conclusion: reviews have measurable causal effects on revenue and demand, with greater intensity for businesses that lack established brand recognition.
For an independent plumber, electrician, heating engineer, or builder operating in the UK, that translates into a clear strategic priority: active review management is not a community-relations task — it is a revenue lever with quantitative evidence behind it.
Explore how to act on this evidence through the visibility pillar, the conversion pillar, and the full blog.
We answer before we start
Q/01Does one extra star on Google actually affect a trades business's revenue?
Yes, and the causal evidence is strong. Michael Luca's Harvard Business School study (Working Paper 12-016, revised 2016) found that a one-star increase on Yelp produces a 5-9% revenue increase for independent businesses, using a regression discontinuity design that isolates causal impact rather than mere correlation. For UK trades businesses, the mechanism is equivalent: consumers use the aggregate rating as a cognitive shortcut when no other quality signal is readily available. The effect is strongest for businesses without established brand recognition — exactly the profile of most independent plumbers, electricians, heating engineers, and builders operating in local UK markets.
Q/02Do negative reviews hurt more than positive reviews help?
The data suggests yes. Chevalier and Mayzlin (2006) demonstrated through their analysis of Amazon.com and BarnesandNoble.com that the sales impact of one-star reviews is larger in magnitude than the impact of five-star reviews on relative sales. This finding is consistent with the psychology of negativity bias: humans weight unfavourable information more heavily than favourable information when making decisions under uncertainty. For a plumbing or renovation company operating in the UK, this means actively managing negative reviews is at least as important as accumulating positive ones. Ignoring a one-star review carries a higher cost than not responding to a five-star review.
Q/03Does review volume matter as much as the average star rating?
Both dimensions matter, but differently. The meta-analysis by Floyd et al. (2014) across 443 sales elasticities from 26 empirical studies found that the mean valence elasticity is 0.69 and the volume elasticity is 0.35. Review quality has roughly double the effect of review count, though both are statistically significant. Luca (2016) adds an important nuance: consumers respond more strongly to rating changes when a profile has more reviews, because they interpret volume as a reliability signal for the average. In practice: reach a credible minimum volume first (20-30 reviews), then focus on pushing the average rating higher.
Q/04Are reviews on third-party platforms more effective than testimonials on a business's own website?
According to the meta-analysis by Floyd et al. (2014), sales elasticities are significantly higher for independent third-party platforms than for reviews hosted on a business's own website. The reason is perceived independence: consumers know the business cannot edit or filter reviews on an external platform, which increases their credibility. For UK trades businesses, this reinforces Google Business Profile and Checkatrade as the primary reputation management channels, ahead of any testimonial section on a company website.
Q/05Do online reviews matter more when less information is available about a business?
Yes, and there is direct causal evidence. Anderson and Magruder (2012) in the Economic Journal found that the effect of an extra half-star on Yelp on a restaurant's peak-time availability was 19 percentage points on average, but rose to 27 percentage points for restaurants with no external accreditation. For establishments with established external recognition such as press rankings, the Yelp effect was statistically insignificant. The parallel for UK trades is direct: an independent plumber or local building firm relies on reviews far more than a nationally known franchise does. Reviews substitute for brand reputation when none exists.
Q/06What review platforms should UK trades businesses prioritise?
Google Business Profile is the highest-priority platform for UK trades because it drives both local search visibility and Maps rankings. Checkatrade, Rated People, and Which? Trusted Traders carry additional weight in the home-services sector because consumers associate them with vetting and accountability. The Floyd et al. (2014) meta-analysis confirms that third-party platforms consistently produce higher sales elasticities than a business's own website. In practice, a strategy that targets Google first and maintains a credible presence on at least one sector-specific platform covers both the discoverability and trust dimensions of the buyer decision process.

