First-Order Supplier Rating: How to Evaluate New Vendors Before You Reorder
5-point first-order supplier rating checklist: score delivery, quality, documents, communication, and packaging to decide whether to reorder.
First-Order Supplier Rating: How to Evaluate New Vendors Before You Reorder
The first purchase order is the interview. Most Shopify importers receive their first shipment, do a rough eyeball check, and decide to reorder based on gut feel. That informal evaluation is expensive — one missed quality signal or document accuracy problem compounds into a customs hold, a return wave, or a dispute on your third order. A structured supplier rating after your first delivery changes that.
Why Your First Order Is Your Most Valuable Supplier Data Point
A new supplier relationship exists in a grace period. They're trying harder, their team knows this is an evaluation order, and their factory floor supervisor has been told to prioritize your lot. Yet the first order still reveals five things that no contract negotiation, sales call, or factory audit will show you:
- Actual lead time discipline — Does their ETA hold when there's no buffer in their production schedule?
- Real packing quality — Are inner carton dimensions and SKU labels what they said they would be?
- Document accuracy under pressure — Does the commercial invoice match the packing list, exactly, the first time?
- Defect rate on fresh production — Not the sample the sales rep chose — the actual pull from the production run.
- Communication responsiveness when something goes wrong — Because something always does, even on a good order.
A formal supplier rating on order one creates a baseline. Every subsequent order is measured against it.
The 5-Point First-Order Supplier Rating Checklist
Score each criterion from 0 to 20. Total score determines your reorder decision.
-
On-Time Delivery (0–20 pts)
Arrived within 3 days of confirmed ETA: 20 pts
4–7 days late: 10 pts
8–14 days late: 5 pts
15+ days late or missed shipping window: 0 pts -
Product Quality vs. Approved Sample (0–20 pts)
Defect rate <1%: 20 pts
Defect rate 1–3%: 12 pts
Defect rate 3–5%: 6 pts
Defect rate >5% or product materially different from sample: 0 pts -
Document Accuracy (0–20 pts)
Commercial invoice and packing list match perfectly on first submission: 20 pts
Minor errors corrected before customs clearance: 10 pts
Major errors caused customs hold or required re-submission: 0 pts -
Communication Responsiveness (0–20 pts)
Consistent <24h responses to all queries: 20 pts
24–48h average response time: 12 pts
48–72h average response time: 6 pts
72h+ or unresponsive to urgent issues: 0 pts -
Packaging and Labeling Compliance (0–20 pts)
All SKU labels, inner boxes, and master carton marks correct: 20 pts
Minor labeling issues corrected on-site: 10 pts
Labels required reprinting or products were mislabeled in transit: 0 pts
| Total Score | Supplier Rating | Reorder Decision |
|---|---|---|
| 80–100 | Approved Vendor | Reorder with confidence. Scale when ready. |
| 60–79 | Conditional | Reorder once more, with written remediation plan on lowest-scoring criteria. |
| 40–59 | Probation | One trial order at small quantity. Do not scale until score improves to 70+. |
| Below 40 | Disqualified | Seek alternative supplier. Document failure points for future vendor screening. |
How to Communicate a Supplier Rating Without Damaging the Relationship
Delivering a supplier rating score directly can feel transactional — but done well, it builds more trust than a handshake deal ever does. Four principles to follow:
- Frame it as a shared review, not a report card. Send the scorecard with a note that says "Here is how we evaluate every first order — we'd like to share your results and discuss areas we want to improve together before the next shipment."
- Lead with what went well. Acknowledge the high-scoring criteria before raising issues. This keeps the tone collaborative rather than adversarial.
- Be specific, not accusatory. "Commercial invoice had a value discrepancy of $300 on Line 4" is actionable. "Your paperwork was wrong" is not.
- Set a response timeline. Ask the supplier to respond within 5 business days with their remediation plan for any criteria scoring below 12 points. This signals professionalism and creates accountability on both sides.
First-Order Benchmarks by Supplier Origin
What counts as a "normal" first-order performance varies significantly by origin market. Use these benchmarks when interpreting your supplier rating scores:
| Supplier Origin | Typical Lead Time Variance | Avg First-Order Defect Rate | Document Accuracy (First Pass) |
|---|---|---|---|
| China (factory, 50–500 unit MOQ) | ±5–10 days | 1.5–3% | 80–90% |
| India (SME, 30–200 unit MOQ) | ±7–14 days | 2–4% | 70–85% |
| Mexico (nearshore, 20–100 unit MOQ) | ±2–5 days | 0.5–1.5% | 90–98% |
These benchmarks do not excuse poor performance — they contextualize it. A China factory delivering with a 5-day variance and a 2% defect rate is performing at the high end of typical. Hold them to it. A Mexico nearshore supplier with a 10-day variance and 3% defects is underperforming against the benchmark and should score accordingly.
When to Rate vs. When to Wait
Three situations where your first-order supplier rating should be paused, not scored:
- Force majeure shipping delays. Port congestion, carrier strikes, or customs backlogs outside the supplier's control should not penalize their on-time delivery score. Confirm the cause with your freight forwarder before scoring.
- Single-carton damage in sea freight. A single damaged master carton from a 50-carton sea shipment may be a carrier issue, not a packing defect. Score based on the majority of the shipment, not the exception.
- Custom product with first-production-run tolerance issues. If you ordered a heavily customized product, first-run dimensional tolerance variations are expected. Score against your agreed spec, not an unrealistic perfection standard.
How Forthsource Helps Before Your First Order
The best time to use a supplier rating system is before you even place your first order. Forthsource surfaces historical import trade data — actual shipment records from U.S. customs filings — so you can see whether your prospective supplier has a track record of consistent delivery volumes, document accuracy, and active U.S. buyer relationships. A supplier whose historical shipments show three different buyers and consistent monthly volumes is a lower first-order risk than one with a single buyer relationship and erratic shipment frequency. Use Forthsource data to pre-screen before you commit capital, then apply your first-order supplier rating after delivery to confirm or challenge the signal.
Frequently Asked Questions
What is a good first-order supplier rating score?
A score of 80 or above on a first order is excellent — most established suppliers take 2–3 orders to hit that level. A score of 60–79 is acceptable for a promising new supplier, provided they respond to feedback constructively. Below 60 on a first order is a yellow flag: it means even when they're trying hardest, performance is inconsistent. Below 40 is a red flag and warrants seeking an alternative vendor.
Should I share my supplier rating score with the vendor?
Yes — sharing the score is the point. The supplier rating has no value as a private document. Sharing it creates a feedback loop that drives improvement. Suppliers who take the rating seriously use it to address their weakest criteria before the next order. Suppliers who dismiss it or argue the scoring are giving you equally valuable information: they're not performance-oriented partners.
Do I need different supplier rating criteria for different product categories?
The five core criteria (delivery, quality, documents, communication, packaging) apply universally. What changes by category is the weighting and thresholds. For consumables and beauty products, quality vs. sample deserves heavier weight (consider 30 pts). For custom-label apparel, packaging compliance (correct hang tags, inner bags) is more critical. For electronics, document accuracy becomes even more important due to FCC and customs requirements. Adjust thresholds by category, but maintain the same evaluation framework across all suppliers.
Further reading
- Forthsource — Supplier Verification & Cost Intelligence
- Forthsuite Supply Chain OS for Shopify
- Commercial Invoice vs Packing List: Side-by-Side Guide for Shopify Importers
- Global Sourcing Software: Expert Rankings & Reviews (2026)
- Commercial Invoice and Packing List: Complete Guide for Shopify Importers (2026)