A direct supervisor's formal written evaluation of an employee's performance, goal achievement, competency demonstration, and development trajectory over a defined review period.
Key Takeaways
A manager review is the supervisor's side of the performance conversation. The manager evaluates how the employee performed against goals, assesses competencies, and recommends next steps for development and compensation. It's the most common performance evaluation in existence. In nearly every organization worldwide, the direct manager writes the primary assessment. There's a good reason for this: the direct manager is presumably the person best positioned to evaluate daily work quality, goal progress, and professional behavior. They assign the work, set the expectations, and observe the results. But there's also a major caveat. Research from Mount and Scullen (2023), confirmed by Gallup's manager effectiveness studies, found that 60% of the variance in performance ratings is attributable to the unique rating tendencies of the manager, not the actual performance of the employee. Some managers rate generously. Others rate harshly. Some cluster everyone in the middle. This "idiosyncratic rater effect" means that the same employee, doing the same work, would receive meaningfully different ratings from different managers. That's why calibration, multi-source feedback, and manager training exist: to counterbalance the inherent subjectivity of a single-rater system.
Writing a strong manager review takes preparation, specificity, and balance between evaluation and development.
Pull from multiple sources: goal tracking data, project outcomes, one-on-one notes, peer feedback, customer feedback, and self-assessment (if available). Don't rely on memory alone. Memory is skewed toward recent events and emotionally charged moments. If you haven't kept notes throughout the review period, check email threads, Slack conversations, project management tools, and meeting notes for specific examples. Aim for at least 2-3 concrete data points per goal or competency you plan to evaluate.
Take each goal set at the start of the period and assess it separately. Document the target, the actual result, and the context. Did the employee hit the target? Exceed it? Fall short? Why? Be fair about external factors: a missed revenue target during a market downturn is different from a missed revenue target during a bull market. Rate each goal on the agreed-upon scale and write 2-3 sentences explaining the rating. These goal-level assessments form the foundation of the overall review.
For each competency (leadership, communication, technical expertise, etc.), cite a specific observed behavior and its impact. Bad example: "Strong communication skills." Good example: "Presented the Q3 pipeline analysis to the executive team with clear data visualization and actionable recommendations. The VP of Sales adopted two of the three proposed changes." Behavioral evidence makes ratings defensible and gives the employee something concrete to understand, whether they agree or not.
Every employee has both. Even top performers have areas for growth, and even struggling employees have strengths worth acknowledging. A review that only highlights problems demoralizes without motivating. A review that only highlights strengths misses the development opportunity. A good structure: lead with 2-3 key strengths and their business impact, then transition to 1-2 development areas with specific, actionable suggestions. Phrase development areas as forward-looking opportunities, not character judgments.
This template provides a consistent structure for writing manager evaluations across the organization.
| Section | What to Include | Length |
|---|---|---|
| Summary Statement | 2-3 sentence overview of the employee's performance during the review period | 50-75 words |
| Goal Achievement | Each goal with target, actual result, context, and individual rating | 100-150 words per goal |
| Competency Ratings | 4-6 competencies with behavioral examples and ratings | 50-100 words per competency |
| Key Strengths | 2-3 standout strengths with specific impact examples | 100-150 words total |
| Development Areas | 1-2 areas for growth with actionable next steps | 100-150 words total |
| Development Plan | Specific training, assignments, or experiences recommended | 50-100 words |
| Overall Rating | Summary rating with justification | 50-75 words |
| Next Period Goals | 3-5 goals with metrics and deadlines for the upcoming period | 50-75 words per goal |
Understanding these biases is the first step toward writing more accurate evaluations.
| Bias | What It Looks Like | How to Counter It |
|---|---|---|
| Recency bias | Rating based primarily on the last 2-3 months | Review notes from the entire period before writing; keep quarterly summaries |
| Halo effect | One positive trait inflates all ratings | Evaluate each competency independently; use behavioral anchors |
| Horn effect | One negative trait depresses all ratings | Assess each area on its own merits; separate the incident from the pattern |
| Leniency bias | Rating everyone highly to avoid conflict | Use forced distribution or calibration sessions; train on difficult conversations |
| Strictness bias | Rating everyone harshly compared to peers | Compare ratings against organizational benchmarks and peer manager distributions |
| Similarity bias | Higher ratings for people who think, look, or communicate like you | Use objective criteria; seek feedback from diverse sources |
| Central tendency | Clustering everyone at "meets expectations" | Use even-numbered scales (4-point); require written justification for middle ratings |
| Idiosyncratic rater effect | Ratings reflect your personal tendencies more than employee performance | Calibrate with peer managers; examine your own rating patterns annually |
The delivery is as important as the content. A well-written review delivered poorly undermines the entire process.
Hold the review in a private setting. No interruptions. No open office space. No quick hallway conversations. Block 45-60 minutes for annual reviews, 30-45 for quarterly reviews. Both parties should be fully present. Silence phones and close laptops (except for referencing the review document). The physical and psychological environment signals how seriously you take the employee's development.
Don't just read the review document out loud. Start by sharing the overall assessment: "Overall, I'm rating you as exceeding expectations this year. Let me walk you through the details." Then discuss goals, competencies, strengths, and development areas. Pause after each section and invite the employee's response. Listen actively. Ask open-ended questions: "How does that align with your own assessment?" and "What additional context should I know?" Save the development plan for the last third of the conversation so you end on a forward-looking, growth-oriented note.
Employees will sometimes disagree with your assessment. That's healthy. Hear their perspective fully before responding. If they provide evidence you hadn't considered, be willing to adjust your rating. If you maintain your assessment after hearing their input, explain your reasoning clearly and offer the option to submit a written response for their file. Never become defensive or dismissive. The goal is mutual understanding. You don't need to agree, but you both need to leave with clarity about where things stand.
Calibration ensures that performance ratings are consistent across teams, departments, and managers throughout the organization.
In a calibration session, managers from the same level or department present their ratings and discuss borderline cases. An employee rated "exceeds expectations" by one manager is compared against employees rated the same by other managers. Are the performance levels actually comparable? If Manager A's "exceeds" bar is lower than Manager B's, adjustments are made. HR or a senior leader typically facilitates the session and helps resolve disagreements. Calibration is especially important for compensation decisions to ensure pay equity across teams.
Hold calibration sessions after managers write their evaluations but before the review meetings with employees. Use specific examples and data to justify ratings. Limit sessions to 8-10 managers to keep discussions productive. Focus calibration time on employees at the extremes (top performers and underperformers) and those on the border between rating categories. Document the outcomes and any rating adjustments made during the session. Communicate to managers that calibration adjustments aren't personal criticisms of their judgment. They're a quality assurance step.
Most managers have never been formally trained on how to evaluate performance. That's a fixable problem.
Data on how manager reviews work and where they fall short.