Standardized assessments that measure cognitive abilities, personality traits, and behavioral tendencies to predict job performance and fit.
Key Takeaways
Psychometric testing is the application of standardized, scientific measurement to psychological attributes: how people think (cognitive ability), how they tend to behave (personality), and how they're likely to respond to work situations (behavioral tendencies). The term comes from the Greek words "psyche" (mind) and "metron" (measure). In HR, psychometric tests are used at two key moments: during hiring (to predict which candidates will perform well) and during talent development (to identify strengths, development areas, and team dynamics). The science behind psychometrics is over a century old. Alfred Binet developed the first intelligence test in 1905. The U.S. Army used cognitive tests to screen 1.7 million recruits during World War I. By the 1980s, personality assessments based on the Big Five model (openness, conscientiousness, extraversion, agreeableness, neuroticism) became standard in organizational psychology. Today, the field is mature. Schmidt and Hunter's landmark 1998 meta-analysis (updated by Schmidt in 2016) established that cognitive ability tests are the single best predictor of job performance across all job types and levels, with a predictive validity of 0.54. That's better than interviews, years of experience, education, and references combined.
Not every pre-hire test is psychometric. A psychometric test must be standardized (everyone takes the same test under the same conditions), normed (results are compared against a reference population), reliable (the test produces consistent results over time), and valid (the test measures what it claims to measure, and that measurement predicts job performance). A casual personality quiz on a website, a manager's gut-feel interview questions, or a homegrown skills test without validation data are not psychometric. They may be useful, but they don't meet the scientific standard. The distinction matters for legal defensibility. If a hiring test is challenged under Title VII or the Equality Act, the employer must demonstrate that the test is valid. Properly developed psychometric tests come with technical manuals containing this evidence. Homegrown tests typically don't.
Most personality-based psychometric tests in HR are built on the Big Five (also called OCEAN or the Five-Factor Model). The five dimensions are: Openness to Experience (curiosity, creativity, tolerance for ambiguity), Conscientiousness (organization, dependability, goal orientation), Extraversion (sociability, assertiveness, positive emotions), Agreeableness (cooperation, trust, empathy), and Neuroticism (emotional stability, stress tolerance, anxiety). These five dimensions are the most replicated finding in personality psychology. They appear consistently across cultures, languages, and age groups. Of the five, Conscientiousness is the strongest predictor of job performance across all role types (correlation of 0.22 to 0.31). Emotional Stability (low Neuroticism) is the second strongest. Extraversion predicts performance specifically in sales and management roles.
Psychometric tests fall into two broad categories: ability tests (measuring maximum performance) and personality tests (measuring typical behavior). Each serves a different purpose in the hiring and development process.
| Test Category | What It Measures | Format | Time | Predictive Validity |
|---|---|---|---|---|
| Verbal Reasoning | Comprehension, logical analysis of written information | Multiple-choice, passage-based | 15-25 min | 0.45-0.54 |
| Numerical Reasoning | Data interpretation, mathematical logic, statistical reasoning | Multiple-choice, chart/table-based | 15-25 min | 0.45-0.54 |
| Abstract/Logical Reasoning | Pattern recognition, non-verbal problem-solving | Multiple-choice, figure series | 15-25 min | 0.48-0.54 |
| Personality Inventory | Big Five traits, work style preferences | Likert scale or forced-choice statements | 20-35 min | 0.22-0.31 |
| Situational Judgment Test (SJT) | Decision-making in realistic work scenarios | Multiple-choice scenario responses | 20-40 min | 0.26-0.34 |
| Emotional Intelligence | Self-awareness, empathy, social skills | Self-report or ability-based tasks | 20-30 min | 0.20-0.30 |
| Motivation Questionnaire | What drives and demotivates the individual | Ranking or rating statements | 15-20 min | 0.15-0.25 |
Behind every psychometric test is a measurement framework that converts human behavior and cognition into quantifiable, comparable data.
A legitimate psychometric test goes through years of development. First, psychologists define the construct being measured (for example, "conscientiousness" or "numerical reasoning ability"). Then they write hundreds of test items, pilot them with large sample groups (typically 500 to 2,000 people), and analyze the statistical properties of each item using Item Response Theory or Classical Test Theory. Items that don't discriminate between high and low scorers, or that show bias toward specific demographic groups, are removed. The surviving items form the final test, which is then normed against a reference population (the "norm group"). This development process can take 2 to 5 years and cost $500,000 or more for a major assessment publisher. That's why reputable psychometric tests come from established publishers (SHL, Hogan, Pearson TalentLens) rather than startup assessment tools built in a few months.
Raw test scores are meaningless without context. Scoring 28 out of 40 on a numerical reasoning test doesn't tell you whether that's good or bad. Psychometric tests convert raw scores into percentile ranks or standard scores by comparing them against the norm group. A percentile rank of 75 means the test-taker scored higher than 75% of the norm group. Some tests use sten scores (a 1-to-10 scale where 5.5 is average) or stanine scores (a 1-to-9 scale). The quality of the norm group matters enormously. A test normed on university students won't produce valid scores when applied to experienced professionals. Always check that the norm group matches your candidate population in terms of role level, industry, and geography.
Reliability is measured using statistics like Cronbach's alpha (internal consistency, should be above 0.70) and test-retest reliability (whether the same person gets similar scores when retaking the test weeks later, should be above 0.70). A test with low reliability is like a bathroom scale that gives you a different weight every time you step on it. It might be measuring something, but you can't trust it. Ask vendors for reliability coefficients before purchasing. Any test with Cronbach's alpha below 0.70 shouldn't be used for high-stakes hiring decisions.
Psychometric tests add the most value when they're integrated into a structured hiring process rather than used in isolation.
Schmidt and Hunter's research shows that the best prediction comes from combining a cognitive ability test (validity 0.54) with a structured interview (validity 0.51). Together, they achieve a combined validity of approximately 0.63. This means the combination predicts about 40% of the variance in job performance, compared to about 29% for either method alone. The practical implication: use psychometric tests to screen candidates before interviews, then use structured interviews to assess competencies that tests can't measure well (communication style, cultural alignment, motivation). Don't use one or the other. Use both.
Psychometric test results should be one input in the hiring decision, not the only input. A common weighting model is 30 to 40% psychometric assessment, 30 to 40% structured interview performance, and 20 to 30% skills demonstration (work samples, portfolio, case study). The exact weighting should reflect the role's requirements. For roles where cognitive ability is the primary predictor (analyst, strategist, researcher), weight tests higher. For roles where interpersonal skills matter most (sales, counseling, management), weight interviews and simulations higher.
Best practice is to share at least summary-level feedback with candidates, whether or not they're hired. This serves two purposes: it improves candidate experience (candidates feel respected and valued), and it positions the assessment as developmental rather than purely gatekeeping. Some organizations share a brief report highlighting the candidate's top strengths and potential development areas. This is especially common in executive hiring, where candidates expect a more personalized experience. Never share raw scores or clinical interpretations with candidates unless you're a trained psychologist.
The psychometric assessment market is dominated by a handful of established publishers with decades of validation research. Here are the most widely used tools in HR.
The Myers-Briggs Type Indicator (MBTI) is the most widely recognized personality assessment in the world, with about 2 million administrations per year. However, the psychological community has significant concerns about its use in hiring decisions. The MBTI has lower test-retest reliability than other personality tools. Studies show that up to 50% of people get a different type when retaking the test 5 weeks later (Pittenger, 2005). It lacks predictive validity for job performance. The Myers-Briggs Company itself advises against using the MBTI for hiring purposes. It's designed for self-awareness and team development, not selection. For hiring, use assessments built on the Big Five model (Hogan, SHL OPQ, NEO-PI-R), which have decades of validity research in employment settings.
| Test/Publisher | What It Measures | Format | Best For | Approximate Cost |
|---|---|---|---|---|
| SHL Verify (SHL/CEB) | Cognitive abilities (verbal, numerical, inductive) | Timed, adaptive, online | High-volume screening for corporate roles | $30-50 per candidate |
| Hogan Assessments | Personality (bright side, dark side, values) | Untimed, self-report | Leadership assessment and executive selection | $50-150 per candidate |
| MBTI (The Myers-Briggs Company) | Personality type preferences (16 types) | Untimed, self-report | Team development and coaching (not hiring) | $50-100 per report |
| Watson-Glaser (Pearson) | Critical thinking ability | Timed, multiple-choice | Law, consulting, and analytical roles | $25-40 per candidate |
| Predictive Index (PI) | Behavioral drives and cognitive ability | Untimed behavioral + timed cognitive | Mid-market companies, all role levels | $5,000-15,000/year |
| DISC (Various publishers) | Dominance, Influence, Steadiness, Conscientiousness | Untimed, self-report | Team dynamics and communication styles | $20-50 per report |
| Caliper Profile | Personality traits mapped to job performance | Untimed, self-report | Sales, management, and customer-facing roles | $250-350 per candidate |
Psychometric testing in employment is regulated by anti-discrimination laws in most jurisdictions. Employers must ensure tests are fair, job-related, and don't disproportionately exclude protected groups.
Cognitive ability tests, while highly predictive of job performance, show average group differences by race (Black test-takers score about 1 standard deviation below white test-takers on average, a finding consistent across hundreds of studies). This creates potential adverse impact under the EEOC's four-fifths rule. Employers using cognitive tests must demonstrate that the test is job-related and consistent with business necessity. Strategies to reduce adverse impact while maintaining validity include using banded scoring (treating scores within a range as equivalent), combining cognitive tests with personality assessments (which show minimal group differences), using work-sample tests that are more face-valid and show smaller group differences, and applying tests only after a structured interview has already narrowed the pool.
Psychometric test results are personal data. Under the EU's General Data Protection Regulation (GDPR), employers must have a lawful basis for processing this data (legitimate interest or consent), inform candidates about how their data will be used, stored, and shared, allow candidates to access their data and request deletion, and implement appropriate security measures. Similar regulations exist in other jurisdictions: POPIA in South Africa, PDPA in Singapore, and various state-level privacy laws in the US (CCPA in California). Keep test results in secure systems, limit access to authorized personnel, and establish a data retention policy (typically delete results 6 to 12 months after the hiring decision).
Candidates should know what type of test they'll take, why it's being used, how results will factor into the hiring decision, and who will see their results. This isn't just ethical. It's legally required in many jurisdictions. Transparency also reduces candidate anxiety and improves test performance, which produces more accurate results. Provide clear instructions, practice questions (for ability tests), and a point of contact for candidates who need accommodations.
Beyond hiring, psychometric tests are widely used for leadership development, team building, career planning, and succession management.
Hogan Assessments' three-tool suite (HPI for bright-side personality, HDS for dark-side derailers, and MVPI for values) is the most widely used psychometric system for leadership development globally. The dark-side assessment is particularly valuable because it identifies behaviors that emerge under stress or complacency (being overly cautious, attention-seeking, micromanaging) that can derail otherwise talented leaders. When combined with 360 feedback, psychometric profiles provide a data-driven foundation for executive coaching conversations.
Understanding the psychometric profile of an entire team helps managers recognize potential blind spots and conflict points. A team full of high-Conscientiousness introverts might be great at detailed execution but slow to make decisions or take risks. A team of high-Extraversion, low-Agreeableness individuals might generate lots of ideas but struggle with collaboration. Tools like Belbin Team Roles, the Predictive Index, and the Team Management Profile help map team dynamics and identify gaps. This data informs hiring decisions for new team members (what profile complements the existing team?) and development priorities for the current team.
The psychometric testing industry is evolving in response to technology shifts, regulatory changes, and changing workforce expectations.