What is the formula for Standard Error of Measurement?

SEM = SD × √(1 − r), where SD is the test's standard deviation and r is the reliability coefficient. For example, if SD = 15 and r = 0.84, SEM = 15 × √(0.16) = 15 × 0.4 = 6.0.

Which intelligence test is used for adults in the Philippine PLE?

The WAIS-IV (Wechsler Adult Intelligence Scale, 4th edition) is the standard adult intelligence test, normed for ages 16–90. The WISC-V is for children (6–16), and the WPPSI-IV is for preschoolers (2½–7½).

Assessment Updated 2025 · 18 min read

Psychological Assessment Mock Board Exam Questions for the PLE

Q: What topics are covered in the Psychological Assessment subject of the PLE?

The PLE Assessment subject covers: test construction and item analysis, types of reliability (test-retest, split-half, Cronbach's alpha, inter-rater), types of validity (content, criterion, construct), norms and derived scores, intelligence tests (Wechsler scales, Stanford-Binet), objective personality assessment (MMPI-2, 16PF), and projective techniques (Rorschach, TAT, HTP).

Q: Is Psychological Assessment the hardest subject in the PLE?

Most PLE candidates find Psychological Assessment the most challenging due to its combination of statistical formulas, psychometric theory, and the need to memorize specific test names, authors, age ranges, and appropriate clinical applications.

Psychological Assessment is the subject that most frequently determines the difference between passing and failing the Philippine Psychometrician Licensure Examination. While Abnormal Psychology tests your familiarity with the DSM-5-TR and Developmental Psychology tests your knowledge of major theorists, Assessment demands something more difficult: a simultaneous grasp of abstract psychometric theory, concrete formulas, and the practical application of dozens of specific psychological instruments.

This guide provides 20+ psychological assessment mock board exam questions covering every major topic area — from test construction and reliability to intelligence assessment and projective techniques — with detailed explanations that explain both the correct answer and why the distractors are wrong. Use this alongside PsychBoard PH's interactive practice for the most comprehensive Assessment preparation available.

📊 High-Priority Topics: Reliability types and formulas, validity types and when each applies, Standard Error of Measurement calculation, Wechsler scale names and age ranges, MMPI-2 validity and clinical scales, and the key projective techniques (Rorschach, TAT, HTP). These account for ~60% of Assessment PLE items.

Topic Overview: What Psychological Assessment Covers

The PLE Assessment paper covers six major content areas:

1. Test Construction and Item Analysis: Item writing, difficulty index (p-value), discrimination index, pilot testing, standardization procedures, norm development
2. Reliability: Test-retest, alternate forms, split-half (Spearman-Brown), internal consistency (Cronbach's alpha), inter-rater reliability; factors affecting reliability; Standard Error of Measurement
3. Validity: Content validity, criterion-related (concurrent and predictive), construct validity (convergent and discriminant); face validity; threats to validity
4. Derived Scores and Norms: Raw scores, percentile ranks, z-scores, T-scores, stanines, IQ scores, developmental norms, standard scores
5. Intelligence Assessment: Wechsler scales (WPPSI-IV, WISC-V, WAIS-IV); Stanford-Binet 5; theories (Spearman's g, Cattell-Horn Gc/Gf, Gardner, Sternberg); IQ distribution
6. Personality Assessment: Objective (MMPI-2, 16PF, NEO-PI-R); Projective (Rorschach, TAT, HTP, DAP, SCT)

Section 1: Test Construction and Item Analysis

1. An item with a p-value of 0.90 is considered:

A. Highly discriminating
B. Moderately difficult
C. Very easy
D. Poorly written

✓ The p-value (item difficulty index) = proportion who answered correctly. p = 0.90 means 90% got it right — very easy. Optimal discrimination occurs near p = 0.50. Items with p > 0.80 or p < 0.20 have low discriminating power.

2. The discrimination index compares:

A. Male vs. female test performance
B. High-scoring group vs. low-scoring group performance on each item
C. Test performance at Time 1 vs. Time 2
D. Observed score vs. true score

✓ The discrimination index = (proportion correct in upper 27%) − (proportion correct in lower 27%). Values range from −1 to +1. Items with d ≥ 0.30 are considered good discriminators; negative d values indicate a poorly written item.

3. Developing norms requires administering the test to a large, representative:

A. Clinical sample
B. Convenience sample
C. Standardization sample
D. Expert panel

✓ A standardization sample must be large, representative, and demographically proportional to the target population. Norms derived from a non-representative sample will be biased — leading to inaccurate interpretation of individual scores.

Section 2: Reliability

4. A researcher splits a 50-item test into two 25-item halves, correlates them (r = 0.70), and applies the Spearman-Brown formula. This estimates:

A. Test-retest reliability
B. Split-half reliability for the full test
C. Alternate-forms reliability
D. Inter-rater reliability

✓ Split-half reliability divides the test into two halves and correlates them. The Spearman-Brown prophecy formula corrects the correlation upward because splitting reduces test length (and therefore reliability). Full-test reliability = 2r / (1 + r) = 2(0.70) / (1.70) ≈ 0.82.

5. Cronbach's alpha is most appropriate when test items are:

A. Dichotomous (pass/fail)
B. Rated by two or more observers
C. Scored on a range (e.g., 1–5 scale) or polytomous
D. Administered to two separate groups

✓ Cronbach's alpha generalizes Kuder-Richardson (KR-20) to polytomously scored items (e.g., Likert scales). For dichotomous items, KR-20 is technically more precise. Alpha ranges from 0 to 1.0; ≥ 0.70 = acceptable, ≥ 0.90 = clinical standard.

6. A test has SD = 10 and reliability = 0.75. What is the Standard Error of Measurement?

A. 2.5
B. 4.0
C. 5.0
D. 7.5

✓ SEM = SD × √(1 − r) = 10 × √(0.25) = 10 × 0.50 = 5.0. SEM represents the typical amount of measurement error in a score. A score of 70 with SEM = 5 means the true score likely falls within ±5 points (65–75) at the 68% confidence level.

7. Two psychologists independently score the same set of Rorschach protocols and obtain r = 0.88. This is evidence of:

A. Test-retest reliability
B. Internal consistency
C. Inter-rater reliability
D. Parallel-forms reliability

✓ Inter-rater reliability measures consistency between scorers/raters. It is particularly important for subjective scoring methods like projective tests (Rorschach, TAT) and behavioral observation systems where scorer judgment is involved.

Section 3: Validity

8. A new math achievement test is reviewed by a panel of mathematics educators who evaluate whether the items adequately represent the full curriculum. This represents:

A. Content validity
B. Concurrent validity
C. Construct validity
D. Predictive validity

✓ Content validity is evaluated through systematic expert review of item content representativeness. Unlike other validity types, it is not quantified by correlation — it is a qualitative judgment about domain coverage.

9. A test of depression correlates highly with a well-established depression measure (r = 0.82) but shows little correlation with an anxiety measure (r = 0.15). This provides evidence of:

A. Content validity only
B. Predictive validity
C. Both convergent and discriminant validity (construct validity)
D. Face validity

✓ High correlation with a theoretically related measure = convergent validity. Low correlation with a theoretically unrelated measure = discriminant validity. Both together are components of construct validity as described in Campbell and Fiske's multitrait-multimethod approach.

10. A cognitive ability test given during hiring predicts supervisory performance ratings 12 months later (r = 0.45). This is evidence of:

A. Content validity
B. Concurrent validity
C. Predictive validity
D. Construct validity

✓ Predictive validity — the criterion (supervisory ratings) is collected in the future (12 months later). Concurrent validity uses a criterion measured at the same time. Both are subtypes of criterion-related validity.

Section 4: Intelligence Assessment

11. The WAIS-IV yields which composite scores?

A. Verbal IQ, Performance IQ, Full Scale IQ
B. Verbal Comprehension, Perceptual Reasoning, Working Memory, Processing Speed, and Full Scale IQ
C. Fluid Intelligence, Crystallized Intelligence, Full Scale IQ
D. General Intelligence (g), Specific Abilities (s), Full Scale IQ

✓ The WAIS-IV replaced the old VIQ/PIQ structure with four index scores: Verbal Comprehension (VCI), Perceptual Reasoning (PRI), Working Memory (WMI), Processing Speed (PSI), plus Full Scale IQ (FSIQ). Know these — they are heavily tested.

12. According to Cattell-Horn theory, the ability to solve novel problems using reasoning — not previously learned information — is called:

A. Fluid intelligence (Gf)
B. Crystallized intelligence (Gc)
C. General intelligence (g)
D. Visual-spatial intelligence

✓ Fluid intelligence (Gf) involves reasoning, problem-solving, and adapting to new situations — independent of acquired knowledge. Crystallized intelligence (Gc) reflects accumulated knowledge and vocabulary. Gf peaks in young adulthood; Gc continues growing through middle age.

13. The original ratio IQ formula (MA/CA × 100) was proposed by:

A. Alfred Binet
B. Lewis Terman
C. William Stern
D. David Wechsler

✓ William Stern proposed the ratio IQ formula. Alfred Binet developed the concept of mental age. Lewis Terman adapted Binet's test for American use (Stanford-Binet) and popularized the IQ term. Wechsler developed the deviation IQ (mean = 100, SD = 15).

Section 5: Personality Assessment

14. The MMPI-2 validity scale that detects random or inconsistent responding is:

A. L (Lie) scale
B. K (Defensiveness) scale
C. VRIN (Variable Response Inconsistency) scale
D. F (Infrequency) scale

✓ VRIN detects random responding by checking pairs of similar/opposite items — inconsistent patterns suggest the examinee answered randomly. The F scale detects infrequent/rare responses (may indicate exaggeration of symptoms or random responding). L detects claiming unlikely virtues.

15. In the TAT (Thematic Apperception Test), the primary theoretical assumption is:

A. Examinees describe what they consciously perceive
B. Examinees project unconscious needs, motives, and conflicts onto the characters
C. Story content reflects verbal intelligence
D. Ambiguous stimuli reveal neurological processing

✓ The projective hypothesis — when faced with ambiguous stimuli, people project their own needs, motives, and conflicts onto the ambiguous material. The TAT was developed by Henry Murray and Christiana Morgan to assess achievement, affiliation, and power needs.

16. The Rorschach response "Two bears fighting over a fish" scored as W (whole), M (human movement), FC (form-color) would most accurately be characterized as:

A. A rare, pathological response
B. A perseverative response
C. An organized, elaborated whole response with good form quality
D. A confabulatory response

✓ A W response using the entire blot with coherent organization and logical connection to form is considered a sign of integrated, organized thinking — a positive indicator in Exner's system. Movement (M) responses and good form quality (F+) are associated with cognitive complexity and reality testing.

17. The Big Five personality factors assessed by the NEO-PI-R are:

A. Dominance, Submission, Introversion, Stability, Openness
B. Neuroticism, Extraversion, Openness, Agreeableness, Conscientiousness
C. Need for Achievement, Affiliation, Power, Stability, Extraversion
D. Emotional Stability, Social Boldness, Sensitivity, Vigilance, Tension

✓ The Big Five (OCEAN): Openness to experience, Conscientiousness, Extraversion, Agreeableness, Neuroticism. The NEO-PI-R (Costa & McCrae) is the most widely used Big Five measure. The 16PF (Cattell) measures 16 primary factors.

Key Formulas to Memorize

Standard Error of Measurement: SEM = SD × √(1 − r)
Ratio IQ (historical): IQ = (MA / CA) × 100
z-score: z = (X − M) / SD
T-score: T = (z × 10) + 50
Spearman-Brown correction: r_full = 2r / (1 + r)
Reliability from SEM: r = 1 − (SEM² / SD²)

Key Assessment Tests to Know

Test	Author	Type	Age Range
WPPSI-IV	Wechsler	Intelligence	2½–7½
WISC-V	Wechsler	Intelligence	6–16
WAIS-IV	Wechsler	Intelligence	16–90
MMPI-2	Hathaway & McKinley	Objective Personality	Adults (18+)
NEO-PI-R	Costa & McCrae	Objective Personality (Big Five)	Adults
Rorschach	Hermann Rorschach	Projective	5+
TAT	Murray & Morgan	Projective	10+

Practice More Assessment Questions

Generate unlimited Psychological Assessment practice questions on PsychBoard PH — covering all subtopics with detailed explanations and timed mock exam mode.

Practice Assessment Free →

Free plan available. No signup required.

Frequently Asked Questions

What topics are covered in the Psychological Assessment subject of the PLE?

Test construction and item analysis, reliability (test-retest, split-half, Cronbach's alpha, inter-rater), validity (content, criterion, construct), derived scores (z-scores, T-scores, SEM), intelligence tests (Wechsler scales, Stanford-Binet), objective personality (MMPI-2, NEO-PI-R), and projective techniques (Rorschach, TAT, HTP).

What is the Standard Error of Measurement formula?

SEM = SD × √(1 − r), where SD is the test's standard deviation and r is the reliability coefficient. Example: SD = 15, r = 0.84 → SEM = 15 × √0.16 = 15 × 0.4 = 6.0.

Which Wechsler scale is used for adults?

The WAIS-IV (Wechsler Adult Intelligence Scale, 4th edition), normed for ages 16–90. WISC-V covers 6–16; WPPSI-IV covers 2½–7½.

Is Psychological Assessment the hardest PLE subject?

Most candidates find it the most challenging due to its combination of statistical formulas, psychometric theory, and memorization of specific instruments. It is the subject with the widest gap between what candidates expect to study and what the exam actually tests. Strong preparation specifically for Assessment is the highest-leverage action for borderline candidates.