The Rorschach test (also known as the Rorschach inkblot test, the Rorschach technique, or simply the inkblot test) is a psychological test in which subjects’ perceptions of inkblots are recorded and then analyzed using psychological interpretation, complex scientifically derived algorithms, or both. Some psychologists use this test to examine a person’s personality characteristics and emotional functioning. It has been employed to detect underlying thought disorder, especially in cases where patients are reluctant to describe their thinking processes openly. The test is named after its creator, Swiss psychologist Hermann Rorschach.

 


The first of the ten cards in the Rorschach test, with the occurrence of the most statistically frequent details indicated. The images themselves are only one component of the test, whose focus is the analysis of the perception of the images.

In the 1960s, the Rorschach was the most widely used projective test. In a national survey in the U.S., the Rorschach was ranked eighth among psychological tests used in outpatient mental health facilities. It is the second most widely used test by members of the Society for Personality Assessment, and it is requested by psychiatrists in 25% of forensic assessment cases, usually in a battery of tests that often include the MMPI-2 and the MCMI-III. In surveys, the use of Rorschach ranges from a low of 20% by correctional psychologists to a high of 80% by clinical psychologists engaged in assessment services, and 80% of psychology graduate programs surveyed teach it.

Although the Exner Scoring System (developed since the 1960s) claims to have addressed and often refuted many criticisms of the original testing system with an extensive body of research, some researchers continue to raise questions. The areas of dispute include the objectivity of testers, inter-rater reliability, the verifiability and general validity of the test, bias of the test’s pathology scales towards greater numbers of responses, the limited number of psychological conditions which it accurately diagnoses, the inability to replicate the test’s norms, its use in court-ordered evaluations, and the proliferation of the ten inkblot images, potentially invalidating the test for those who have been exposed to them.

 

History

 


Hermann Rorschach created the Rorschach inkblot test in 1921.

 

Using interpretation of “ambiguous designs” to assess an individual’s personality is an idea that goes back to Leonardo da Vinci and Botticelli. Interpretation of inkblots was central to a game from the late 19th century. Rorschach’s, however, was the first systematic approach of this kind.

It has been suggested that Rorschach’s use of inkblots may have been inspired by German doctor Justinus Kerner who, in 1857, had published a popular book of poems, each of which was inspired by an accidental inkblot. French psychologist Alfred Binet had also experimented with inkblots as a creativity test, and, after the turn of the century, psychological experiments where inkblots were utilized multiplied, with aims such as studying imagination and consciousness.

After studying 300 mental patients and 100 control subjects, in 1921 Rorschach wrote his book Psychodiagnostik, which was to form the basis of the inkblot test (after experimenting with several hundred inkblots, he selected a set of ten for their diagnostic value), but he died the following year. Although he had served as Vice President of the Swiss Psychoanalytic Society, Rorschach had difficulty in publishing the book and it attracted little attention when it first appeared.

In 1927, the newly-founded Hans Huber publishing house purchased Rorschach’s book Psychodiagnostik from the inventory of Ernst Bircher. Huber has remained the publisher of the test and related book, with Rorschach a registered trademark of Swiss publisher Verlag Hans Huber, Hogrefe AG. The work has been described as “a densely written piece couched in dry, scientific terminology”.

After Rorschach’s death, the original test scoring system was improved by Samuel Beck, Bruno Klopfer and others. John E. Exner summarized some of these later developments in the comprehensive system, at the same time trying to make the scoring more statistically rigorous. Some systems are based on the psychoanalytic concept of object relations. The Exner system remains very popular in the United States, while in Europe other methods sometimes dominate, such as that described in the textbook by Evald Bohm, which is closer to the original Rorschach system and rooted more deeply in the original psychoanalysis principles.

 

Method

The tester and subject typically sit next to each other at a table, with the tester slightly behind the subject. This is to facilitate a “relaxed but controlled atmosphere”. There are ten official inkblots, each printed on a separate white card, approximately 18×24 cm in size. Each of the blots has near perfect bilateral symmetry. Five inkblots are of black ink, two are of black and red ink and three are multicolored, on a white background. After the test subject has seen and responded to all of the inkblots (free association phase), the tester then presents them again one at a time in a set sequence for the subject to study: the subject is asked to note where he sees what he originally saw and what makes it look like that (inquiry phase). The subject is usually asked to hold the cards and may rotate them. Whether the cards are rotated, and other related factors such as whether permission to rotate them is asked, may expose personality traits and normally contributes to the assessment. As the subject is examining the inkblots, the psychologist writes down everything the subject says or does, no matter how trivial. Analysis of responses is recorded by the test administrator using a tabulation and scoring sheet and, if required, a separate location chart.

The general goal of the test is to provide data about cognition and personality variables such as motivations, response tendencies, cognitive operations, affectivity, and personal/interpersonal perceptions. The underlying assumption is that an individual will class external stimuli based on person-specific perceptual sets, and including needs, base motives, conflicts, and that this clustering process is representative of the process used in real-life situations. Methods of interpretation differ. Rorschach scoring systems have been described as a system of pegs on which to hang one’s knowledge of personality. The most widely used method in the United States is based on the work of Exner.

Administration of the test to a group of subjects, by means of projected images, has also occasionally been performed, but mainly for research rather than diagnostic purposes.

Test administration is not to be confused with test interpretation:

“The interpretation of a Rorschach record is a complex process. It requires a wealth of knowledge concerning personality dynamics generally as well as considerable experience with the Rorschach method specifically. Proficiency as a Rorschach administrator can be gained within a few months. However, even those who are able and qualified to become Rorschach interpreters usually remain in a “learning stage” for a number of years.”

 

Features or categories

The interpretation of the Rorschach test is not based primarily on the contents of the response, i.e., what the individual sees in the inkblot (the content). In fact, the contents of the response are only a comparatively small portion of a broader cluster of variables that are used to interpret the Rorschach data: for instance, information is provided by the time taken before providing a response for a card can be significant (taking a long time can indicate “shock” on the card). as well as by any comments the subject may make in addition to providing a direct response.

In particular, information about determinants (the aspects of the inkblots that triggered the response, such as form and color) and location (which details of the inkblots triggered the response) is often considered more important than content, although there is contrasting evidence. “Popularity” and “originality” of responses can also be considered as basic dimensions in the analysis.

 

Content

Content is classified in terms of “human”, “nature”, “animal”, “abstract”, etc., as well as for statistical popularity (or, conversely, originality).

More than any other feature in the test, content response can be controlled consciously by the subject, and may be elicited by very disparate factors, which makes it difficult to use content alone to draw any conclusions about the subject’s personality; with certain individuals, content responses may potentially be interpreted directly, and some information can at times be obtained by analyzing thematic trends in the whole set of content responses (which is only feasible when several responses are available), but in general content cannot be analyzed outside of the context of the entire test record.

 

Location

The basis for the response is usually the whole inkblot, a detail (either a commonly or an uncommonly selected one), or the negative space around or within the inkblot.

 

Determinants

Systems for Rorschach scoring generally include a concept of “determinants”: these are the factors that contribute to establish the similarity between the inkblot and the subject’s content response about it, and they can represent certain basic experiential-perceptual attitudes, showing aspects of the way a subject perceives the world. Rorschach’s original work used only form, color and movement; currently, another major determinant considered is shading, which was inadvertently introduced by poor printing of the inkblots (which originally featured uniform saturation), and subsequently recognized as significant by Rorschach himself.

Form is the most common determinant, and is related to intellectual processes; color responses often provide direct insight into emotional life. Shading and movement have been considered more ambiguously, both in definition and interpretation: Rorschach originally disregarded shading (which was originally not even present on the cards, being a result of the print process), and he considered movement as only actual experiencing of motion, while others have widened the scope of this determinant, taking it to mean that the subject sees something “going on”.

More than one determinant can contribute to the formation of the subject’s percept, and fusion of two determinants is taken into account, while also assessing which of the two constituted the primary contributor (e.g. “form-color” implies a more refined control of impulse than “color-form”). It is, indeed, from the relation and balance among determinants that personality can be most readily inferred.

 

Exner scoring system

The Exner scoring system, also known as the Rorschach Comprehensive System (RCS), is the standard method for interpreting the Rorschach test. It was developed in the 1960s by Dr. John E. Exner, as a more rigorous system of analysis. It has been extensively validated and shows high inter-rater reliability. In 1969, Exner published The Rorschach Systems, a concise description of what would be later called “the Exner system”. He later published a study in multiple volumes called The Rorschach: A Comprehensive system, the most accepted full description of his system.

Creation of the new system was prompted by the realization that at least five related, but ultimately different methods were in common use at the time, with a sizeable minority of examiners not employing any recognized method at all, basing instead their judgment on subjective assessment, or arbitrarily mixing characteristics of the various standardized systems.

The key components of the Exner system are the clusterization of Rorschach variables and a sequential search strategy to determine the order in which to analyze them, framed in the context of standardized administration, objective, reliable coding and a representative normative database. The system places a lot of emphasis on a cognitive triad of information processing, related to how the subject processes input data, cognitive mediation, referring to the way information is transformed and identified, and ideation.

In the system, responses are scored with reference to their level of vagueness or synthesis of multiple images in the blot, the location of the response, which of a variety of determinants is used to produce the response (i.e., what makes the inkblot look like what it is said to resemble), the form quality of the response (to what extent a response is faithful to how the actual inkblot looks), the contents of the response (what the respondent actually sees in the blot), the degree of mental organizing activity that is involved in producing the response, and any illogical, incongruous, or incoherent aspects of responses. It has been reported that popular responses on the first card include bat, badge and coat of arms.

Using the scores for these categories, the examiner then performs a series of calculations producing a structural summary of the test data. The results of the structural summary are interpreted using existing research data on personality characteristics that have been demonstrated to be associated with different kinds of responses.

With the Rorschach plates (the ten inkblots), the area of each blot which is distinguished by the client is noted and coded – typically as “commonly selected” or “uncommonly selected”. There were many different methods for coding the areas of the blots. Exner settled upon the area coding system promoted by S. J. Beck (1944 and 1961). This system was in turn based upon Klopfer’s (1942) work.

As pertains to response form, a concept of “form quality” was present from the earliest of Rorschach’s works, as a subjective judgment of how well the form of the subject’s response matched the inkblots (Rorschach would give a higher form score to more “original” yet good form responses), and this concept was followed by other methods, especially in Europe; in contrast, the Exner system solely defines “good form” as a matter of word occurrence frequency, reducing it to a measure of the subject’s distance to the population average.

 

Cultural differences

Comparing North American Exner normative data with data from European and South American subjects showed marked differences in some features, some of which impact important variables, while others (such as the average number of responses) coincide. For instance, texture response is typically zero in European subjects (if interpreted as a need for closeness, in accordance with the system, a European would seem to express it only when it reaches the level of a craving for closeness), and there are fewer “good form” responses, to the point where schizophrenia may be suspected if data were correlated to the North American norms. Form is also often the only determinant expressed by European subjects; while color is less frequent than in American subjects, color-form responses are comparatively frequent in opposition to form-color responses; since the latter tend to be interpreted as indicators of a defensive attitude in processing affect, this difference could stem from a higher value attributed to spontaneous expression of emotions.

The differences in form quality are attributable to purely cultural aspects: different cultures will exhibit different “common” objects (French subjects often identify a chameleon in card VIII, which is normally classed as an “unusual” response, as opposed to other animals like cats and dogs; in Scandinavia, “Christmas elves” (nisser) is a popular response for card II, and “musical instrument” on card VI is popular for Japanese people), and different languages will exhibit semantic differences in naming the same object (the figure of card IV is often called a troll by Scandinavians and an ogre by French people). Many of Exner’s “popular” responses (those given by at least one third of the North American sample used) seem to be universally popular, as shown by samples in Europe, Japan and South America, while specifically card IX’s “human” response, the crab or spider in card X and one of either the butterfly or the bat in card I appear to be characteristic of North America.

Form quality, popular content responses and locations are the only coded variables in the Exner systems that are based on frequency of occurrence, and thus immediately subject to cultural influences; therefore, cultural-dependent interpretation of test data may not necessarily need to extend beyond these components.

The cited language differences mean that it’s imperative for the test to be administered in the subject’s native language or a very well mastered second language, and, conversely, the examiner should master the language used in the test. Test responses should also not be translated into another language prior to analysis except possibly by a clinician mastering both languages. For example, a bow tie is a frequent response for the center detail of card III, but since the equivalent term in French translates to “butterfly tie”, an examiner not appreciating this language nuance may code the response differently from what is expected.

 

Neurology

Research using card III have found that ‘‘unique responses’’ are found in people with larger amygdalas. The researchers note, “Since previous reports have indicated that unique responses were observed at higher frequency in the artistic population than in the non-artistic normal population, this positive correlation suggests that amygdalar enlargement in the normal population might be related to creative mental activity.”

 

The ten inkblots

Below are the ten inkblots of the Rorschach test printed in Rorschach’s Rorschach Test – Psychodiagnostic Plates, together with the most frequent responses for either the whole image or the most prominent details according to various authors.

 

Card Popular responses Comments

 


Beck: bat, butterfly, moth
Piotrowski: bat (53%), butterfly (29%)
Dana (France): butterfly (39%)

When seeing card I, subjects often inquire on how they should proceed, and questions on what they are allowed to do with the card (e.g. turning it) are not very significant. Being the first card, it can provide clues about how subjects tackle a new and stressful task. It is not, however, a card that is usually difficult for the subject to handle, having readily available popular responses.

 


Beck: two humans
Piotrowski: four-legged animal (34%, gray parts)
Dana (France): animal: dog, elephant, bear (50%, gray)

The red details of card II are often seen as blood, and are the most distinctive features. Responses to them can provide indications about how a subject is likely to manage feelings of anger or physical harm. This card can induce a variety of sexual responses.

 


Beck: two humans (gray)
Piotrowski: human figures (72%, gray)
Dana (France): human (76%, gray)

Card III is typically perceived to contain two humans involved in some interaction, and may provide information about how the subject relates with other people (specifically, response latency may reveal struggling social interactions).

 


Beck: animal hide, skin, rug
Piotrowski: animal skin, skin rug (41%)
Dana (France): animal skin (46%)

Card IV is notable for its dark color and its shading (posing difficulties for depressed subjects), and is generally perceived as a big and sometimes threatening figure; compounded with the common impression of the subject being in an inferior position (“looking up”) to it, this serves to elicit a sense of authority. The human or animal content seen in the card is almost invariably classified as male rather than female, and the qualities expressed by the subject may indicate attitudes toward men and authority. Because of this Card IV is often called “The Father Card”.

 


Beck: bat, butterfly, moth
Piotrowski: butterfly (48%), bat (40%)
Dana (France): butterfly (48%), bat (46%)

Card V is an easily elaborated card that is not usually perceived as threatening, and typically instigates a “change of pace” in the test, after the previous more challenging cards. Containing few features that generate concerns or complicate the elaboration, it is the easiest blot to generate a good quality response about.

 


Beck: animal hide, skin, rug
Piotrowski: animal skin, skin rug (41%)
Dana (France): animal skin (46%)

Texture is the dominant characteristic of card VI, which often elicits association related to interpersonal closeness; it is specifically a “sex card”, its likely sexual percepts being reported more frequently than in any other card, even though other cards have a greater variety of commonly seen sexual contents.

 


Beck: human heads or faces (top)
Piotrowski: heads of women or children (27%, top)
Dana (France): human head (46%, top)

Card VII can be associated with femininity (the human figures commonly seeing in it being described as women or children), and function as a “mother card”, where difficulties in responding may be related to concerns with the female figures in the subject’s life. The center detail is relatively often (though not popularly) identified as a vagina, which make this card also relate to feminine sexuality in particular.

 


Beck: animal: not cat or dog (pink)
Piotrowski: four-legged animal (94%, pink)
Dana (France): four-legged animal (93%, pink)

People often express relief about card VIII, which lets them relax and respond effectively. Similar to card V, it represents a “change of pace”; however, the card introduces new elaboration difficulties, being complex and the first multi-colored card in the set. Therefore, people who find processing complex situations or emotional stimuli distressing or difficult may be uncomfortable with this card.

 


Beck: human (orange)
Piotrowski: none
Dana (France): none

Characteristic of card IX is indistinct form and diffuse, muted chromatic features, creating a general vagueness. There is only one popular response, and it is the least frequent of all cards. Having difficulty with processing this card may indicate trouble dealing with unstructured data, but aside from this there are few particular “pulls” typical of this card.

 


Beck: crab, lobster, spider (blue)
Piotrowski: crab, spider (37%, blue),
rabbit head (31%, light green),
caterpillars, worms, snakes (28%, deep green)

Card X is structurally similar to card VIII, but its uncertainty and complexity are reminiscent of card IX: people who find it difficult to deal with many concurrent stimuli may not particularly like this otherwise pleasant card. Being the last card, it may provide an opportunity for the subject to “sign out” by indicating what they feel their situation is like, or what they desire to know.

 

Prevalence

United States

The Rorschach test is used almost exclusively by psychologists. In a survey done in the year 2000, 20% of correctional psychologists used the Rorschach while 80% used the MMPI. Forensic psychologists use the Rorschach 36% of the time. In custody cases, 23% of psychologists use the Rorschach to examine a child. Another survey found that 124 out of 161 (77%) of clinical psychologists engaging in assessment services utilize the Rorschach, and 80% of psychology graduate programs teach its use. Another study found that its use by clinical psychologists was only 43%, while it was used less than 24% of the time by school psychologists.

 

Controversy

Some skeptics consider the Rorschach inkblot test pseudoscience, as several studies suggested that conclusions reached by test administrators since the 1950s were akin to cold reading. In the 1959 edition of Mental Measurement Yearbook, Lee Cronbach (former President of the Psychometric Society and American Psychological Association) is quoted in a review: “The test has repeatedly failed as a prediction of practical criteria. There is nothing in the literature to encourage reliance on Rorschach interpretations.” In addition, major reviewer Raymond J. McCall writes (p. 154): “Though tens of thousands of Rorschach tests have been administered by hundreds of trained professionals since that time (of a previous review), and while many relationships to personality dynamics and behavior have been hypothesized, the vast majority of these relationships have never been validated empirically, despite the appearance of more than 2,000 publications about the test.” A moratorium on its use was called for in 1999.

A 2003 report by Wood and colleagues had more mixed views: “More than 50 years of research have confirmed Lee J. Cronbach’s (1970) final verdict: that some Rorschach scores, though falling woefully short of the claims made by proponents, nevertheless possess “validity greater than chance” (p. 636). “Its value as a measure of thought disorder in schizophrenia research is well accepted. It is also used regularly in research on dependency, and, less often, in studies on hostility and anxiety. Furthermore, substantial evidence justifies the use of the Rorschach as a clinical measure of intelligence and thought disorder.”

 

Test materials

The basic premise of the test is that objective meaning can be extracted from responses to blots of ink which are supposedly meaningless. Supporters of the Rorschach inkblot test believe that the subject’s response to an ambiguous and meaningless stimulus can provide insight into their thought processes, but it is not clear how this occurs. Also, recent research shows that the blots are not entirely meaningless, and that a patient typically responds to meaningful as well as ambiguous aspects of the blots. Reber (1985) describes the blots as merely “.. the vehicle for the interaction ..” between client and therapist, concluding: “.. the usefulness of the Rorschach will depend upon the sensitivity, empathy and insightfulness of the tester totally independently of the Rorschach itself. An intense dialogue about the wallpaper or the rug would do as well provided that both parties believe.”

 

Illusory and invisible correlations

In the 1960s, research by psychologists Loren and Jean Chapman showed that at least some of the apparent validity of the Rorschach was due to an illusion. At that time, the five signs most often interpreted as diagnostic of homosexuality were 1) buttocks and anuses; 2) feminine clothing; 3) male or female sex organs; 4) human figures without male or female features; and 5) human figures with both male and female features. The Chapmans surveyed 32 experienced testers about their use of the Rorschach to diagnose homosexuality. At this time homosexuality was regarded as a psychopathology, and the Rorschach was the most popular projective test. The testers reported that homosexual men had shown the five signs more frequently than heterosexuals. Despite these beliefs, analysis of the results showed that heterosexual men are just as likely to report these signs, so they are totally ineffective for identifying homosexuals. The five signs did, however, match the guesses students made about which imagery would be associated with homosexuality.

The Chapmans investigated the source of the testers’ false confidence. In one experiment, students read through a stack of cards, each with a Rorschach blot, a sign and a pair of “conditions” (which might include homosexuality). The information on the cards was fictional, although subjects were told it came from case studies of real patients. The students reported that the five invalid signs were associated with homosexuality, even though the cards had been constructed so there was no association at all. The Chapmans repeated this experiment with another set of cards, in which the association was negative; the five signs were never reported by homosexuals. The students still reported seeing a strong positive correlation. These experiments showed that the testers’ prejudices could result in them “seeing” non-existent relationships in the data. The Chapmans called this phenomenon “illusory correlation” and it has since been demonstrated in many other contexts.

A related phenomenon called “invisible correlation” applies when people fail to see a strong association between two events because it does not match their expectations. This was also found in clinicians’ interpretations of the Rorschach. Homosexual men are more likely to see a monster on Card IV or a part-animal, part-human figure in Card V. Almost all of the experienced clinicians in the Chapmans’ survey missed these valid signs. The Chapmans ran an experiment with fake Rorschach responses in which these valid signs were always associated with homosexuality. The subjects missed these perfect associations and instead reported that invalid signs, such as buttocks or feminine clothing, were better indicators.

In 1992, the psychologist Stuart Sutherland argued that these artificial experiments are easier than the real-world use of the Rorschach, and hence they probably underestimated the errors that testers were susceptible to. He described the continuing popularity of the Rorschach after the Chapmans’ research as a “glaring example of irrationality among psychologists”.

 

Tester projection

Some critics argue that the testing psychologist must also project onto the patterns. A possible example sometimes attributed to the psychologist’s subjective judgement is that responses are coded (among many other things), for “Form Quality”: in essence, whether the subject’s response fits with how the blot actually looks. Superficially this might be considered a subjective judgment, depending on how the examiner has internalized the categories involved. But with the Exner system of scoring, much of the subjectivity is eliminated or reduced by use of frequency tables that indicate how often a particular response is given by the population in general. Another example is that the response “bra” was considered a “sex” response by male psychologists, but a “clothing” response by females. In Exner’s system, however, such a response is always coded as “clothing” unless there is a clear sexual reference in the response.

Third parties could be used to avoid this problem, but the Rorschach’s inter-rater reliability has been questioned. That is, in some studies the scores obtained by two independent scorers do not match with great consistency. This conclusion was challenged in studies using large samples reported in 2002.

 

Validity

When interpreted as a projective test, results are poorly verifiable. The Exner system of scoring (also known as the “Comprehensive System”) is meant to address this, and has all but displaced many earlier (and less consistent) scoring systems. It makes heavy use of what factor (shading, color, outline, etc.) of the inkblot leads to each of the tested person’s comments. Disagreements about test validity remain: while the Exner proposed a rigorous scoring system, latitude remained in the actual interpretation, and the clinician’s write-up of the test record is still partly subjective. Reber (1985) comments “.. there is essentially no evidence whatsoever that the test has even a shred of validity.”

Nevertheless, there is substantial research indicating the utility of the measure for a few scores. Several scores correlate well with general intelligence. Interestingly, one such scale is R, the total number of responses; this reveals the questionable side-effect that more intelligent people tend to be elevated on many pathology scales, since many scales do not correct for high R: if a subject gives twice as many responses overall, it is more likely that some of these will seem “pathological”. Also correlated with intelligence are the scales for Organizational Activity, Complexity, Form Quality, and Human Figure responses. The same source reports that validity has also been shown for detecting such conditions as schizophrenia and other psychotic disorders; thought disorders; and personality disorders (including borderline personality disorder). There is some evidence that the Deviant Verbalizations scale relates to bipolar disorder. The authors conclude that “Otherwise, the Comprehensive System doesn’t appear to bear a consistent relationship to psychological disorders or symptoms, personality characteristics, potential for violence, or such health problems as cancer”. (Cancer is mentioned because a small minority of Rorschach enthusiasts have claimed the test can predict cancer.)

 

Reliability

It is also thought that the test’s reliability can depend substantially on details of the testing procedure, such as where the tester and subject are seated, any introductory words, verbal and nonverbal responses to subjects’ questions or comments, and how responses are recorded. Exner has published detailed instructions, but Wood et al. cites many court cases where these had not been followed. Similarly, the procedures for coding responses are fairly well specified but extremely time-consuming leaving them very subject to the author’s style and the publisher to the quality of the instructions (such as was noted with one of Bohm’s textbooks in the 1950s) as well as clinic workers (which would include examiners) being encouraged to cut corners

US Courts have challenged the Rorschach as well. Jones v Apfel (1997) stated (quoting from Attorney’s Textbook of Medicine) that Rorschach “results do not meet the requirements of standardization, reliability, or validity of clinical diagnostic tests, and interpretation thus is often controversial”. In State ex rel H.H. (1999) where under cross examination Dr. Bogacki stated under oath “many psychologists do not believe much in the validity or effectiveness of the Rorschach test” and US v Battle (2001) ruled that the Rorschach “does not have an objective scoring system.”

 

Population norms

Another controversial aspect of the test is its statistical norms. Exner’s system was thought to possess normative scores for various populations. But, beginning in the mid-1990s others began to try to replicate or update these norms and failed. In particular, discrepancies seemed to focus on indices measuring narcissism, disordered thinking, and discomfort in close relationships. Lillenfeld and colleagues, who are critical of the Rorschach, have stated that this proves that the Rorschach tends to “overpathologise normals”. Although Rorschach proponents, such as Hibbard, suggest that high rates of pathology detected by the Rorschach accurately reflect increasing psychopathology in society, the Rorschach also identifies half of all test-takers as possessing “distorted thinking”, a false positive rate unexplained by current research.

The accusation of “over-pathologising” has also been considered by Meyer et al. (2007). They presented an international collaborative study of 4704 Rorschach protocols, obtained in 21 different samples, across 17 different countries, with only 2% showing significant elevations on the index of perceptual and thinking disorder, 12% elevated on indices of depression and hyper-vigilance and 13% elevated on persistent stress overload—all in line with expected frequencies among nonpatient populations.

 

Applications

The test is also controversial because of its common use in court-ordered evaluations. This controversy stems, in part, from the limitations of the Rorschach, with no additional data, in making official diagnoses from the Diagnostic and Statistical Manual of Mental Disorders (DSM-IV). Irving B. Weiner (co-developer with John Exner of the Comprehensive system) has stated that the Rorschach “is a measure of personality functioning, and it provides information concerning aspects of personality structure and dynamics that make people the kind of people they are. Sometimes such information about personality characteristics is helpful in arriving at a differential diagnosis, if the alternative diagnoses being considered have been well conceptualized with respect to specific or defining personality characteristics”. In the vast majority of cases, anyway, the Rorschach test wasn’t singled out but used as one of several in a battery of tests, and despite the criticism of usage of the Rorschach in the courts, out of 8,000 cases in which forensic psychologists used Rorschach-based testimony, the appropriateness of the instrument was challenged only six times, and the testimony was ruled inadmissible in only one of those cases. One study has found that use of the test in courts has increased by three times in the decade between 1996 and 2005, compared to the previous fifty years. Others however have found that its usage by forensic psychologists has decreased.

 

Protection of test items and ethics

Psychologists object to the publication of psychological test material out of concerns that a patient’s test responses will be influenced (“primed”) by previous exposure. The Canadian Psychological Association takes the position that, “Publishing the questions and answers to any psychological test compromises its usefulness” and calls for “keeping psychological tests out of the public domain.” The same statement quotes their president as saying, “The CPA’s concern is not with the publication of the cards and responses to the Rorschach test per se, for which there is some controversy in the psychological literature and disagreement among experts, but with the larger issue of the publication and dissemination of psychological test content”.

However, from a legal standpoint, the Rorschach test images have in fact been in the public domain for many years in most countries, particularly those with a copyright term of up to 70 years post mortem auctoris. They have been in the public domain in Hermann Rorschach’s native Switzerland since at least 1992 (70 years after the author’s death, or 50 years after the cut-off date of 1942), according to Swiss copyright law. They are also in the public domain under United States copyright law where all works published before 1923 are considered to be in the public domain.

This means that the Rorschach images may be used by anyone for any purpose. William Poundstone was, perhaps, first to make them public in his 1983 book Big Secrets, where he also described the method of administering the test.

The American Psychological Association (APA) has a code of ethics that supports “freedom of inquiry and expression” and helping “the public in developing informed judgments”. It claims that its goals include “the welfare and protection of the individuals and groups with whom psychologists work”, and it requires that psychologists “make reasonable efforts to maintain the integrity and security of test materials”. The APA has also raised concerns that the dissemination of test materials might impose “very concrete harm to the general public”. It has not taken a position on publication of the Rorschach plates but noted “there are a limited number of standardized psychological tests considered appropriate for a given purpose”. Exner and others have claimed that the Rorschach test is capable of detecting suicidality. A public statement by the British Psychological Society expresses similar concerns about psychological tests (without mentioning any test by name) and considers the “release of [test] materials to unqualified individuals” to be misuse if it is against the wishes of the test publisher. In his book Ethics in psychology, Koocher (1998) notes that some believe “reprinting copies of the Rorschach plates … and listing common responses represents a serious unethical act” for psychologists and is indicative of “questionable professional judgment”. Other professional associations, such as the Italian Association of Strategic Psychotherapy, recommend that even information about the purpose of the test or any detail of its administration should be kept from the public, even though “cheating” the test is held to be practically impossible.

On September 9, 2008, Hogrefe attempted to claim copyright over the Rorschach ink blots during fillings of a complaint with the World Intellectual Property Organization against the Brazilian psychologist Ney Limonge. These complaints were denied. Further complaints were sent to two other websites that contained information similar to the Rorschach test in May 2009 by legal firm Schluep and Degen of Switzerland.

Psychologists have sometimes refused to disclose tests and test data to courts when asked to do so by the parties citing ethical reasons; it is argued that such refusals may hinder full understanding of the process by the attorneys, and impede cross-examination of the experts. APA ethical standard 1.23(b) states that the psychologist has a responsibility to document processes in detail and of adequate quality to allow reasonable scrutiny by the court.

Controversy ensued in the psychological community in 2009 when the original Rorschach plates and research results on interpretations were published in the “Rorschach test” article on Wikipedia. Hogrefe & Huber Publishing, a German company that sells editions of the plates, called the publication “unbelievably reckless and even cynical of Wikipedia” and said it was investigating the possibility of legal action. Due to this controversy an edit filter was temporarily established on Wikipedia to prevent the removal of the plates.

Dr. James Heilman, a Canadian emergency room physician involved in the debate, compared it to the publication of the eye test chart: though people are likewise free to memorize the eye chart before an eye test, its general usefulness as a diagnostic tool for eyesight has not diminished. For those opposed to exposure, publication of the inkblots is described as a “particularly painful development”, given the tens of thousands of research papers which have, over many years, “tried to link a patient’s responses to certain psychological conditions.” Controversy over Wikipedia’s publication of the inkblots has resulted in the blots being published in other locations, such as The Guardian and The Globe And Mail.

Publication of the Rorschach images is also welcomed by critics who consider the test to be pseudoscience. Benjamin Radford, editor of Skeptical Inquirer magazine, stated that the Rorschach “has remained in use more out of tradition than good evidence” and was hopeful that publication of the test might finally hasten its demise.