Chapter 17 The Interview and the DSM

17.1 Overview of Clinical Interviews

Clinical interviews are conversations with a client that have explicit clinical goals, including gathering information about the client. Sharp et al. (2013) provide an overview of the clinical interview. Interviews are the most widely used assessment technique in clinical psychology, yet very little research has examined their validity or ways to improve it. One likely reason for this gap is that the field has been riding the coattails of biological psychiatry, in which the clinical interview has traditionally been assumed to be the “gold standard”. However, this assumption is based on theory, not data.

The problem is that we do not have a “gold standard” (Lilienfeld et al., 2015). We do not have any robust measure against which to compare interviews to verify their accuracy; that is, we lack true knowledge of the pathophysiology that would serve as a strong indicator of disease status. Diagnoses are not directly observable concepts. Comparing one interviewer to another establishes inter-rater reliability (i.e., diagnostic agreement), but it does not establish diagnostic validity, i.e., the correspondence to truth about illness status.

To estimate diagnostic accuracy in the absence of a gold standard, one can use latent class models (Faraone & Tsuang, 1994), which identify latent (unmeasured) class membership among participants using multiple observed variables. For example, latent class membership can be identified based on multiple symptoms, multiple raters or observers, and/or multiple time points. High scores across symptoms, raters, and time points are more likely to reflect true illness. This is consistent with making diagnostic decisions using the LEAD standard, an acronym for Longitudinal, Expert, All Data. So, the best approach may be to use LEAD data to evaluate diagnostic validity.

The latent class would represent illness status: whether the client “has” the disorder or not. Illness status is not directly observable, so we use observables to infer latent classes. Each person is assigned to each latent class with some probability, to account for uncertainty. For example, Person A may be assigned a 75% probability of “having” depression, whereas the probability may be 13% for Person B. We could then treat the latent class estimate of illness status as the (latent) gold standard. Diagnoses based on LEAD data are not 100% valid; thus, they are a LEAD standard, not a gold standard. However, LEAD approaches are expensive and time-consuming, and they are not widely used.
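To make the latent class approach concrete, below is a minimal sketch of fitting a two-class latent class model with the expectation-maximization (EM) algorithm. The data are simulated, and the variable names and parameter values are illustrative assumptions, not taken from Faraone and Tsuang (1994); in practice, dedicated software (e.g., the poLCA package in R) would typically be used.

```python
import numpy as np

rng = np.random.default_rng(0)

# --- Simulate data (illustrative): 500 people, 6 binary indicators ---
# (e.g., symptoms rated by different raters and/or at different time points)
n, j = 500, 6
true_class = rng.random(n) < 0.30              # latent illness status
p_ill, p_well = 0.80, 0.10                     # endorsement probabilities
x = np.where(true_class[:, None],
             rng.random((n, j)) < p_ill,
             rng.random((n, j)) < p_well).astype(float)

# --- Fit a 2-class latent class model with the EM algorithm ---
k = 2
prev = np.array([0.5, 0.5])                    # class prevalences
theta = rng.uniform(0.25, 0.75, size=(k, j))   # P(indicator = 1 | class)

for _ in range(200):
    # E-step: posterior probability of class membership for each person
    # (assumes conditional independence of indicators given class)
    log_lik = (x @ np.log(theta).T) + ((1 - x) @ np.log(1 - theta).T)
    log_post = np.log(prev) + log_lik
    log_post -= log_post.max(axis=1, keepdims=True)
    post = np.exp(log_post)
    post /= post.sum(axis=1, keepdims=True)

    # M-step: update prevalences and endorsement probabilities
    prev = post.mean(axis=0)
    theta = (post.T @ x) / post.sum(axis=0)[:, None]
    theta = theta.clip(1e-6, 1 - 1e-6)

print("Estimated prevalences:", prev.round(2))
print("P(class | data) for first 3 people:\n", post[:3].round(2))
```

Each row of `post` contains the person's posterior probability of belonging to each class; treating the high-endorsement class as “ill,” these probabilities play the role of the latent (LEAD-style) standard described above, e.g., a 75% probability of “having” the disorder for one person and 13% for another.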

17.2 Two Traditions: Unstructured and Structured Interviews

There are two primary traditions to the clinical interview: the unstructured interview and the structured interview.

17.2.1 Unstructured Interview

The unstructured interview is the oldest tradition of the clinical interview, and it was the most widely used interview method until the late 1970s. The unstructured clinical interview came from a psychoanalytic theoretical orientation. In an unstructured interview, the clinician keeps “in their head” the framework of information they want to elicit from the client, including presenting problems, past treatments obtained, comorbid symptoms, and so on.

An unstructured interview is an open-ended, free-flowing conversation between the clinician and client. According to psychoanalysis, the nondirective interview was believed to serve as a catalyst for the client’s expression of their unconscious, for example by means of transference and free association. Transference is when the client treats the therapist as if the therapist were an important figure in the client’s past, for example, the client’s parent. Taking notes was frowned upon because it was viewed as disrespectful to the client and as a disruption of the flow of the session.

A problem with the unstructured interview is that taking a different “conversation path” (i.e., asking different questions) or having a different clinician leads to different information being extracted from the client, and thus to low inter-rater and test–retest reliability. The unstructured interview is susceptible to multiple forms of bias, such as halo effects (e.g., judging a polite, likable client to be healthier than they are). Another important form of cognitive bias is confirmation bias, in which clinicians look for evidence that confirms their hypothesis about the client; this creates a tendency to stop probing once the first diagnosis is identified and, consequently, to assign fewer diagnoses, a phenomenon known as diagnostic overshadowing. The unstructured interview is also susceptible to biases involving race, gender, social class, and other characteristics.

In general, unstructured interviews show low reliability, which results from several factors. One factor is information variance: different information is extracted from the client in each interview. Another factor is criterion variance: different criteria are used to determine the presence or absence of a condition. Criterion variance was reduced once criterion-based diagnosis came along. A third factor is variation in interviewers’ skill and knowledge.

A potential advantage of the unstructured interview is that it can serve the nonspecific aspects of the interview well. For instance, an unstructured interview can be helpful for building rapport, because the client feels that the clinician’s first goal is to understand them and their situation in order to provide help. Classic examples of the unstructured interview are provided in “The Psychiatric Interview” (1970) by Harry Stack Sullivan, a psychodynamically oriented clinician. Sullivan’s (1970) book provides a guide to understanding the unstructured interview. He described processes such as “reciprocal emotion”: the ways the interviewer and interviewee influence each other’s responses through their emotional tone.

17.2.2 Structured Interview

An important movement in the development of interviews was the behavioral interview, which began in the 1960s. Behavioral interviews came from a cognitive-behavioral theoretical orientation. They started out unstructured, but they had a different set of goals from traditional unstructured interviews: they sought to quantify aspects of behavior, for example, how often the symptoms occur. Behavioral interviews applied the A-B-C (antecedent-behavior-consequence) model, which tries to identify patterns of sequences that include the problem behavior. Specifically, the A-B-C model identifies what comes before the behavior (antecedents) and what comes after the behavior (consequences). This helps to identify precipitating and maintaining factors for the problem behavior, as well as strengths to bolster in treatment, and it aids case conceptualization. Behavioral interviews eventually led to more structured versions, called structured interviews.

An example of a structured clinical interview is the Structured Clinical Interview for DSM (SCID). Structured interviews were developed to solve the problem of reliability. In a structured interview, a pre-determined domain of questions is asked in a pre-defined way. A fully structured interview provides a script to use to ask each question (to standardize wording), and in a particular order, with a decision tree and branching logic to choose subsequent questions based on the responses given.

For a structured diagnostic interview, the diagnostic criteria for a disorder are codified, and the exact language is provided to ask about each symptom. Then, the responses are scored and integrated in a systematic way, and the diagnosis is determined in a systematic way.
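As an illustration of what scripted wording with branching logic means computationally, here is a toy sketch; the items, wording, and skip rule are invented for illustration and are not drawn from the SCID or any real instrument.

```python
# A toy structured-interview module: scripted wording, fixed order, and a
# skip-out rule. Items and branching logic are invented for illustration.
MODULE = [
    {"id": "dep1", "text": "In the past two weeks, have you felt sad or down "
                           "most of the day, nearly every day? (yes/no)"},
    {"id": "dep2", "text": "In the past two weeks, have you lost interest or "
                           "pleasure in things you usually enjoy? (yes/no)"},
    {"id": "dep3", "text": "Has your sleep changed markedly (much more or "
                           "much less than usual)? (yes/no)"},
]

def administer(module, answer_fn):
    """Ask each scripted question in order, branching on the responses:
    if neither gate item (dep1, dep2) is endorsed, skip the rest."""
    responses = {}
    for item in module:
        responses[item["id"]] = (answer_fn(item) == "yes")
        if item["id"] == "dep2" and not (responses["dep1"] or responses["dep2"]):
            break  # decision tree: skip out of the module
    return responses

# Simulated respondent who denies both gate items; dep3 is never asked,
# and scoring (a simple symptom count) is fully systematic.
canned = {"dep1": "no", "dep2": "no", "dep3": "yes"}
responses = administer(MODULE, answer_fn=lambda item: canned[item["id"]])
print(responses, "| symptoms endorsed:", sum(responses.values()))
```

Because the wording, order, branching, and scoring are all fixed in advance, two administrations of the same module extract the same information and score it the same way, which is the source of the reliability gains described above.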

Structured interviews show higher reliability than unstructured interviews because of less information variance (the clinician extracts similar information from a given client) and less criterion variance (the clinician makes diagnostic decisions using the same criteria). Interviews fall along a continuum of structure, from semi-structured to fully structured. In general, convergent validity improves as an interview’s degree of structure increases (Widiger, 2002).

When an interview is very structured, even a layperson can administer it. Taken to the extreme, a human interviewer becomes unnecessary: the entire interview can be administered by a computer. In these cases, inter-rater reliability approaches perfection. However, this removes the role of clinical judgment, which can lead to errors if the client misinterprets a computer-administered question or describes a behavior that is not actually clinically concerning. Structured interviews thus reduce the amount of training a clinician needs to administer them, but they lose the potential positive qualities of the unstructured interview, such as the capacity for building rapport; clients may not feel understood, cared about, or that they are working with an expert. Therefore, semi-structured interviews were developed.

17.2.2.1 Semi-Structured Interview

Semi-structured interviews provide some leeway in how the clinician asks questions and in the pace of the questions, so the clinician can ask questions in their own words, ask follow-up questions, or restate questions using the client’s own words. Semi-structured interviews are still structured in that they ask about the same range of problems as a structured interview (to avoid confirmation bias), and they provide a structured way to score diagnostic criteria and assign diagnoses (to ensure consistency across therapists). A well-designed semi-structured interview balances the gains in validity from greater clinical judgment and probing against the loss in inter-rater reliability from idiosyncratic decisions. A semi-structured interview also allows clarifying the meaning of questions and response choices.

Conducting semi-structured interviews requires more training, and it takes a long time to become truly skilled; their problems are frequently discussed in the literature. Training can also become an issue with highly trained individuals, because they sometimes “screw around” with the interview (i.e., do things differently because they believe their approach is superior) and bypass the structure, effectively turning it back into an unstructured interview.

In addition to degree of structure, interviews also differ in their degree of specialization. Some interviews are general/broad, including the SCID. Other interviews are specific to particular domains, such as the Semi-Structured Assessment for the Genetics of Alcoholism (SSAGA), which assesses alcohol use disorder. Specialized interviews provide the maximum amount of information in the area of interest and take less time, which is why they are popular in research; however, the further the assessment strays from the construct of interest, the more valued information is lost and the less helpful the interview becomes.

A review of structured and semi-structured diagnostic interviews is provided by Summerfeldt et al. (2010).

17.2.3 Structured Versus Unstructured Interviews

Structured interviews show higher reliability than unstructured interviews, in terms of both inter-rater and test–retest reliability. Inter-rater reliability of a clinical interview is typically estimated from the degree of agreement between an interviewer and an observer of the interview; that is, it is most often estimated from two raters of the same interview content. However, this does not capture the differing styles of different interviewers. Therefore, typical estimates of the inter-rater reliability of interviews do not capture all facets of a measure’s reliability.

Test–retest reliability of a clinical interview is usually evaluated by conducting interviews of the same client separated (typically) by two weeks, often by a different interviewer. However, different ratings might be made at each time point if the disorder (e.g., depression) fluctuates over time, especially if the disorder is influenced by contextual variation. Changes in symptoms can therefore contribute to differences in ratings across time, as can differences between interviewers. Because it conflates differences between raters with change over time, test–retest reliability tends to be lower than inter-rater reliability when evaluating the reliability of interviews.

Cohen’s kappa (\(\kappa\)) is an index of chance-corrected agreement that ranges from \(-1\) (perfect disagreement) to \(+1\) (perfect agreement); a kappa greater than .75 is generally considered good inter-rater reliability. Fully structured diagnostic interviews typically yield inter-rater reliability estimates of \(\kappa = .80\) or higher. By contrast, estimates of inter-rater agreement between clinicians conducting unstructured interviews are usually near zero, indicating that their agreement is no better than chance.
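For concreteness, here is a short sketch of computing Cohen’s kappa, \(\kappa = (p_o - p_e) / (1 - p_e)\), where \(p_o\) is the observed agreement and \(p_e\) is the agreement expected by chance; the rater data below are hypothetical. (For real data, sklearn.metrics.cohen_kappa_score gives the same result.)

```python
import numpy as np

def cohens_kappa(r1, r2):
    """Cohen's kappa: chance-corrected agreement between two raters.

    kappa = (p_o - p_e) / (1 - p_e), where p_o is the observed agreement
    and p_e is the agreement expected by chance from each rater's base rates.
    """
    r1, r2 = np.asarray(r1), np.asarray(r2)
    labels = np.union1d(r1, r2)
    p_o = np.mean(r1 == r2)                                       # observed
    p_e = sum(np.mean(r1 == c) * np.mean(r2 == c) for c in labels)  # chance
    return (p_o - p_e) / (1 - p_e)

# Hypothetical diagnoses (1 = disorder present) from two raters of the
# same 10 interviews:
rater1 = [1, 1, 0, 0, 1, 0, 0, 1, 0, 0]
rater2 = [1, 1, 0, 0, 1, 0, 0, 0, 0, 1]
print(round(cohens_kappa(rater1, rater2), 2))  # prints 0.58
```

In this hypothetical example, the raters agree on 8 of 10 cases (\(p_o = .80\)), yet after chance correction \(\kappa \approx .58\), which falls short of the .75 benchmark; kappa is deliberately stricter than raw percent agreement.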

Despite the increased reliability of structured and semi-structured interviews compared to unstructured interviews, many clinicians are reluctant to use them. Clinicians often feel that structured interviews impinge upon their professional autonomy and believe that structured interviews will damage rapport, even though semi-structured interviews have been shown to increase rapport. Clients tend to prefer structured approaches, feeling that the clinician gains a more comprehensive understanding of their needs. Clinicians prefer to use their expertise, insight, and judgment; however, that judgment is precisely what results in lower reliability and validity. As a result, unstructured interviews are much more widely used in practice, despite the advantages of structured interviews in terms of reliability and validity.

Clinicians using unstructured approaches tend to diagnose fewer conditions than structured interviews detect. This likely reflects confirmation bias and diagnostic overshadowing.

17.3 Other Findings Regarding Interviews

There are other notable patterns regarding clinical interviews. Inter-rater reliability of Diagnostic and Statistical Manual of Mental Disorders (DSM) diagnoses is better for some conditions than others, but in general it is not very good. This provides further evidence that DSM-defined diagnostic categories are not “real” in a phenomenological sense. Moreover, there is considerable evidence that these diagnostic categories are better conceptualized as dimensional than categorical (Markon et al., 2011). Therefore, a better understanding of the structure of psychopathology may lead to a more valid diagnostic system.

Low inter-rater reliability of interviews could result from low sensitivity, low specificity, or both; even if one is high while the other is low, inter-rater reliability will be low.
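A small simulation can illustrate this point. In the sketch below (reusing the `cohens_kappa` function from the earlier sketch), two independent raters each diagnose the same cases with a given sensitivity and specificity; the 20% base rate and the parameter values are assumptions chosen for illustration.

```python
import numpy as np

rng = np.random.default_rng(1)

def rate(truth, sensitivity, specificity):
    """Simulate a rater who detects true cases with P = sensitivity and
    correctly rules out non-cases with P = specificity."""
    u = rng.random(truth.shape)
    return np.where(truth == 1, u < sensitivity, u >= specificity).astype(int)

truth = (rng.random(100_000) < 0.20).astype(int)   # assumed 20% base rate

for sens, spec in [(0.95, 0.95), (0.95, 0.60), (0.60, 0.95)]:
    r1 = rate(truth, sens, spec)
    r2 = rate(truth, sens, spec)                   # independent second rater
    print(f"sens={sens}, spec={spec}: kappa = {cohens_kappa(r1, r2):.2f}")
```

Under these assumptions, \(\kappa\) is roughly .73 when both sensitivity and specificity are .95, but it drops to roughly .19 when specificity falls to .60, even though sensitivity remains at .95 (and to roughly .36 in the reverse case).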

It is worth noting that the clinical interview is not merely a tool for assessment. A clinical interview can be used as a form of intervention (i.e., therapeutic interview). For instance, motivational interviewing is a therapeutic interview that is often used as a treatment for substance use.

17.4 Best Practice for Diagnostic Assessment

There are several best-practice approaches for conducting diagnostic assessment. As discussed earlier, when possible, it is best to follow the LEAD standard: longitudinal, expert, all data. Have the assessment conducted by one or more experts in the domain(s) of interest. Have multiple people who are experts on the client (e.g., parent, teacher, friend, sibling, self) report on the person’s behavior. Use rating scales to generate the competing hypotheses, and then select structured or semi-structured interview modules to follow up on and rule out those hypotheses. Incorporate multiple forms of information, e.g., interviews, rating scales, observations, objective/direct/performance-based measures, and school/work/legal records. If possible, include an observational measure and an objective/direct/performance-based measure of diagnostically relevant behavior. An example of an observational measure is parent–child play in the clinic paired with a cleanup command, to see how parents give commands, how children respond to the parents’ commands, and how parents respond to their child’s noncompliance. Examples of more objective measures include biological assessments, such as polysomnography, and performance-based assessments, such as intelligence tests and academic achievement tests.

There are many important skills when conducting a clinical interview, including:

  • Taking a nonjudgmental stance
  • Active listening
  • Empathy
  • Authenticity
  • Effective use of silence
  • Paraphrasing
  • Summarizing
  • Providing a sense of hope

17.5 The DSM and ICD

The Diagnostic and Statistical Manual of Mental Disorders (DSM) provides the list of mental disorders and their diagnostic criteria for mental health treatment providers. It is published by the American Psychiatric Association and is used in the United States. The equivalent list of mental disorders and diagnostic criteria for providers outside of the United States is the International Classification of Diseases (ICD); however, the ICD also includes diseases that are not considered mental disorders. Collectively, the DSM and ICD represent the diagnostic system for mental disorders. A goal of the DSM and ICD is to define the various “mental disorders” so that people get the services they need.

17.5.1 Strengths

There are several strengths of the DSM and ICD. First, they can facilitate communication about disorders with other mental health providers; they provide a common language for describing complex behavioral presentations. Second, ideally, the DSM and ICD also guide treatment selection, especially to the extent that people are homogeneous within a disorder in terms of their symptoms, course, etiology, and treatment response. Third, diagnosis is used to justify payment for services from third-party payers (e.g., insurance companies, government). Fourth, diagnosis can be normalizing and empowering for some clients: it may help people learn that others are experiencing similar challenges, and it may give them hope that something can be done to address their difficulties. Fifth, the DSM/ICD promote research in psychopathology, in terms of epidemiology (the distribution of a disorder in the population), etiology (the causes of the disorder), course (how the disorder plays out over time), and treatment (including the development and evaluation of interventions).

17.5.2 Concerns

However, there are also key concerns with the DSM and ICD. One potential concern is stigmatization. The goal of the DSM/ICD is not to label people; labeling can nevertheless be an unfortunate consequence, and it can be stigmatizing because mental disorders often carry a stigma. A second concern is that providing a label may lead the client to see their difficulties as stable and unchangeable; they may mistakenly interpret the label as something that they “have” (rather than as a description of what they “do”), a misconception that is clarified later in this section. For instance, Ahuvia et al. (2024) found that college students who self-labeled as having depression perceived less control over depression and engaged in more catastrophizing, even after controlling for depression symptom severity. A third concern is that the DSM and ICD may pathologize normality. Mental disorders are not infrequent: estimates of the lifetime prevalence of mental disorders are around 75% (Schaefer et al., 2017). That is, three out of four people will experience a mental disorder at some point in their life. Thus, abnormality as defined by the DSM and ICD is, statistically speaking, normal. Another concern is that the DSM and ICD medicalize and pathologize problems in living as mental illnesses, and they obscure the role of environmental factors such as poverty (Gambrill, 2014). Moreover, the diagnoses specified in the DSM and ICD ignore causes and etiology; they do not provide explanations for behavior (Fried, 2022). It would thus be incorrect (and tautological) to say that someone engages in a particular behavior because they have ADHD (or insert whatever diagnosis; archived at https://perma.cc/5972-PUQD).

Ideally, a diagnostic system should have validity and utility. For instance, the diagnoses in a diagnostic system should have construct validity; that is, the diagnostic categories should reflect truth, reality, or the existence of the construct (i.e., diagnostic validity). Diagnostic validity is the extent to which a diagnostic category accurately captures the abnormal phenomenon of interest. Typical indicators of diagnostic validity are homogeneity (similarity) across people receiving a given diagnosis in terms of etiology, course, treatment response, etc. Utility means that the diagnostic system should aid clinical decision-making, in terms of ease of use (e.g., saving time), facilitating communication with others, and treatment planning. However, the diagnostic systems (DSM and ICD) have serious problems with both validity and utility. Regarding validity, it is questionable whether the diagnostic categories defined by the DSM/ICD are actually real. There is great heterogeneity (variability) among people with the same diagnosis in terms of symptom profiles, etiologies, course, and treatment outcomes (Fried, 2022). For instance, there are over 600,000 distinct symptom profiles by which a person can meet criteria for posttraumatic stress disorder (Galatzer-Levy & Bryant, 2013). As another example, two people diagnosed with conduct disorder can share zero symptoms. Subtypes attempt to address this problem, but there is little support for their validity; many disorder subtypes and features are based not on empirical data but on expert judgment, consensus, and politics. Moreover, the diagnoses a person experiences at a given time do not portend the diagnoses they will experience in the future; people show considerable switching from one disorder to another (and even across diagnostic families, e.g., from an externalizing disorder to an internalizing or thought disorder) across development (Caspi et al., 2020).
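The scale of this heterogeneity follows directly from the combinatorics of polythetic criterion sets. The sketch below counts the distinct qualifying symptom profiles under the DSM-5 rule for major depressive disorder (at least 5 of 9 symptoms, at least one of which must be depressed mood or anhedonia); the figure of 636,120 for PTSD (Galatzer-Levy & Bryant, 2013) comes from an analogous, more complex computation on the PTSD criteria.

```python
from itertools import combinations

SYMPTOMS = range(9)   # the 9 DSM-style MDD symptom criteria
CORE = {0, 1}         # depressed mood, anhedonia (at least one required)

profiles = [
    combo
    for k in range(5, 10)                 # at least 5 of the 9 symptoms
    for combo in combinations(SYMPTOMS, k)
    if CORE & set(combo)                  # at least one core symptom
]
print(len(profiles))  # prints 227
```

So even this single, relatively simple criterion set admits 227 distinct ways to meet criteria, and this counts only which of the 9 criteria are met; because several criteria are themselves compound (e.g., insomnia or hypersomnia), the number of distinct presentations is far larger.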

There is also strong comorbidity (co-occurrence) across disorders: a person who meets criteria for one disorder is likely to also meet criteria for other disorders (Caspi et al., 2020). Many disorders often co-occur within an individual, including major depressive disorder and generalized anxiety disorder, which suggests that they may share an underlying liability and do not reflect distinct categories. This complicates treatment and challenges the validity of the diagnostic system.

Another challenge concerns the coverage of the diagnostic system, including the number of categories and which ones are represented. How many disorders/categories should there be? Do we have too many categories? There are hundreds of possible diagnoses, including 541 disorders in the DSM-5 (Blashfield et al., 2014). Or do we have too few? Clinicians often assign “Not Otherwise Specified” or “Unspecified” diagnoses, which suggests that the current diagnostic categories do not cover the variability of clients’ presentations. How thinly should we “split” the categories? Some people are “splitters,” who tend to divide categories into as many categories as possible; others are “lumpers,” who tend to combine categories so there are fewer of them.

The diagnostic system also has serious concerns with utility, in terms of lengthy criterion sets that are time-consuming and difficult to assess in practice (Mullins-Sweatt & Widiger, 2009). One possible way to address the time-consuming nature of lengthy criterion sets is to use prototypal matching instead: the clinician rates the similarity of the client to a narrative description of a prototypical person with the disorder. However, prototypal matching could result in lower inter-rater reliability (compared to criterion sets) because of idiosyncratic judgments; there is a tradeoff between assessment time and validity. The clinician’s time can also be saved by using computerized interviews or computerized tests. Another time-saving approach is to use self-report inventories as a screening device, followed by an interview specific to the most relevant conditions identified by the screen.
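As a minimal sketch of this screen-then-interview strategy, the snippet below maps elevated screener scales to follow-up interview modules; the scale names, the T-score cutoff of 65, and the module mapping are all hypothetical.

```python
# Hypothetical screener scales mapped to follow-up interview modules.
SCREEN_TO_MODULE = {
    "depression": "mood disorders module",
    "anxiety": "anxiety disorders module",
    "alcohol_use": "substance use module",
}
CUTOFF_T = 65  # hypothetical elevation threshold (T-score)

def select_modules(t_scores: dict[str, float]) -> list[str]:
    """Return only the interview modules for domains the self-report
    screen flagged as elevated, saving interview time."""
    return [SCREEN_TO_MODULE[scale]
            for scale, t in t_scores.items()
            if scale in SCREEN_TO_MODULE and t >= CUTOFF_T]

# Example: the screen flags depression and alcohol use but not anxiety,
# so the clinician follows up with only two interview modules.
print(select_modules({"depression": 72, "anxiety": 58, "alcohol_use": 66}))
```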

The diagnostic system includes arbitrary thresholds that do not facilitate social and clinical decision-making. Current diagnostic categories do not provide much utility in terms of determining the appropriate treatment.

No single biological substrate has been identified for any psychological disorder with high enough sensitivity and specificity to be useful for diagnosis or for predicting treatment response (Tiego et al., 2023). This is because DSM/ICD-based psychological disorders are fictive categories: the DSM/ICD-defined categories do not exist in nature. This does not dismiss the importance of what people are experiencing, which is very real. A person’s experience of depression is real, but the construct of major depressive disorder, as defined by the DSM/ICD, is not. Mental disorders are not concrete things; psychological disorders are social constructs. Mental disorders are not something that people “have”; rather, they are something that people “do” or “experience”. The mental disorders in the DSM/ICD are merely descriptions of behaviors. They are fuzzy concepts with vague boundaries, and they are subjective, often without a clear “yes” or “no” as to whether someone meets criteria for a particular disorder. Mental disorders are defined in relation to cultural, social, and familial norms and values; that is, they are defined socially, not biologically. The boundaries between normality and abnormality vary across cultures. Thus, the diagnostic categories in the DSM/ICD may not involve “biological pathology”; they do not carve nature at its joints.

Researchers have examined biology-related differences between people with and without mental disorders, and they have found some differences; however, these biological differences cannot classify people perfectly, because the categories are defined socially, in terms of behaviors, not biologically.

Another concern with the DSM and ICD is their conceptualization of psychopathology as binary. The DSM and ICD treat most disorders as categorical phenomena: either you have a disorder or you do not. There is no gray area in this conceptualization of psychopathology. However, research has shown that the difference between “normal” and “abnormal” behavior is frequently one of degree rather than kind (Markon et al., 2011). Thus, a dimensional approach to classification may provide a more valid portrayal of many clinical phenomena than the categorical approach used by the DSM and ICD.

There is also potential for bias in the diagnostic system or its application, especially against people with particular disorders. For instance, some therapists will not treat people with particular disorders, especially certain types of personality disorders (e.g., borderline personality disorder, antisocial personality disorder), because of perceived challenges. The diagnostic system also might overestimate abnormality in women (e.g., premenstrual dysphoric disorder) or racial/ethnic minorities.

In general, the reliability of diagnoses is low: inter-rater reliability tends to be low (Lobbestael et al., 2011), and test–retest reliability is also relatively low. A client might cycle back and forth between diagnosis and no diagnosis based on one or a few symptoms. For instance, a diagnosis of major depressive disorder is given with 5 or more symptoms, but no diagnosis is given with 4 or fewer symptoms. The fluctuation of symptoms and disorder status across time suggests that a binary approach to diagnosis is inaccurate.

It will thus be important to improve our diagnostic system. For instance, it will be important for the diagnostic system to account for the dimensional nature of constructs, so that it better carves nature at its joints (better validity) and is flexible enough to allow different optimal thresholds for different social and clinical decisions (better utility). One argument against dimensional diagnostic systems is that treatment decisions are often binary (e.g., whether or not to provide treatment). However, decisions in practice can be dimensional too, reflecting different levels of treatment, such as the dosage of medication, the frequency and intensity of therapy, and the degree of hospitalization (e.g., partial versus full hospitalization).

17.5.3 Alternative Structures of Psychopathology

Emerging evidence suggests that alternative models of psychopathology may provide a better fit to its structure. The models that have captured the greatest attention are hierarchical (higher-order) models of psychopathology, including the p-factor (Caspi et al., 2014; Smith et al., 2020) and the hierarchical taxonomy of psychopathology (HiTOP; Kotov et al., 2017, 2021). The p-factor is a general factor of psychopathology, akin to the general factor of intelligence (g); it accounts for the fact that many forms of psychopathology show strong covariation and co-occurrence. In the p-factor model (Caspi et al., 2014; Smith et al., 2020), the p-factor subsumes three lower-order dimensions of psychopathology: internalizing problems (e.g., depression, anxiety, obsessive-compulsive disorder), externalizing problems (e.g., conduct disorder, ADHD), and thought-disordered problems (e.g., autism, schizophrenia).
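One common way to formalize such a hierarchical structure is a bifactor model, in which each indicator loads on both the general factor and one specific factor. A sketch of the measurement equation, with notation assumed here rather than taken from the cited papers:

\[
x_{ij} = \lambda^{(p)}_{j} \, p_i + \lambda^{(s)}_{j} \, s_{d(j),i} + \varepsilon_{ij},
\]

where \(x_{ij}\) is person \(i\)’s score on indicator \(j\), \(p_i\) is the person’s standing on the general (p) factor, \(s_{d(j),i}\) is their standing on the specific factor (e.g., internalizing, externalizing, or thought disorder) to which indicator \(j\) belongs, and \(\varepsilon_{ij}\) is a residual; the general and specific factors are typically specified as orthogonal, so that p captures what all forms of psychopathology share.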

In the HiTOP model (Kotov et al., 2017, 2021), a p-factor influences superspectra, including emotional dysfunction, psychosis, and externalizing problems. Emotional dysfunction subsumes somatoform and internalizing problems (e.g., sexual problems, eating pathology, fear, distress, and mania). Psychosis subsumes thought disorder and detachment. Externalizing problems are subdivided into disinhibited externalizing and antagonistic externalizing; both subsume antisocial behavior, whereas disinhibited externalizing also subsumes heavy substance use.

A third alternative conceptualization of psychopathology is the National Institute of Mental Health (NIMH) Research Domain Criteria (RDoC). RDoC is described in Chapter 20.

Another emerging alternative is the network approach, which models the covariation among symptoms (McNally, 2021).

17.6 Conclusion

Interviews can be administered (a) in a free-flowing way, as unstructured interviews; (b) with questions asked in a pre-defined way, in a pre-defined order, and with pre-defined scoring criteria, as structured interviews; or (c) as semi-structured interviews, which blend structure with the freedom to ask follow-up questions. In general, reliability and validity improve as the degree of structure of an interview increases.

The Diagnostic and Statistical Manual of Mental Disorders (DSM) and the International Classification of Diseases (ICD) provide the list of mental disorders and the diagnostic criteria for mental health treatment providers. The DSM and ICD have potential strengths, including (ideally) facilitating communication, guiding treatment selection, justifying payment for services, normalizing and empowering some clients, and promoting research in psychopathology. However, there are key concerns with the DSM and ICD, including stigmatization, pathologizing normality, poor coverage, binary classification, obscuring environmental factors, the lack of biological criteria, potential bias, and low reliability, validity, and utility. There are alternatives to the DSM and ICD for conceptualizing psychopathology, including hierarchical structures such as the p-factor and the hierarchical taxonomy of psychopathology (HiTOP).

17.7 Suggested Readings

Sommers-Flanagan & Sommers-Flanagan (2016)

References

Ahuvia, I. L., Schleider, J. L., Kneeland, E. T., Moser, J. S., & Schroder, H. S. (2024). Depression self-labeling in U.S. College students: Associations with perceived control and coping strategies. Journal of Affective Disorders, 351, 202–210. https://doi.org/10.1016/j.jad.2024.01.229
Blashfield, R. K., Keeley, J. W., Flanagan, E. H., & Miles, S. R. (2014). The cycle of classification: DSM-I through DSM-5. Annual Review of Clinical Psychology, 10(1), 25–51. https://doi.org/10.1146/annurev-clinpsy-032813-153639
Caspi, A., Houts, R. M., Ambler, A., Danese, A., Elliott, M. L., Hariri, A., Harrington, H., Hogan, S., Poulton, R., Ramrakha, S., Rasmussen, L. J. H., Reuben, A., Richmond-Rakerd, L., Sugden, K., Wertz, J., Williams, B. S., & Moffitt, T. E. (2020). Longitudinal assessment of mental health disorders and comorbidities across 4 decades among participants in the Dunedin Birth Cohort Study. JAMA Network Open, 3(4), e203221–e203221. https://doi.org/10.1001/jamanetworkopen.2020.3221
Caspi, A., Houts, R. M., Belsky, D. W., Goldman-Mellor, S. J., Harrington, H., Israel, S., Meier, M. H., Ramrakha, S., Shalev, I., Poulton, R., & Moffitt, T. E. (2014). The p factor: One general psychopathology factor in the structure of psychiatric disorders? Clinical Psychological Science, 2(2), 119–137. https://doi.org/10.1177/2167702613497473
Faraone, S. V., & Tsuang, M. T. (1994). Measuring diagnostic accuracy in the absence of a “gold standard.” American Journal of Psychiatry, 151, 650–657. https://doi.org/10.1176/ajp.151.5.650
Fried, E. I. (2022). Studying mental health problems as systems, not syndromes. Current Directions in Psychological Science, 31(6), 500–508. https://doi.org/10.1177/09637214221114089
Galatzer-Levy, I. R., & Bryant, R. A. (2013). 636,120 ways to have posttraumatic stress disorder. Perspectives on Psychological Science, 8(6), 651–662. https://doi.org/10.1177/1745691613504115
Gambrill, E. (2014). The diagnostic and statistical manual of mental disorders as a major form of dehumanization in the modern world. Research on Social Work Practice, 24(1), 13–36. https://doi.org/10.1177/1049731513499411
Kotov, R., Krueger, R. F., Watson, D., Achenbach, T. M., Althoff, R. R., Bagby, R. M., Brown, T. A., Carpenter, W. T., Caspi, A., Clark, L. A., Eaton, N. R., Forbes, M. K., Forbush, K. T., Goldberg, D., Hasin, D., Hyman, S. E., Ivanova, M. Y., Lynam, D. R., Markon, K., … Zimmerman, M. (2017). The hierarchical taxonomy of psychopathology (HiTOP): A dimensional alternative to traditional nosologies. Journal of Abnormal Psychology, 126(4), 454–477. https://doi.org/10.1037/abn0000258
Kotov, R., Krueger, R. F., Watson, D., Cicero, D. C., Conway, C. C., DeYoung, C. G., Eaton, N. R., Forbes, M. K., Hallquist, M. N., Latzman, R. D., Mullins-Sweatt, S. N., Ruggero, C. J., Simms, L. J., Waldman, I. D., Waszczuk, M. A., & Wright, A. G. C. (2021). The hierarchical taxonomy of psychopathology (HiTOP): A quantitative nosology based on consensus of evidence. Annual Review of Clinical Psychology, 17(1), 83–108. https://doi.org/10.1146/annurev-clinpsy-081219-093304
Lilienfeld, S. O., Sauvigne, K., Lynn, S. J., Latzman, R. D., Cautin, R., & Waldman, I. D. (2015). Fifty psychological and psychiatric terms to avoid: A list of inaccurate, misleading, misused, ambiguous, and logically confused words and phrases. Frontiers in Psychology, 6. https://doi.org/10.3389/fpsyg.2015.01100
Lobbestael, J., Leurgans, M., & Arntz, A. (2011). Inter-rater reliability of the Structured Clinical Interview for DSM-IV Axis I Disorders (SCID I) and Axis II Disorders (SCID II). Clinical Psychology & Psychotherapy, 18(1), 75–79. https://doi.org/10.1002/cpp.693
Markon, K. E., Chmielewski, M., & Miller, C. J. (2011). The reliability and validity of discrete and continuous measures of psychopathology: A quantitative review. Psychological Bulletin, 137(5), 856–879. https://doi.org/10.1037/a0023678
McNally, R. J. (2021). Network analysis of psychopathology: Controversies and challenges. Annual Review of Clinical Psychology, 17(1), 31–53. https://doi.org/10.1146/annurev-clinpsy-081219-092850
Mullins-Sweatt, S. N., & Widiger, T. A. (2009). Clinical utility and DSM-V. Psychological Assessment, 21(3), 302–312. https://doi.org/10.1037/a0016607
Schaefer, J. D., Caspi, A., Belsky, D. W., Harrington, H., Houts, R., Horwood, L. J., Hussong, A., Ramrakha, S., Poulton, R., & Moffitt, T. E. (2017). Enduring mental health: Prevalence and prediction. Journal of Abnormal Psychology, 126(2), 212–224. https://doi.org/10.1037/abn0000232
Sharp, K. L., Williams, A. J., Rhyner, K. T., & Ilardi, S. S. (2013). The clinical interview. In K. F. Geisinger, J. F. Carlson, J.-I. C. Hansen, N. R. Kuncel, S. P. Reise, & M. C. Rodriguez (Eds.), APA handbook of testing and assessment in psychology, Vol. 2: Testing and assessment in clinical and counseling psychology (pp. 103–117). American Psychological Association.
Smith, G. T., Atkinson, E. A., Davis, H. A., Riley, E. N., & Oltmanns, J. R. (2020). The general factor of psychopathology. Annual Review of Clinical Psychology, 16(1), 75–98. https://doi.org/10.1146/annurev-clinpsy-071119-115848
Sommers-Flanagan, J., & Sommers-Flanagan, R. (2016). Clinical interviewing. Wiley.
Sullivan, H. S. (1970). The psychiatric interview. Norton.
Summerfeldt, L. J., Kloosterman, P. H., & Antony, M. M. (2010). Structured and semistructured diagnostic interviews. In M. M. Antony & D. H. Barlow (Eds.), Handbook of assessment and treatment planning for psychological disorders (2nd ed., pp. 95–137). Guilford Press.
Tiego, J., Martin, E. A., DeYoung, C. G., Hagan, K., Cooper, S. E., Pasion, R., Satchell, L., Shackman, A. J., Bellgrove, M. A., Fornito, A., Abend, R., Goulter, N., Eaton, N. R., Kaczkurkin, A. N., & Nusslock, R. (2023). Precision behavioral phenotyping as a strategy for uncovering the biological correlates of psychopathology. Nature Mental Health, 1, 304–315. https://doi.org/10.1038/s44220-023-00057-5
Widiger, T. A. (2002). Personality disorders. In M. M. Antony & D. H. Barlow (Eds.), Handbook of assessment and treatment planning for psychological disorders (pp. 453–480). Guilford Publications.
