Young Mania Rating Scale (YMRS)

The Young Mania Rating Scale (YMRS) is a clinician-administered tool used to rate the severity of manic symptoms in clinical and research settings (Young, Biggs, Ziegler & Meyer, 1978). It was originally developed in 1978 and normed with psychiatric inpatients on the basis of a semi-structured interview and observations over an 8-hour period. Today the YMRS combines the client's self-report of manic symptoms over the past 48 hours with the clinician's observations during the interview (Miller, Johnson & Eisner, 2009). It is commonly used to screen for manic symptoms and to monitor their severity (Lukasiewicz et al., 2013), including tracking the progress of interventions (Miller, Johnson & Eisner, 2009).

It is an 11-item scale assessing mood, motor activity/energy levels, interest in sex, sleep, irritability, rate and frequency of speech, flight of ideas, grandiosity, aggressive behaviour, appearance, and insight into current presentation. It should be noted that the YMRS does not map onto the DSM-5 criteria for mania, as it does not account for distractibility, increases in goal-directed activity, or excessive involvement in pleasurable activities that have a potential for painful consequences (DSM-5). As such, this tool is not a diagnostic assessment.

Each item has 5 explicitly defined levels of severity. Seven items are scored on a scale of 0-4. The remaining 4 items are double-weighted to compensate for poor cooperation from clients who are unwell, and are scored on a scale of 0-8. Item ratings are summed to produce a total score between 0 and 60, with higher scores indicating more severe symptoms. A score >29 indicates that the person is experiencing “severe” mania (Wciorka et al., 2011).
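The scoring rule above can be sketched in a few lines of Python. This is a minimal illustration, not an official scoring implementation. The item keys are hypothetical labels taken from the item list in this section. The text does not name the four double-weighted items, so the default set below is an assumption and is passed as a parameter so that it can be changed.

```python
# Hypothetical item keys, following the item list given in this section.
YMRS_ITEMS = (
    "mood", "motor_activity_energy", "sexual_interest", "sleep",
    "irritability", "speech", "flight_of_ideas", "grandiosity",
    "aggressive_behaviour", "appearance", "insight",
)

# ASSUMPTION: the text does not say which four items are double-weighted.
# This default set is illustrative and should be checked against the scale.
ASSUMED_DOUBLE_WEIGHTED = frozenset(
    {"irritability", "speech", "grandiosity", "aggressive_behaviour"}
)


def ymrs_total(ratings, double_weighted=ASSUMED_DOUBLE_WEIGHTED):
    """Sum the 11 item ratings after checking each against its range.

    Single-weighted items must fall in 0-4, double-weighted items in 0-8.
    """
    missing = set(YMRS_ITEMS) - set(ratings)
    if missing:
        raise ValueError(f"missing item ratings: {sorted(missing)}")

    total = 0
    for item in YMRS_ITEMS:
        ceiling = 8 if item in double_weighted else 4
        value = ratings[item]
        if not 0 <= value <= ceiling:
            raise ValueError(f"{item} must be between 0 and {ceiling}")
        total += value
    return total


# Example: the highest rating on every item gives the maximum total of 60.
highest = {
    item: (8 if item in ASSUMED_DOUBLE_WEIGHTED else 4) for item in YMRS_ITEMS
}
print(ymrs_total(highest))  # 60
```

Because seven items contribute at most 4 points and four items at most 8, the maximum is 7 × 4 + 4 × 8 = 60, which matches the total score range given above.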

Although weighting items increases the complexity of scoring and interpretation, it has not affected the psychometric properties of the scale. The YMRS is reported to have high interrater reliability for total scores (0.93) and individual item scores (0.66-0.92) (Young et al., 1978). It has also been found to have good internal reliability, with Cronbach's alpha coefficients ranging from 0.80 to 0.91. The YMRS has demonstrated high convergent validity with other measures of mania, including the Bech-Rafaelsen Mania Rating Scale (Spearman’s rho = 0.90). Furthermore, the YMRS statistically differentiates between clients before and 2 weeks after treatment (Young et al., 1978), and between mania and symptoms of ADHD (Serrano, Ezpeleta, Alda, Matalí, & San, 2011). Finally, the YMRS has demonstrated validity across cultural populations, including Korea (Seon-Cheol & Joonjo, 2016) and Poland (Wciorka et al., 2011).


General Behavior Inventory (GBI)

The General Behavior Inventory (GBI), first developed by Depue et al. (1981), was designed to identify the presence and severity of depressive and manic/hypomanic symptoms, as well as to assess for cyclothymia in adults. In their attempts to explore predisposition to bipolar disorder, the authors created a behavioural paradigm to identify persons at risk. Though intended for use in an adult population, a slightly modified version of the GBI has demonstrated potential as a parent-report measure of mood symptomatology amongst children and adolescents (Youngstrom, Findling, Danielson, & Calabrese, 2001). In addition, a short version has been developed via factor analysis that allows for it to be a screening tool in both adult and adolescent populations (Youngstrom, Murray, Johnson, & Findling, 2016).

The original self-report includes three dimensions, or subscales, that comprise 73 items on which respondents use a 4-point Likert-type scale (0 = never or hardly ever; 3 = very often/almost constantly) to indicate how frequently they have experienced a behaviour over the past year. The Depression scale sums 45 of the items, whilst the Hypomanic and Biphasic scales combined sum the remaining 28 items. Questions include: “Have you become sad, depressed, or irritable for several days or more without really understanding why?” and “Has your mood or energy shifted rapidly back and forth from happy to sad or high to low?” As suggested by Depue, Krauss, and Spoont (1987), the items may be scored using a dichotomous model. This involves dividing the population into cases and non-cases, where individuals responding 0 or 1 to an item receive 0 points and those responding 2 or 3 receive 1 point. The scale may also be scored in the traditional Likert fashion, where the responses are simply summed. Whilst higher scores reflect increased psychopathology, it is important to note that the GBI is not a diagnostic tool. Research has indicated that the scales can discriminate between bipolar and disruptive behaviour disorders, unipolar and bipolar depression, and mood and disruptive behaviour disorders or no diagnosis (Danielson, Youngstrom, Findling, & Calabrese, 2003).
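The two scoring approaches described above can be sketched as follows. This is a minimal illustration under stated assumptions: the function names are hypothetical, and the input is taken to be a plain list of item responses (0-3) for a single subscale. Which items belong to which subscale is not specified here and is left to the caller.

```python
def _check_responses(responses):
    """Ensure every response is one of the four Likert options 0-3."""
    for r in responses:
        if r not in (0, 1, 2, 3):
            raise ValueError(f"invalid GBI response: {r!r}")


def gbi_likert_score(responses):
    """Traditional Likert scoring: the responses are simply summed."""
    _check_responses(responses)
    return sum(responses)


def gbi_dichotomous_score(responses):
    """Dichotomous scoring: responses of 0 or 1 earn 0 points,
    responses of 2 or 3 earn 1 point."""
    _check_responses(responses)
    return sum(1 for r in responses if r >= 2)


# Example with four hypothetical item responses.
answers = [0, 1, 2, 3]
print(gbi_likert_score(answers))       # 6
print(gbi_dichotomous_score(answers))  # 2
```

The dichotomous score counts how many items are endorsed at a clinically notable frequency, whereas the Likert score also reflects how strongly each item is endorsed.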

The GBI has strong psychometric properties. In a recent evaluation study, it demonstrated excellent internal consistency (Cronbach’s α over .93 for both subscales; Pendergast et al., 2014). Results from the original validation study suggest the tool has good test-retest reliability (r = .73 over 15 weeks), excellent content validity, excellent construct validity, and excellent discriminative validity (Depue et al., 1981). More recent studies have found the GBI to have excellent discriminant validity (Youngstrom, Genzlinger, Egerton, & Van Meter, 2015) and good treatment sensitivity (Youngstrom et al., 2013).

Evidence has shown that gender differences have not compromised the overall psychometric properties of the GBI (Depue & Klein, 1988). However, Chmielewski and colleagues (1995) compared GBI data for African American, Asian American, Caucasian, and Latino samples and found significant cultural differences: Caucasians scored lower than all other groups. Two decades later, using a combined Caucasian and African American sample, Pendergast et al. (2015) found that GBI scores were largely invariant across racial groups.

Free access to the GBI:

Mood Disorder Questionnaire (MDQ)



The Mood Disorder Questionnaire (MDQ) was created by Hirschfeld and colleagues (2000) to address the need for accurate screening of individuals with a bipolar spectrum disorder. Accurate identification of bipolar disorder (BD) is a concern because it is often unrecognised or inaccurately diagnosed, which delays diagnosis and appropriate treatment (Lish et al., 1994). Items on the MDQ are derived from the DSM-IV criteria and clinical experience (Hirschfeld, 2000).

Clinical Use

The MDQ is a self-report measure that takes around five minutes to complete. It is a screening tool only and should not be used for diagnostic purposes; a positive screen should be followed by a comprehensive evaluation.

Administration and Scoring

The MDQ consists of 3 questions. The first comprises 13 items that examine manic symptoms. The second asks whether the identified symptoms have co-occurred, and the third rates the severity of the symptoms. To screen positive, the individual must answer ‘yes’ to a minimum of 7 items on question 1, respond ‘yes’ to question 2, and answer ‘moderate problem’ or ‘serious problem’ to question 3.
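The positive-screen rule above can be sketched as a single check. This is a minimal illustration, not an official scoring routine. The function and parameter names are hypothetical, and the question 3 answers are assumed to be recorded as the lowercase strings quoted in the text.

```python
# Question 3 answers that count towards a positive screen, as quoted above.
POSITIVE_Q3_ANSWERS = {"moderate problem", "serious problem"}


def mdq_screen_positive(q1_answers, q2_co_occurred, q3_answer, q1_cutoff=7):
    """Return True when all three criteria for a positive screen are met.

    q1_answers     -- 13 yes/no answers (True = 'yes') to the symptom items
    q2_co_occurred -- True if the symptoms co-occurred (question 2 = 'yes')
    q3_answer      -- question 3 answer, e.g. 'moderate problem'
    q1_cutoff      -- minimum number of 'yes' items; 7 is the standard cutoff
    """
    if len(q1_answers) != 13:
        raise ValueError("question 1 must have exactly 13 item answers")

    yes_count = sum(1 for answer in q1_answers if answer)
    return bool(
        yes_count >= q1_cutoff
        and q2_co_occurred
        and q3_answer in POSITIVE_Q3_ANSWERS
    )


# Example: 7 'yes' items, co-occurrence, and a moderate problem.
answers = [True] * 7 + [False] * 6
print(mdq_screen_positive(answers, True, "moderate problem"))  # True
print(mdq_screen_positive(answers, False, "moderate problem"))  # False
```

The cutoff is exposed as a parameter because studies have used differing cutoff values for question 1. A positive result from this check is a screening outcome only and does not indicate a diagnosis.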

Development and Psychometric Properties

The MDQ has achieved adequate internal consistency, with reported Cronbach’s alpha values of 0.79 and 0.90 (Hirschfeld, 2000; Isometsä et al., 2003). The validation study administered the MDQ to patients at five psychiatric clinics in the United States (Hirschfeld, 2000). The results were used to determine item cutoff points, specificity, and sensitivity. Findings demonstrated that the MDQ had a sensitivity of 0.73 and a specificity of 0.90 when contrasted against other screening questionnaires in psychiatric settings. The researchers then conducted testing in a general population, which yielded a sensitivity of 0.28 and a specificity of 0.97 (Hirschfeld, 2002). An additional study assessed the effectiveness of the MDQ in unipolar and bipolar depressive patients and found a sensitivity of 0.58 (higher for bipolar 1) and a specificity of 0.67 (Miller, Klugman, Berv, Rosenquist, & Ghaemi, 2004). Lastly, testing in a primary care setting revealed a sensitivity of 0.58 and a specificity of 0.93 (Hirschfeld, Cass, Holt, & Carlson, 2005).

In sum, the MDQ is a useful screening tool for BD, demonstrating validity in clinical settings and across cultures. However, consideration should be given to its higher sensitivity for BD type 1 compared with other disorders on the bipolar spectrum, and to its low sensitivity in general populations. Additionally, differing item cutoff points in scoring (e.g., standard or modified cutoff value of 7 for question 1) and differing inclusion/exclusion criteria (e.g., a more narrowly defined BD definition includes more severe cases and increases sensitivity) have produced variability in sensitivity and specificity, thus limiting its overall effectiveness (Wang et al., 2015).


