Current state of RUCAM-based HILI reports published from the Asian region
China
Starting as early as 2006 with an analysis from Hong Kong,18 an overall 21 reports of HILI cases were published which had been assessed for causality using RUCAM with results presented by groups with first authors having their working place in mainland China, Hong Kong, or Taiwan.18–38 These publications merit further consideration. Assessed cases were commonly well presented with respect to case data completeness and evaluation (Table 1). Most reports provided data of cohorts consisting of HILI alone, but few combined results of both HILI and DILI cases, causing confusion due to mixed data.24,26,30,36 In a few instances, publications erroneously mention in their title specifically only DILI, although HILI cases are also presented in the text,24,26,36 ignoring thereby that HILI features are clearly different from those of DILI.5–11 It seems that most reports were based on a retrospective rather than a prospective study design. Some studies included HILI cases not only with highly probable or probable causality gradings but also with a possible causality level based on RUCAM scores ≥3 (Table 1).24,25,27,28,30,36,38 In other cases, RUCAM-based causality gradings were erroneously classified as definitive;18 although this term was never proposed or approved in the RUCAM literature that determines highly probable as the highest grading,12,15 the most appropriate term for results in biological systems like clinical liver injury. Occasionally, RUCAM-based causality gradings, classified initially as possible, had afterwards been upgraded to a probable level through a non-transparent maneuver36—an overall highly questionable and disputable approach as also discussed previously.11 In rare instances, causality gradings were not reported35 or RUCAM was used for causality grading but the respective publication remained unquoted,26,34 even if the updated RUCAM was mentioned in the text.34 Similar omissions of RUCAM quotation have been observed in some publications related to DILI.11
Table 1.Asian countries with a selection of published HILI cases assessed for causality using RUCAM, occasionally reported together with DILI cases
Country | Author | Year | HILI (n) DILI (n) | Products | Comments |
China | Yuen18 | 2006 | HILI (7) | Several herbs | Using RUCAM for causality assessment, gradings were highly probable (not definitive!) in 3 patients, probable in 2 cases, and possible in 2 patients |
Cheung19 | 2009 | HILI (3) | Psoralea corylifolia | RUCAM-based causality assessment provided with scores of 6-8 in all cases a probable causality grading |
Chau20 | 2011 | HILI (27) | Multiple herbs | With RUCAM, causality gradings were highly probable in 5 cases, probable in 16, possible in 5, and unlikely in 1 case |
Lin21 | 2011 | HILI (1) | Gynura segetum | Using the original RUCAM, a score of 6 corresponding to a probable causality grading was reported |
Gao22 | 2012 | HILI (5) | Gynura segetum | Based on evaluations by the original RUCAM, scores were reported as ≥5 corresponding to a probable or possible causality grading |
Dong23 | 2014 | HILI (18) | Polygonummultiflorum | RUCAM provided 4 x a probable and 14 x a highly probable causality grading |
Hao24 | 2014 | HILI (87) DILI (13) | Multiple herbs and drugs | RUCAM was used to assess causality, whereby HILI or DILI cases with a score of ≥3 (possible or higher) were included |
Gao25 | 2015 | HILI (23) | PA-containing herbs | RUCAM was used, cases with a score of >5 were included (causality grading of possible or probable, score: 5.52±0.67) |
Ou26 | 2015 | HILI (130) DILI (361) | Multiple herbs and drugs | Unspecified, not quoted RUCAM version used for HILI, providing a probable or highly probable causality grading |
Wang27 | 2015 | HILI (40) | Polygonum multiflorum | Within a subgroup: 9 highly probable, 15 probable, 16 possible RUCAM gradings |
Zhu28 | 2015 | HILI (158) | Polygonum multiflorum | The original RUCAM with a score of ≥3 was used for the included cases |
Zhang29 | 2016 | HILI (54) | Multiple herbs | Use of the updated RUCAM, providing a highly probable causality grading in 26 cases and a probable one in 28 cases |
Zhu30 | 2016 | HILI (563) DILI (870) both (552) | Multiple herbs and drugs | Cohorts consisted of HILI, DILI, or both, RUCAM gradings were highly probable or probable, rarely possible |
Li31 | 2017 | HILI (1) | Polygonum multiflorum | By using the updated RUCAM, reported causality grading was probable |
Chow32 | 2019 | HILI (1,552) | Many single herbs (1,428) or mixtures with multiple herbs (124) | RUCAM-based causality gradings were: probable or higher for only 138 cases of HILI by single herbs and for 56 cases by herbal mixtures, possible for 226 cases of HILI by single herbs and for 27 cases by herbal mixtures, but lacking causality for 1,064 cases of HILI by single herbs and for 41 cases by herbal mixtures |
Jing33 | 2019 | HILI (145) | Polygonum multiflorum | With the updated RUCAM, causality gradings were highly probable (11%), probable (82.8%), and possible (6.2%) |
Li34 | 2019 | HILI (1) | Psoralea corylifolia | The updated RUCAM was used but not specifically referenced, with score of 10 as a highly probable causality grading |
Liu35 | 2019 | HILI (331) | Polygonum multiflorum | Cases had been assessed by RUCAM without providing causality gradings |
Shen36 | 2019 | HILI (6,971) DILI (18,956) | Multiple herbs and drugs | RUCAM-based assessment with scores ≥3 in all HILI cases but causality grading was not differentiated from additional, RUCAM-based 18,956 DILI cases |
Tan37 | 2019 | HILI (3) | Swietenia macrophylla | RUCAM score was 7 for all 3 patients, in line with a probable causality grading. |
Zhu38 | 2019 | HILI (488) | Multiple herbs | Using the updated RUCAM, causality grading was highly probable in 52 cases (10.5%), probable in 370 cases (74.8%), and possible in 66 cases (13.3%) |
Japan | Tsuda39 | 2010 | HILI (1) | Saireito | Use of the original RUCAM of 1993 provided a RUCAM score of 8 and thereby a probable causality grading |
Korea | Ahn40 | 2004 | HILI (64) | Various herbs | Use of RUCAM with modifications by the authors, providing mostly probable and highly probable causality gradings |
Seo41 | 2006 | HILI (17) | Various herbs | RUCAM was used, and cases with a score of at least 3 were included |
Kang42 | 2008 | HILI (66) DILI (38) | Various herbs and drugs | RUCAM provided scores ≥4 for all HILI cases corresponding to a possible grading or higher |
Sohn43 | 2008 | HILI (24) | Various herbs | RUCAM was applied in all HILI patients undergoing a liver transplantation |
Kang44 | 2009 | HILI (1) | Corydalis speciosa | Reported was the use of a RUCAM version, modified by the authors, providing a score of 9 and thereby a highly probable causality grading |
Kim45 | 2009 | HILI (2) | Arrowroot | The use of RUCAM provided a score of 10 and thereby a highly probable causality grading |
Bae46 | 2010 | HILI (1) | Polygonum multiflorum | With RUCAM, a score of 10 was achieved corresponding to a highly probable causality grading |
Yang47 | 2010 | HILI (3) | Aloe vera or arborescens | RUCAM-based scores of 7 in 2 cases provided a probable causality grading, and a score of 9 was achieved in the third patient corresponding to a highly probable causality grading (a definitive one as erroneously stated in the text does not exist in the RUCAM system) |
Jung48 | 2011 | HLI (25) | Polygonum multiflorum | With RUCAM, the scores were 6-8, corresponding to a probable causality grading in 15 patients and were >9 corresponding to a highly probable causality grading in 10 patients |
Kim49 | 2012 | HILI (1) | Hovenia dulcis | Using RUCAM, a score of 6 was obtained for this case corresponding to a probable causality grading |
Suk50 | 2012 | HILI (149) DILI (101) | Various herbs and drugs | RUCAM-based evaluation for HILI cases provided an average score of 7, with a range of 3-12 and thereby a possible or probable causality grading |
Lee51 | 2015 | HILI (27) | Various herbs | Use of a modified RUCAM, lack of any quotation and of causality grading |
Lee52 | 2015 | HILI (97) | Various herbs | Using RUCAM, scores were 8.2±1.4; individual cases received mostly a highly probable or probable causality grading |
Woo53 | 2015 | HILI (5) | Various herbs | A simplified RUCAM was used that provided a probable causality grading |
Cho54 | 2017 | HILI (6) | Various herbs | RUCAM was used without specification of its version and referencing, providing a probable causality grading in four HILI cases and a possible grading in two cases |
Singapore | Wai55 | 2006 | HILI (15) DILI (14) | Various herbs and drugs | RUCAM was used in HILI patients for causality assessment, but individual causality gradings were not reported; it was mentioned that all cases fulfilled all RUCAM criteria collected in the course of a prospective study, which suggests a causality grading of at least probable due to the expected data completeness |
Teo56 | 2016 | HILI (10) | Various herbs | RUCAM was used in 10 assessable cases, with scores from 0 to 2 for 9 patients and a score of 5 for 1 patient |
There are several excellent publications, which could serve as examples for future publications on RUCAM-based HILI cases (Table 1). The encouraging report of Zhang et al.29 analyzed HILI cases in a perfect way, using the updated RUCAM of 2016, adopting a high threshold of liver tests (LTs) to avoid nonspecific liver injuries and providing for 26/28 cases a highly probable causality grading. As outlined in the report of Chau et al.,20 the interrater agreement between experts and RUCAM was 81%, facilitating evaluations and exclusion of cases with alternative causes or unclear herbal product identification. In general, RUCAM-based HILI series are preferred that cover in more detail a single herb, such as Gynura segetum and other pyrrolizidine alkaloids (PAs)-containing herbs like in the reports of Lin et al.21 and Gao et al.,22,25 or Psoralea corylifolia like in the reports of Cheung et al.19 and Li et al.,34 or Polygonum multiflorum (PM), as shown in the reports of Dong et al.,23 Wang et al.,27 Zhu et al.,28 Li et al.,31 Jing et al.,33 and Liu et al.35 For instance, Li et al.31 presented a perfect case report on HILI caused by PM, using the updated RUCAM of 2016.15 Similarly, the case series of Dong et al.23 focuses on PM on a single herb causing HILI in 18 patients, with each having received an individual causality grading of probable or highly probable. Since for 14/18 HILI patients, a highly probable causality grading was attributed, this is best explained by a careful case evaluation with complete data sets allowing for this extraordinary result. In addition and as shown in their report assessing causality by Gao et al.,25 RUCAM was used for the first time in the hepatic sinusoidal obstruction syndrome caused by PAs in 23 patients, supporting the blood pyrrole-protein adducts as diagnostic biomarkers.25 The reports of Hao et al.,24 Chow et al.32 and Tan et al.37 are worth mentioning because these authors clarify, already in their title, that cases had been assessed for causality using RUCAM. Tan et al.37 also carefully assessed the comedicated drug using a separate RUCAM sheet, as recommended earlier.12,15 As potential confounding alternative diagnosis, hepatitis E virus (HEV) infection was excluded in all three patients, and RUCAM-based data had been presented in a transparent list.37 In this study, most interesting was the finding of a high causality grading of probable, achieved with a score of 7; although liver injury by Swietenia macrophylla was unknown at the time of publication, providing a score of 0, not allowing additional scores. Therefore, lack of previous knowledge of liver injury does not prevent high causality gradings. Similarly, lacking unintentional readministration, which provides a score of 0, nevertheless allowed for a high causality grading.37 This again underscores the value of RUCAM by taking care of liver injury cases lacking some elements.
Japan
In Japan, the report of Tsuda et al.39 used the RUCAM of 1993 but there are no other RUCAM-based cases of HILI to be used for comparison with worldwide RUCAM-based HILI cases.
Korea
First authors of reports from Korea contributed as experts were numbered overall 15, and thereby represented a substantial number of publications on 526 HILI cases that had been assessed for causality using RUCAM (Table 1).40–54 These included single case reports, case series and review articles. Respective articles were mostly of good quality, with minor shortcomings. These included, for instance, the use of a RUCAM version modified by the authors for unknown reason(s) without own method re-validation,40,44,51,53 the inclusion of cases with a possible causality grading that impairs the focus on cases with a probable or highly probable causality grading40,41,42,50,52,54 using the RUCAM algorithm but leaving individual causality grading unreported for unknown reason(s),51,54 forgetting quotation of the used RUCAM publication,51,54 and classifying the original highly probable causality grading erroneously as definite.47 It seems that most reports followed a retrospective study approach (Table 1),40–54 whereas RUCAM instructions clearly recommend the use of RUCAM for prospective studies.15
Prefect studies were provided among others by Suk et al.,50 who followed a prospective design for their nationwide HILI study in Korea, and by Kim et al.,45 Bae et al.,46 Yang et al.,47 Jung et al.,48 Kim et al.,49 and Woo et al.,53 who all provided cases limited to a probable or highly probable causality grading, suggesting complete case data sets or prospective data collection in single case reports. Valuable is, also, the report of Kang et al.,44 who described a patient with a positive re-exposure result, as evidenced by a striking increase of serum alanine aminotransferase (ALT) activity shown in a separate figure and likely following the test criteria published earlier.15
Singapore
Groups from Singapore presented two reports, with altogether 25 HILI cases that had been assessed for causality using RUCAM.55,56 In the first report published 2006 by Wai et al.,55 a prospective study design was used that allowed for complete case data, conditions commonly facilitating high causality gradings. The second study published 10 years later by Teo et al.56 presented data from a retrospective analysis of spontaneous reports submitted to the national registry; respective causality gradings were extremely low due to incomplete case data, not unexpected under these study conditions.
Other Asian countries
There are virtually no valid reports on RUCAM-based HILI cases from authors residing in other Asian countries like Vietnam, Indonesia, Thailand, or India. Some reports could have been published in local language but not in English; it is also possible that RUCAM had not yet achieved a larger acceptance. With respect to RUCAM-based liver injury by Indian Ayurvedic medicines, two reports were published by authors outside of India, namely from Germany57 and the USA.58 In the report from Germany, the original RUCAM of 1993 was used and referenced for causality assessment, having provided scores of 6-8 as a probable causality grading for four concomitantly used herbal medicines, preferring one single herb with the highest score of 8.57 The USA report discussed RUCAM without providing a correct reference and attributed a score of 5 corresponding to a possible causality grading,58 while some questions including posology and product quality have been raised.59 It is well recognized that reports of Indian Ayurvedic medicine-related liver injury are sparse in the literature,60,61 which we found to include not only herbs but also other complementary and alternative medicines.61 An exemption refers to 8 cases of HILI by Indian products as reported in a RUCAM-based prospective study by the Indian group of Rathi et al.62 that was classified as a report of excellence.63 With respect to Ayurvedic and herbal medicine-induced liver injury, there is a refreshing statement by Devarbhavi:64Is it time to wake up and take notice. Indeed, the quality of HILI case evaluation is insufficient in many countries, including Asian ones, a topic that merits further discussion as outlined below.
Actual issues
Increasing use of RUCAM in Asia
There is now increasing awareness of the benefits provided by RUCAM among various countries, including China,18–38 Korea,40–54 and Singapore,55,56 as evidenced by reports initially published in 2004 from Korea40 and in 2006 from China18 and Singapore,55 with subsequent articles (Table 1). On top among the Asian countries is currently China, best explained by the large population and heavy use of herbal traditional Chinese medicines (TCMs), with increasing numbers of publications and cases until 2019.18–38 Korea ranks at the second position, followed by Singapore in third place (Table 1). Scientists from other Asian countries are more cautious using RUCAM, either to avoid disturbances with the politics of the national TCM-based health system, hospital-related issues, scientific society-based requirements, or that they just prefer their own CAMs (but this should not be the preferred solution and must be declined).
RUCAM essentials
RUCAM has a remarkable scientific run among experts of HILI and RUCAM as an appreciated diagnostic algorithm for assessing causality in liver injury cases, shown alone by the large list of RUCAM-based DILI and HILI cases published until 2015.15 Additional support for RUCAM came from a recent study of 46,266 DILI cases, which had been assessed for causality using RUCAM and were published from 2014 to 2019.11 For assessing causality in DILI or HILI cases, no other method exists with such a background of worldwide use and acceptance.11,15
Appreciation of RUCAM is also substantiated by the reports evaluated for the current analysis of 11,160 HILI cases (Table 1)15–56 that are validated by RUCAM for robust causality assessment. RUCAM is continuously used without problems,11,15 except for some minor questions, addressed and clarified in previous RUCAM publications.15–17 The updated RUCAM is as good as physicians and assessors are handling this method and strictly apply published recommendations.15 RUCAM has not been designed for chronic DILI and HILI or when a suspected injury occurs on pre-existing liver disease—both complex conditions where a more accurate approach especially for the timing of the events and the exclusion of alternative causes is needed. Problems were not found at the level of RUCAM itself but rather were related to poor quality case data or the users if they publish incorrect RUCAM-based causality gradings that had been lifted intentionally from possible to probable gradings. Otherwise, a recent analysis showed that RUCAM performs well provided the RUCAM users do a good job.11
The philosophy behind creating the original RUCAM of 1993 was to facilitate a valid diagnosis for patients with suspected liver injury. This led to the development of a liver-specific, quantitative, objective, transparent, and structured diagnostic algorithm12 which was well validated using cases with positive re-exposure tests as gold standard.13 An update was published later,15 with two different scales, one for cases of hepatocellular injury (Table 2) and one for the cholestatic or mixed liver injury (Table 3).15 This updated RUCAM is now in common use and should be applied for future cases replacing earlier versions.12,14 Occasionally, groups reported the use of RUCAM versions with their own unclear modifications (Table 1), but this attempt must be rejected because such modifications would require a new method validation that has never been provided. A clear unmodified diagnostic algorithm, such as the updated RUCAM of 2016, is essential for complex diseases, as are DILI and HILI, to avoid subjective evaluations and arbitrary conclusions; the RUCAM-based method uniformity will allow for valid comparison of case results between countries and continents.
Table 2.RUCAM worksheet for hepatocellular injury
Suspected product: | Date: |
Items for hepatocellular injury | Score | Result |
1. Time to onset from the beginning of the drug/herb | | |
• 5-90 days (rechallenge: 1-15 days) | +2 | □ |
• <5 or >90 days (rechallenge: >15 days) | +1 | □ |
Alternative: Time to onset from cessation of the drug/herb | | |
• ≤15 days (except for slowly metabolized chemicals: >15 days) | +1 | □ |
2. Course of ALT after cessation of the drug/herb | | |
Percentage difference between ALT peak and ULN | | |
• Decrease ≥50 % within 8 days | +3 | □ |
• Decrease ≥50 % within 30 days | +2 | □ |
• No information or continued drug use | 0 | □ |
• Decrease ≥50 % after the 30th day | 0 | □ |
• Decrease <50 % after the 30th day or recurrent increase | -2 | □ |
3. Risk factors | | |
• Alcohol use (current drinks/day: >2 for women, >3 for men) | +1 | □ |
• Alcohol use (current drinks/day: ≤2 for women, ≤3 for men) | 0 | □ |
• Age ≥55 years | +1 | □ |
• Age <55 years | 0 | □ |
4. Concomitant drug(s)/herb(s) | | |
• None or no information | 0 | □ |
• Concomitant drug/herb with incompatible time to onset | 0 | □ |
• Concomitant drug/herb with time to onset 5-90 days | -1 | □ |
• Concomitant drug/herb known as hepatotoxin and with time to onset 5-90 days | -2 | □ |
• Concomitant drug/herb with evidence for its role in this case (positive rechallenge or validated test) | -3 | □ |
5. Search for alternative causes Group I (7 causes) | Tick if negative | Tick if not done |
• HAV: Anti-HAV-IgM | □ | □ |
• HBV: HBsAg, anti-HBc-IgM, HBV-DNA | □ | □ |
• HCV: Anti-HCV, HCV-RNA | □ | □ |
• HEV: Anti-HEV-IgM. anti-HEV-IgG, HEV-RNA | □ | □ |
• Hepatobiliary sonography / Doppler / CT /MRC | □ | □ |
• Alcoholism (AST/ALT ≥2) | □ | □ |
• Acute recent hypotension history (particularly if underlying heart disease) | □ | □ |
Group II (5 causes) | | |
• Complications of underlying disease(s), such as sepsis, metastatic malignancy, autoimmune hepatitis, chronic hepatitis B or C, primary biliary cholangitis or sclerosing cholangitis, genetic liver diseases | □ | □ |
• Infection suggested by PCR and titer change for | | |
• CMV: anti-CMV-IgM, anti-CMV-IgG | □ | □ |
• EBV: anti-EBV-IgM, anti-EBV-IgG | □ | □ |
• HSV: anti-HSV-IgM, anti-HSV-IgG | □ | □ |
• VZV: anti-VZV-IgM, anti-VZV-IgG | □ | □ |
Evaluation of groups I and II | | |
• All causes-groups I and II – reasonably ruled out | +2 | □ |
• The 7 causes of group I ruled out | +1 | □ |
• 6 or 5 causes of group I ruled out | 0 | □ |
• Less than 5 causes of group I ruled out | -2 | □ |
• Alternative cause highly probable | -3 | □ |
6. Previous hepatotoxicity of the drug/herb | | |
• Reaction labelled in the product characteristics | +2 | □ |
• Reaction published but unlabeled | +1 | □ |
• Reaction unknown | 0 | □ |
7. Response to unintentional reexposure | | |
• Doubling of ALT with the drug/herb alone, provided ALT below 5×ULN before reexposure | +3 | □ |
• Doubling of ALT with the drug(s)/herb(s) already given at the time of first reaction | +1 | □ |
• Increase of ALT but less than ULN in the same conditions as for the first administration | -2 | □ |
• Other situations | 0 | □ |
Total score | |
Table 3.RUCAM worksheet for cholestatic or mixed liver injury
Suspected product: | Date: |
Items for cholestatic or mixed liver injury | Score | Result |
1. Time to onset from the beginning of the drug/herb | | |
• 5-90 days (rechallenge: 1-90 days) | +2 | □ |
• <5 or >90 days (rechallenge: >90 days) Alternative: Time to onset from cessation of the drug/herb | +1 | □ |
• ≤30 days (except for slowly metabolized chemicals: >30 days) | +1 | □ |
2. Course of ALP after cessation of the drug/herb Percentage difference between ALP peak and ULN | | |
• Decrease ≥50 % within 180 days | +2 | □ |
• Decrease <50 % within 180 days | +1 | □ |
• No information, persistence, increase, or continued drug/herb use | 0 | □ |
3. Risk factors | | |
• Alcohol use (current drinks/day: >2 for women, >3 for men) | +1 | □ |
• Alcohol use (current drinks/day: ≤2 for women, ≤3 for men) | 0 | □ |
• Pregnancy | +1 | □ |
• Age ≥55 years | +1 | □ |
• Age <55 years | 0 | □ |
4. Concomitant use of drug(s)/herb(s) | | |
• None or no information | 0 | □ |
• Concomitant drug/herb with incompatible time to onset | 0 | □ |
• Concomitant drug/herb with time to onset 5-90 days | -1 | □ |
• Concomitant drug/herb known as hepatotoxin and with time to onset 5-90 days | -2 | □ |
• Concomitant drug/herb with evidence for its role in this case (positive rechallenge or validated test) | -3 | □ |
5. Search for alternative causes | Tick if negative | Tick if not done |
Group I (7 causes) |
• HAV: Anti-HAV-IgM | □ | □ |
• HBV: HBsAg, anti-HBc-IgM, HBV-DNA | □ | □ |
• HCV: Anti-HCV, HCV-RNA | □ | □ |
• HEV: Anti-HEV-IgM, anti-HEV-IgG, HEV-RNA | □ | □ |
• Hepatobiliary sonography / Doppler / CT / MRC | □ | □ |
• Alcoholism (AST/ALT ≥2) | □ | □ |
• Acute recent hypotension history (particularly if underlying heart disease) | □ | □ |
Group II (5 causes) | | |
• Complications of underlying disease(s), such as sepsis, metastatic malignancy, autoimmune hepatitis, chronic hepatitis B or C, primary biliary cholangitis or sclerosing cholangitis, genetic liver diseases | □ | □ |
• Infection suggested by PCR and titer change for | | |
• CMV: anti-CMV-IgM, anti-CMV-IgG | □ | □ |
• EBV: anti-EBV-IgM, anti-EBV-IgG | □ | □ |
• HSV: anti-HSV-IgM, anti-HSV-IgG | □ | □ |
• VZV: anti-VZV-IgM, anti-VZV-IgG | □ | □ |
Evaluation of group I and II | | |
• All causes - groups I and II – reasonably ruled out | +2 | □ |
• The 7 causes of group I ruled out | +1 | □ |
• 6 or 5 causes of group I ruled out | 0 | □ |
• Less than 5 causes of group I ruled out | -2 | □ |
• Alternative cause highly probable | -3 | □ |
6. Previous hepatotoxicity of the drug/herb | | |
• Reaction labelled in the product characteristics | +2 | □ |
• Reaction published but unlabeled | +1 | □ |
• Reaction unknown | 0 | □ |
7. Response to unintentional reexposure | | |
• Doubling of ALP with the drug/herb alone, provided ALP below 2×ULN before reexposure | +3 | □ |
• Doubling of ALP with the drugs(s)/herbs(s) already given at the time of first reaction | +1 | □ |
• Increase of ALP but less than ULN in the same conditions as for the first administration | -2 | □ |
• Other situations | 0 | □ |
Total score | |
RUCAM evaluates seven key elements characteristic for liver injury, which are individually scored, and their summed score provides a final score and a final causality grading;15 for instance, final score of ≤0 excludes causality, of 1-2 is unlikely, of 3-5 is possible, of 6-8 is probable, and ≥9 is highly probable. The highest RUCAM-based causality level is not definite as erroneously described in some publications (Table 1) but clearly termed as highly probable,15 respecting the biological nature-based variability of liver injury and the associated lack of any definite or certain condition. In general, the highest final scores and associated high causality gradings are obtained with complete case data sets and are best achieved by a prospective study design as the primary aim of any causality assessment of liver injury cases.15 However, and if worse comes to worst, RUCAM is also applicable and prepared for liver injury cases assessed retrospectively, but this commonly leads to low final RUCAM scores and low causality gradings because RUCAM partially disqualifies missing data by low or negative scores to be subtracted from the final score. Low final scores often provide a possible causality grading, and respective cases should not be included in study cohorts of cases with a probable or highly probable causality grading, just to avoid a mix of cases with different causality gradings. Describing clinical features of liver injury cases should be based exclusively on cases with a probable or highly probable causality grading of RUCAM. This certainly applies for evaluations and descriptions of any new diagnostic biomarker, as well.65 Some diagnostic biomarkers are well established for HILI and DILI, but others came under scientific fire due to recent actions of the European Medicines Agency (known as the EMA) through the correct and official retraction of its earlier Letter of Support to promote biomarker research and use.65 The retraction by EMA was the consequence of faulty results based on studies misconducted by not-further identified liver injury experts.11,65 This official retraction represents currently, and in near future, a tricky dilemma for the scientific liver injury community.
Additional notes on HILI in Asia or elsewhere relating to RUCAM are warranted for reasons of clarity and transparency.66–71 A report of excellence is the careful systematic review on Chinese HILI and the use of RUCAM in 54 cases with high causality gradings, published by Zhang et al.29 A robust diagnostic algorithm, such as RUCAM, is commonly used in cases of DILI11 and HILI by TCMs,18–62,66 with more details provided in a recent systematic review on clinical characteristics and outcomes.66 This analysis compares the quality of three RUCAM-based study cohorts, preferring studies of single case reports which provide clinical data and RUCAM details of each patient with HILI by TCMs. The second choice are studies, which summarize the data of a series of patients with HILI by TCM. The third choice refers to studies of extremely low quality, which report the proportion of HILI by TCM in a mix with all DILI cases. This analysis also showed, for study cohorts with a fairly good case data quality, that RUCAM was used as a diagnostic tool in 97/203 studies (47.8%), whereby 154/203 studies (75.9%) were published in Chinese-language journals, which lacked individual references not open for re-evaluation by peers and without causality gradings; only 2/203 studies were prospective.66 Consequently, over half of the studies published in China did not benefit from a good CAM, calling for substantial improvement in future cases. Shortcomings are also evident in a USA Food and Drug Administration (FDA) study, which discusses issues of HILI and used the method of the Drug-Induced Liver Injury Network (DILIN)67 for causality assessment, which comes along without any specific element scoring and provides only arbitrary causality gradings as percentage ranges;68 additionally, other CAMs were used,67 known for being not specific for liver injury cases and not based on typical, individually scored liver-related key elements, as amply discussed previously15 and reiterated recently.68 No question, the strength of this FDA report would have been increased if the updated RUCAM of 201615 would have been used rather than just referencing publications on RUCAM.67 Critical is also the data source of used cases, which were partially retrieved from the USA’s National Institutes of Health LiverTox database,67 known for inclusion of liver injury cases lacking robust CAMs and being therefore disputed.69–71
Liver test thresholds
Liver injury is defined by increased serum activities of LTs: ALT of at least 5 times the upper limit of normal (ULN) and/or of alkaline phosphatase (ALP) of at least 2×ULN, best assessed simultaneously on the day of first presentation, as outlined in 2016.15 In the original RUCAM of 1993, ALT thresholds were lower, with at least 2×ULN,13 but should not be applied anymore to ensure exclusion of unspecific liver injury cases.15 The currently favored ALT and ALP threshold values of 201615 have also been considered as perfect in China by Yang et al.72 Therefore, and for reasons of comparability, in future publications on HILI, the use of the current thresholds and their mentioning in the text is urgently recommended, namely ALT ≥5×ULN and ALP ≥2×ULN. In fact, actual threshold information is often lacking in HILI publications (Table 1).18–56 Disregarding thresholds impedes clear differentiation between liver injury and LT abnormality.51 As expected, increasing ALT thresholds from ≥3×ULN to ≥5×ULN substantially reduces the case number of true HILI.73
Causality grading
RUCAM-based causality gradings are defined with highly probable being the top level.15 Attempts to modify the commonly used RUCAM gradings must be resisted. For instance, efforts to use the RUCAM gradings concomitantly with the arbitrary percentage ranges of causality gradings have been published, so far being favored by the disputable vague DILIN system, and to incorporate it in the RUCAM algorithm74—an approach that will not work. Just the opposite direction should be taken by incorporating the RUCAM-based scoring system in the DILIN method, rendering it then an excellent quantitative CAM, unrelated to the intransparent, subjective global introspection method used currently in the USA. Problematic are also post hoc uptonings of RUCAM-based causality gradings from possible up to probable.36 In addition and as confirmed in court, intentional uptonings of RUCAM scores from possible to probable gradings invalidate published conclusions,75,76 disregarding ethics among the scientific community.11
Epidemiology
Epidemiology aspects of liver injury remain an issue.51,54,73,77,78 A low HILI prevalence was found in a large retrospective single center study from Korea, in which 27/4769 patients (0.6%) with musculoskeletal disorders received TCMs, as reported by Lee et al.,51 with confirmed results through secondary evaluation by the same group.73 For Korea again, Cho et al.54 reported HILI prevalence results from a nationwide multicenter and prospective study with 6/1001 patients (0.6%). These results, from one single country and presented by two different groups, are surprising and require comments. With 0.6%, identical data of HILI prevalence were achieved;51,54,73 although, one group used a retrospective design, commonly known for its low case quality,51,73 whereas the other group followed a prospective protocol.54 The low prevalence data were achieved by both groups using HILI cases with ALT thresholds of at least 3×ULN, which included many cases with unspecific LT increases.51,54,73 With higher ALT thresholds of ≥5×ULN, HILI case numbers approached the zero range,73 signifying that all is now perfectly done, with reasonable results and without the need of further studies. Indeed, since 2017, no other HILI-related reports were published from Korea (Table 1). HILI is seemingly not a problem in Korea,51,54,73 similar to Germany, considering the low TCM-related HILI incidence data.77 In that report, liver injury data were derived from a prospective, hospital-based and large-scale study of 21,470 patients who had no liver disease prior to treatment with herbal TCM. Among these, 26 patients (0.12%) experienced HILI on formal grounds, as evidenced by ALT values of ≥5×ULN, but a probable causality was attributable to only 8/26 cases, a possible one to 16/26 patients, and an excluded one to 2/26 cases, using the updated RUCAM.77
In China, with around 1.4 billion inhabitants,36 conditions of HILI are more complex.36,78 In particular, valid epidemiology data of HILI are not available for the population; although, herbal TCMs are integral constituents of the Chinese health system. An earlier vain epidemiology analysis was not RUCAM-based and used mixed cohorts of injury cases by drugs, herbs, or CAMs.78 Instead, some improvements were evident in a more recent report, with the title focusing on incidence and etiology of DILI in mainland China, published in a 2019 issue of Gastroenterology.36 At least, it was now recognized that the use of RUCAM, as a valuable diagnostic algorithm, can help assess causality in liver injury cases.36 However, the respective cohorts were grouped under the term of DILI, and represented still not only DILI but also liver injury cases caused by herbal TCM and herbal dietary supplements, representing two different product categories and again providing conditions similar to the shortcomings of the earlier study78 and not allowing for characterization of HILI epidemiology features.36 Nevertheless, some progress is recognizable because other critical shortcomings have been well identified in the text under the limitation section.36 What’s more important, a new version of this study was already promised and will hopefully be published with inclusion of the updated RUCAM of 2016, now being without major flaws and after more careful peer reviews, preventing letters to the Editor. Under the current conditions, no valid statement is warranted on HILI epidemiology in China.36 Nevertheless, China is well prepared to present valid data on HILI cases, all assessed by RUCAM, as listed in Table 1 and referenced.18–38
For future studies on epidemiology, a reminder may be useful: epidemiology includes incidence and prevalence; hence, these two parameters are to be considered separately.79 The incidence of HILI and, of course, DILI is expressed as the total number of new injury cases during a certain period of time, divided by the number of individuals in the population initially at risk. The prevalence of liver injury by herbs or drugs is calculated as the total number of liver injury cases in the population at a given time, and it represents an estimate of how common liver injury can affect the general population at a fixed time. Consequently, incidence commonly provides information about the risk of acquiring new liver injury; whereas, prevalence signifies how widespread liver injury from herbs or drugs is. Prospective studies will provide best results on the incidence.
Case data quality
With a good study design, high-quality HILI cases are to be expected.15–17 Only a prospective study design that includes the use of the updated RUCAM15 will provide valid and complete data of HILI cases, with a high causality grading of probable or better highly probable. Case presentation should follow few principles.80 No question, the updated RUCAM can be used even for HILI cases obtained from retrospective studies; although, this is not the preferred approach. Based on the present experience, editors of journals should prefer publication of only articles dealing with HILI cases presenting good case data quality obtained prospectively using the updated RUCAM.
Herbal product quality
Basic requirements: Whenever a patient with assumed HILI is further evaluated clinically, one of the key questions relates to product quality, including herb authentication (Table 4). RUCAM is not destined to check for product quality. Of concern are impurities and adulteration by synthetic drugs that might have been added erroneously or intentionally to increase the efficacy of the herbal product.10,79 The quality of herbal medicines must be evaluated by toxicology methods, such approach is a routine measure in a TCM hospital in Germany, as described previously.77 In this clinical setting, only herbal TCMs of verified quality are used by the patients under care, raising the question of whether this quality concept contributes to the low number of HILI cases observed under these specific hospital conditions. In addition, the quality of herbs is influenced by other factors79 that are rarely considered in HILI case analyses published in Asian countries18–56 or elsewhere, including those in the recent Special Issue on “Drug, Herb, and Dietary Supplement Hepatotoxicity”, which presented much information in various articles on liver injury by herbal products.81 Therefore, these important and so far largely neglected aspects are discussed in much more detail below.
Table 4.Proposal for good quality of herbal medicines, safe use, and requirements for regulatory approved herbal drugs
Specific international qualification required for regulatory approved herbal drugs |
• Good Agricultural Practices • Good Manufacturing Practices • Definition of plant family, subfamily, species, subspecies, and variety • Definition of plant part • Definition of solvents and solubilizers • Lack of impurities, adulterants, and misidentifications • Minimum of batch and product variability • Lack of variety to variety variability • Brand name with details of ingredients, plant parts, batch number, and expiration date • Manufacturer with address • Regulatory specification of indication of herbal drug use • Daily dose with details of the application form • Maximum duration of herbal drug use • Efficacy of the herbal drug proven by valid randomized controlled trials • Description of adverse reactions and their frequency • Information of risk/benefit profile • Internationally approved regulatory surveillance • Regulatory harmonization to use the updated RUCAM in order to assess causality in suspected HILI cases |
Plant circadian clock system: The Nobel Prize in Physiology or Medicine 2017 was awarded to the three US scientists Michael W. Young, Michael Rosbash, and Jeffrey C. Hall for their discoveries on the molecular mechanisms controlling circadian rhythms (the physiological 24-hour body clock).82 Their discoveries explain how plants, animals, and humans adapt their biological rhythm so that it is synchronized with the Earth’s revolutions. They identified a gene which encodes a protein within the cell during the night that then degrades during the day. Sufficient evidence exists to mandate understanding plant physiology and consideration of plant circadian rhythm in manufacture of good quality herbal products.82,83 In experimental studies using plant leaves and mimicking the daylight, exposure of ultraviolet-C (short wavelength) to the Shell ginger (Alpinia zerumbet) of the ginger family (Zingiberaceae) modulates the relative chemical composition, changes the amounts of essential oils and total phenols, and alters the antioxidant activity.83 The circadian clock system in plants controls many important metabolic pathways and functions, including photosynthesis, stomatal opening, and molecular processes leading to gene expression.82 Transcriptional, translational, and post-translational processes are interlocked by feedback loops among morning- and evening-phased genes.83 Changing circadian rhythms may be an approach to gain improved plant quality, to prevent poor quality, or both.82,83 Better identifying their pathways and processes that are clock controlled and of benefit for the plants,84 however, is still a major multidisciplinary challenge of plant chronobiology.
Plant stress: Herbal product quality is also modified by biotic or abiotic plant stress, affecting higher plants.79 Biotic plant stress by pathogen attacks of other living organisms is caused by insects, larger grazing animals, parasites, bacteria, viruses, and fungi. Instead, abiotic stress is caused by environmental attacks, heavy ultraviolet radiation, draft, wounding, or soil contamination by salts or heavy metals.83,85,86 At the molecular level, plant stress leads to oxidative stress through generation of reactive oxygen species, damaging the plant’s integrity and impairing herbal product quality. This is triggered if radical scavenging chemicals, such as polyphenols, are absent in the plant under injurious stress.
Seasonal variation: Herbal quality is strongly dependent on the harvest time, shown recently as example for the roots of Cyathula officinalis, a popular TCM.87 Using a metabolomic approach based on gas chromatography-mass spectroscopy, 166 metabolites had been identified in these roots, 63 of which showed significant quantitative changes in different growth years of up to 4 years. It was suggested to harvest in the fourth grow year in order to boost herbal quality, and extending these studies to other plants.87 Such studies about variation of phytochemicals in different harvest times is in line with Good Agricultural Practice standards of Chinese traditional herbs in China. Fixing the harvest year will provide consistency of batches and herbal products with the desired phytochemicals as target ingredients.
Area of harvest: Unexpected were results obtained with PM, harvested from various regions of China and assessed for its hepatotoxic potential.88 This is an important study, since PM is much used in China and elsewhere, and known for its liver toxicity. These results showed that liver toxicity was obviously different among the various areas of harvest, and the most toxic PM was from the Sichuan Province. It is noteworthy that emodin was not considered the main hepatotoxin anymore, as opposed to previous studies.88–90 Preference is now given to both tetrahydroxystilbene-O-(galloyl)-hex and emodin-O-hex-sulfate as the primary offending agents.88
Case and herb listing: An optimum listing of several individual Asian herbs causing HILI should include cases with RUCAM-based causality assessment and high causality grading. Respective lists presented by authors of Asian countries in English language are scarce, partly due to the focus on DILI cases with neglect of HILI data (Table 1).18–56 Similarly, in one of the largest studies of DILI with HILI published within the last year, little attention was paid to a separate robust listing of herbs causing liver injury in China.37 Instead, a comprehensive list was provided by the exceptional study of Zhang et al.29
A few publications from authors outside of Asia have presented some case and herb listings of Asian HILI but with limited information. For instance, our group published initial lists of HILI by various herbal TCMs, with partially incomplete data regarding causality grading, RUCAM use, or quotation of respective reports.15,89–95 In one publication of 2014, HILI lists contained herbal TCMs, references, and data of causality assessments using criteria of re-exposure tests but RUCAM-based causality gradings were not provided.91 In the other report, these gradings were provided for a few HILI cases.92 Reports of 2015 presented HILI lists of TCM herbs with established causality93 or a large list of herbal TCMs causing HILI with exact case numbers but without RUCAM-based causality grading.94 A large list with individual RUCAM-based causality grading for various herbal TCMs was also published.95 Reports of 2016 presented case lists of HILI with TCM herbs and causality assessment by RUCAM or positive re-exposure tests89 or a country-wise case listing of HILI by herbal TCMs with exact numbers and references,90 and a large list of RUCAM-based injury cases by herbal TCMs, other herbs and drugs, all listed within the publication of the updated RUCAM.15 From outside of Asia, reports were published by authors of the USA on a few cases of HILI by TCMs.3,67 The large group of cases included in the first USA report would have benefitted if better stratified regarding RUCAM assessment.3 In contrast, cases presented by the second group of the USA FDA67 were partially assessed using the updated RUCAM15 or a unique, not validated evidence-based method. Cases were also derived from the LiverTox database,67 with its published problems in assessing a correct causality in liver injury cases.69–71 Data were also disappointing in another FDA report with attempted focus on the development of a database for herbal and dietary supplement-induced liver toxicity, but herbal TCMs and causality assessment by the updated RUCAM were explicitly not considered.96
Not included in this analysis were cases of HILI in association with the use of products derived from Camellia sinensis, consumed either as green tea beverage or green tea extracts (GTE) because respective publications by Asian authors are scarce; indeed, it is mainly a problem in Western countries, where many RUCAM-based reports were published on liver injury in connection with the use of GTE.97,98 Key issues around liver injury by GTE are obviously settled now, as the United States Pharmacopeia and DILIN members finally made it and confirmed that GTE are potentially hepatotoxic by using the updated RUCAM and thereby breaking boundaries to good medicine based on evidence and a diagnostic algorithm in line with artificial intelligence proposals.99
Networks and regulatory databases: Generally problematic are reports presented as network data when case presentations and causality assessments are poor.67,96,100 For instance, a network-based pharmacology study of the HILI potential of traditional hepatoprotective Chinese herbal medicines discusses aspects of liver injury without considering issues of causality assessment like the use of the updated RUCAM.100 Clearly, shortcomings of methodological requirements invalidate studies like this one. Unexpectedly, not a single case of HILI was found in a retrospective study of adverse events due to complementary health products in Singapore from 2010 to 2016; adverse events were reported to the Health Sciences Authority, and analyzed were overall 147,215 adverse event reports suspected to be associated with pharmaceutical products and complementary health products, which included Chinese traditional medicines.101 These data are at variance with another Singapore study of liver injury associated with CAM—a review of adverse event reports in an Asian community from 2009 to 2014, in which 10 assessable HILI cases provided weak RUCAM scores from 0 to 2 for 9 patients and a score of 5 for 1 patient.56 In another report from Singapore, RUCAM was used in 15 HILI patients for causality assessment, whereby all cases reportedly fulfilled all RUCAM criteria but individual RUCAM-based causality gradings were not reported.55 Data were collected in the course of a prospective study which suggest a causality grading of at least probable due to the expected data completeness. These data again underscore the complexity of accessing valid HILI data within a single country, but the overall conclusion can be reached that HILI is rare in Singapore. The reasons of these promising data are possibly related to the herbal product quality.101
Current and resolved controversies
In Korea, a HILI report published in 201551 contained shortcomings regarding the use of RUCAM (Table 1). There was intermittently a heavy dispute on the low HILI case frequency—forced by scientific societies, TV, and print press, and overall poor conditions for scientific discussions—but re-evaluation confirmed the initial conclusions and likely settled the disturbances, for now.73 Focusing on another report36 and the related Letters to the Editor102–104 by various DILI experts from China,102,104 India,103 and Iceland,103 discussions have emerged around the reported RUCAM-based DILI and HILI cases36 but it seems that the problems can well be solved in a new, promised prospective study, whereby the use of RUCAM may again be helpful, now applying its updated version.36,102–105 The cited problems focused, among others, on the retrospective design of the study36 and it was argued that results gathered retrospectively do not allow valid conclusions.102–104 This is why the updated RUCAM calls for prospective use.15
Guidelines
For China, guidelines exist with focus on the diagnosis of HILI (Fig. 1),106 HILI by herbal TCMs,107 and DILI.108 Several criteria are identical, others are variable.106–108 Therefore, and in future guidelines, some uniformity is desired to facilitate their use. This should include separate listing of RUCAM-based HILI and DILI cases without using concomitantly a non-RUCAM method to avoid confusion, providing RUCAM-based causality gradings for each case (Table 1), identical LT thresholds and liver pattern criteria, evaluating liver injury cases for typical features only if high causality grades such as highly probable or probable have been achieved, and the prospective use of the updated RUCAM with quotation of the corresponding publication.15 New guidelines should specifically address only diagnostic recommendations using the updated RUCAM and not include clinical data like general liver injury features unless derived from cases assessed for causality by RUCAM with high causality gradings.
Guidelines with the updated RUCAM should also be used for evaluation of liver injury in patients with COVID-19 infections to analyze whether the injury is caused by the virus itself (found in the liver),109,110 by other factors such as pre-existing liver disease,18,33,36 or the use of potentially hepatotoxic conventional drugs or herbal TCMs,33 conditions well described in publications from China.18,11,36 Finally, since acute respiratory syndrome is a severe complication in these patients, the liver injury could be caused by respiratory insufficiency leading to respiratory hepatopathy due to hepatic hypoxia, in analogy to cardiac hepatopathy, as detailed earlier111,112 and listed as important differential diagnosis of HILI and DILI.15