Accuracy of Data Extraction of Non-English Language Trials with Google Translate

Background: Systematic reviews pride themselves on including all relevant evidence. However, study eligibility is often restricted to English language publications for practical reasons. Google Translate, a free Web-based resource for translation, has recently become available. However, it is unknown whether its translation accuracy is sufficient for Evidence-based Practice Center (EPC) systematic reviews. Therefore, we formally evaluated the accuracy of Google Translate for the purpose of data extraction from non-English language articles.

Methods: We retrieved 10 randomized controlled trials (RCTs) in each of eight languages (Chinese, French, German, Italian, Japanese, Korean, Portuguese, and Spanish) and eight observational studies in Hebrew, for 88 articles in total. Eligible studies were RCTs that reported per-treatment group results data (except for Hebrew language studies, where no RCTs were identified). Each article was translated into English using Google Translate. The time required to translate each study was tracked. Data from the original language versions of the articles were extracted by one of 10 fluent speakers who were current or former members of our EPC. Data from the English translated versions of the articles were extracted by one of five current EPC researchers who did not speak the given language. These five researchers also performed double data extraction on 10 English language RCTs. Data extracted included eligibility criteria, treatment description, study descriptors, quality issues, outcome description, and results. Extractors were also asked to estimate how much extra time was required for extraction compared with a similar English language article. For each study, pairs of data extractions were compared for agreement on each extracted item. We analyzed the percent agreement within sets of studies in each language for each extraction item and for groups of extraction items. We defined “high agreement” as at least 80 percent agreement within an item or article. The degree of agreement for each language was compared with that of the English language study comparisons using nonparametric tests.

Results: The length of time required to translate articles ranged from seconds (51 articles, 58 percent) to about 1 hour. Assessment by the English language data extractors indicated that “a little” extra time was required for 40 articles (45 percent) and “a lot” for 42 (48 percent). When evaluating all extraction items together, Portuguese and German articles had the best agreement between original and translated extractions, with high agreement between extractors for about 60 percent of the items, compared with 80 percent in English articles. Spanish, Hebrew, and Chinese had the lowest agreement (30 percent, 24 percent, and 8 percent, respectively). The absolute agreement and the proportion of items with high agreement were statistically significantly worse for all languages, compared with English. Eight of 10 English language articles had high agreement for all items, compared with 7 of 10 Portuguese articles; 6 of 10 German articles; 4 of 10 each for French, Italian, and Korean articles; 3 of 8 Hebrew articles; 3 of 10 each for Japanese and Spanish articles; and no Chinese articles.

Conclusion: Translation was not always possible, but generally required few resources. Across all languages, data extraction from translated articles was less accurate than from English language articles. Accurate extraction was possible for some articles in all languages except Chinese, with Portuguese and German articles yielding the most accurate extractions. Use of Google Translate has the potential to reduce language bias; however, reviewers may need to be more cautious about using data from these translated articles.
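The agreement measure described in the Methods can be illustrated with a minimal Python sketch. It assumes each comparison between the original-language extraction and the translated extraction is recorded as a simple match or mismatch; the function names and the example data are hypothetical and are not taken from the study.

```python
from typing import Sequence

# Threshold from the abstract: "high agreement" is at least 80 percent.
HIGH_AGREEMENT_THRESHOLD = 80.0


def percent_agreement(matches: Sequence[bool]) -> float:
    """Percent of comparisons in which the two extractions agreed."""
    if not matches:
        raise ValueError("need at least one comparison")
    return 100.0 * sum(matches) / len(matches)


def is_high_agreement(matches: Sequence[bool]) -> bool:
    """True when percent agreement meets the 80 percent threshold."""
    return percent_agreement(matches) >= HIGH_AGREEMENT_THRESHOLD


# Hypothetical data: one extraction item compared across 10 articles
# in a single language (8 matches, 2 mismatches).
item_matches = [True] * 8 + [False] * 2
print(percent_agreement(item_matches))
print(is_high_agreement(item_matches))
```

The same two functions could be applied per item (across the articles in a language) or per article (across its extracted items), which mirrors the "within an item or article" wording of the definition.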
