Support Vector Machine for Error Analysis in Machine Assisted English Chinese Technical Translation: A Comparative Study with RF and BPNN
Abstract
With the rapid advancement of globalization, technical translation has become crucial for effective cross cultural communication and technology dissemination. Machine assisted translation (MAT) enhances translation efficiency and quality but often suffers from tra nslation errors that affect output accuracy. This study introduces a support vector machine (SVM) approach to systematically analyze errors in English Chinese technical translation and compares its performance with Random Forest (RF) and Back Propagatio n N eural Network (BPNN). Using 5,000 sentence pairs from domains including mechanical engineering, electronic technology, and computer science, we extract grammatical features via dependency parsing, lexical features using TF IDF, and semantic features thr oug h Word2Vec embeddings. The task is treated as a multi class classification problem, targeting lexical, grammatical, semantic, and spelling errors. Experimental results demonstrate that SVM outperforms RF and BPNN in both classification accuracy and gene ral ization ability. SVM achieves 87.6% accuracy, compared to 79.5% for BPNN and 73.2% for RF. The SVM also exhibits superior performance in 10 fold cross validation with lower mean square error (MSE) and higher R² scores. The radial basis function (RBF) ke rne l yielded optimal results among tested kernel functions. This research provides valuable insights for optimizing MAT systems and suggests that future enhancements may be achieved through deeper learning models and expanded datasets.DOI:
https://doi.org/10.31449/inf.v49i10.8319Downloads
Published
How to Cite
Issue
Section
License
I assign to Informatica, An International Journal of Computing and Informatics ("Journal") the copyright in the manuscript identified above and any additional material (figures, tables, illustrations, software or other information intended for publication) submitted as part of or as a supplement to the manuscript ("Paper") in all forms and media throughout the world, in all languages, for the full term of copyright, effective when and if the article is accepted for publication. This transfer includes the right to reproduce and/or to distribute the Paper to other journals or digital libraries in electronic and online forms and systems.
I understand that I retain the rights to use the pre-prints, off-prints, accepted manuscript and published journal Paper for personal use, scholarly purposes and internal institutional use.
In certain cases, I can ask for retaining the publishing rights of the Paper. The Journal can permit or deny the request for publishing rights, to which I fully agree.
I declare that the submitted Paper is original, has been written by the stated authors and has not been published elsewhere nor is currently being considered for publication by any other journal and will not be submitted for such review while under review by this Journal. The Paper contains no material that violates proprietary rights of any other person or entity. I have obtained written permission from copyright owners for any excerpts from copyrighted works that are included and have credited the sources in my article. I have informed the co-author(s) of the terms of this publishing agreement.
Copyright © Slovenian Society Informatika







