Automated Language Identification of Bibliographic Resources

作者	Victoria Morris
出版日期	2019
內容	This article describes experiments in the use of machine learning techniques at the British Library to assign language codes to catalog records, in order to provide information about the language of content of the resources described. In the first phase of the project, language codes were assigned to 1.15 million records with 99.7% confidence. The automated language identification tools developed will be used to contribute to future enhancement of over 4 million legacy records.
刊名	Cataloging & Classification Quarterly
關鍵字	Language identification, machine learning, automatic metadata generation, metadata, legacy record enhancement
網址連結	Automated Language Identification of Bibliographic Resources

發布日期：2020年01月02日　最後更新：2020年01月02日

您在這裡