Tze-Min, Fong and Bali, Ranaivo-Malançon (2015) Using TEI XML Schema to Encode the Structures of Sarawak Gazette. International Journal of Social Science and Humanity, 5 (10). ISSN 2010-3646
PDF
Ranaivo.pdf Download (736kB) |
Abstract
Automatic extraction of information from old printed documents which have been digitised injudiciously will end up with a lot human corrections. To overcome the problem, one possible solution is to annotate the documents with some markups. This paper presents the encoding of the digitised sample of Sarawak Gazette published from 1903 until 1939 using the standard TEI XML schema. The output of the work is a set of six TEI XML templates that is considered to represent the different layout structures found in the studied samples.
Item Type: | Article |
---|---|
Uncontrolled Keywords: | Data structure, layout analysis, metadata, TEI P5 schema, unimas, university, universiti, Borneo, Malaysia, Sarawak, Kuching, Samarahan, ipta, education, research, Universiti Malaysia Sarawak |
Subjects: | T Technology > T Technology (General) |
Divisions: | Academic Faculties, Institutes and Centres > Faculty of Computer Science and Information Technology Faculties, Institutes, Centres > Faculty of Computer Science and Information Technology Academic Faculties, Institutes and Centres > Faculty of Computer Science and Information Technology |
Depositing User: | Karen Kornalius |
Date Deposited: | 11 Aug 2016 18:45 |
Last Modified: | 22 Jun 2021 16:18 |
URI: | http://ir.unimas.my/id/eprint/12923 |
Actions (For repository members only: login required)
View Item |