Show simple item record

KHATT: Handwritten Arabic Text

ContributorParvez, Mohammad Tanvir
ContributorMärgner, Volker
ContributorAhmad, Irfan
ContributorAl-Khatib, Wasfi G.
ContributorMahmoud, Sabri A.
ContributorAlshayeb, Mohammad
ContributorFink, Gernot A.
Date2015
dc.date.accessioned2021-07-25T18:12:49Z
dc.date.available2021-07-25T18:12:49Z
IdentifierLDC2015T23
Identifierhttps://catalog.ldc.upenn.edu/LDC2015T23
IdentifierISBN:%201-58563-736-X
IdentifierISLRN:%20866-063-772-506-2
IdentifierDOI:%2010.35111/vc52-tm53
dc.identifier.urihttps://linghub.org/handle/123456789/1090767
Description*Introduction* KHATT: Handwritten Arabic Text was developed by King Fahd University of Petroleum & Minerals, Technical University of Dortmund and Braunschweig University of Technology. It is comprised of scanned Arabic handwriting from 1,000 distinct male and female writers representing diverse countries, age groups, handedness and education levels. Participants produced text on a topic of their choice in an unrestricted style. KHATT was designed to promote research in areas such as text recognition and writer identification. *Data* The majority of participants were natives of Saudi Arabia; the next largest group was from a collection of regional countries (Egypt, Jordan, Kuwait, Morocco, Palestine, Tunisia and Yemen). Most writers were between 16-25 years of age with high school or university qualifications. Scanned text is presented as tiff images scanned at 200, 300 and 600 DPI (dots per inch). The source images are four-page tiffs consisting of metadata about the writer, fixed paragraphs and free writing. Image files of isolated paragraphs or lines are also included. Ground-truth files are presented as plain-text Unicode. Data is divided into training, validation and test sets. *Samples* Please view this image sample and this text sample. *Updates* None at this time.
LanguageArabic
dc.language.isoara
PublisherLinguistic Data Consortium
Publisherhttps://www.ldc.upenn.edu
Relationhttps://catalog.ldc.upenn.edu/docs/LDC2015T23
TitleKHATT: Handwritten Arabic Text
TypeText
TypeStillImage
dcterms.accessRightsLicensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining
dcterms.bibliographicCitationMahmoud, Sabri A., et al. KHATT: Handwritten Arabic Text LDC2015T23. Web Download. Philadelphia: Linguistic Data Consortium, 2015
dcterms.contributorParvez, Mohammad Tanvir
dcterms.contributorMärgner, Volker
dcterms.contributorAhmad, Irfan
dcterms.contributorAl-Khatib, Wasfi G.
dcterms.contributorMahmoud, Sabri A.
dcterms.contributorAlshayeb, Mohammad
dcterms.contributorFink, Gernot A.
dcterms.date2015
dcterms.description*Introduction* KHATT: Handwritten Arabic Text was developed by King Fahd University of Petroleum & Minerals, Technical University of Dortmund and Braunschweig University of Technology. It is comprised of scanned Arabic handwriting from 1,000 distinct male and female writers representing diverse countries, age groups, handedness and education levels. Participants produced text on a topic of their choice in an unrestricted style. KHATT was designed to promote research in areas such as text recognition and writer identification. *Data* The majority of participants were natives of Saudi Arabia; the next largest group was from a collection of regional countries (Egypt, Jordan, Kuwait, Morocco, Palestine, Tunisia and Yemen). Most writers were between 16-25 years of age with high school or university qualifications. Scanned text is presented as tiff images scanned at 200, 300 and 600 DPI (dots per inch). The source images are four-page tiffs consisting of metadata about the writer, fixed paragraphs and free writing. Image files of isolated paragraphs or lines are also included. Ground-truth files are presented as plain-text Unicode. Data is divided into training, validation and test sets. *Samples* Please view this image sample and this text sample. *Updates* None at this time.
dcterms.extentCorpus size: 28956648 KB
dcterms.identifierLDC2015T23
dcterms.identifierhttps://catalog.ldc.upenn.edu/LDC2015T23
dcterms.identifierISBN:%201-58563-736-X
dcterms.identifierISLRN:%20866-063-772-506-2
dcterms.identifierDOI:%2010.35111/vc52-tm53
dcterms.issued2015-11-16
dcterms.languageArabic
dcterms.licenseLDC User Agreement for Non-Members: https://catalog.ldc.upenn.edu/license/ldc-non-members-agreement.pdf
dcterms.mediumDistribution: Web Download
dcterms.publisherLinguistic Data Consortium
dcterms.publisherhttps://www.ldc.upenn.edu
dcterms.relationhttps://catalog.ldc.upenn.edu/docs/LDC2015T23
dcterms.rightsHolderPortions © 2015 King Fahd University of Petroleum & Minerals, Trustees of the University of Pennsylvania
dcterms.titleKHATT: Handwritten Arabic Text
dcterms.typeText
dcterms.typeStillImage


Files in this item

FilesSizeFormatView

There are no files associated with this item.

This item appears in the following Collection(s)

  • OLAC
    Main data from the OLAC dataset

Show simple item record


Copyright  © 2020 All Rights Reserved by Prêt-à-LLOD Project.

Horizon 2020

This project has received funding from the European Union's Horizon 2020 research and innovation programme under grant agreement No 825182.