Thai word segmentation on social networks with time sensitivity

Bibliographic Details
Main Authors: Ronran, Chirawan; Unankard, Sayan; Nadee, Wanvimol; Khomwichai, Nongkran; Sirirangsi, Rangsit
Format: Conference or Workshop Item
Language: English
Published: 2016
Online Access: http://repo.uum.edu.my/20123/1/KMICe2016%20362%20367.pdf
http://repo.uum.edu.my/20123/
http://www.kmice.cms.net.my/kmice2016/files/KMICe2016_eproceeding.pdf
Description
Summary: Social network services such as Twitter have had a huge impact on Thai culture. They have shifted the habits of many Thai people from watching television to regularly using computers or smartphones. Thai people also share their experiences and get information such as news on social networks. With the increasing number of micro-blog messages that originate and are discussed on social networks, Thai word segmentation is becoming a compelling research issue, as it is an important task in natural language processing. However, existing Thai segmentation approaches are not designed to deal with short and noisy messages such as tweets. In this paper, we propose a Thai word segmentation approach for social networks that exploits both the local context (in tweets) and the global context from Thai Wikipedia. We evaluate our approach on a real-world Twitter dataset. Our experiments show that the proposed approach segments Twitter messages more effectively than the baseline.
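
The summary does not describe the algorithm itself, so the following is only a rough sketch of the general idea of combining a global lexicon with words found locally in tweets; it is not the authors' method. It assumes greedy longest matching as the segmenter, a tiny hand-written word set standing in for a Thai Wikipedia vocabulary, and hashtags as the source of local words. The names `local_terms` and `segment` are hypothetical.

```python
def local_terms(tweets):
    """Collect hashtag bodies as candidate local words (an assumed heuristic)."""
    terms = set()
    for tweet in tweets:
        for token in tweet.split():
            if token.startswith("#") and len(token) > 1:
                terms.add(token[1:])
    return terms


def segment(text, lexicon):
    """Greedy longest matching; characters not covered by the lexicon
    are emitted as single-character tokens."""
    max_len = max((len(w) for w in lexicon), default=1)
    tokens, i = [], 0
    while i < len(text):
        # Try the longest candidate first, falling back to one character.
        for n in range(min(max_len, len(text) - i), 0, -1):
            piece = text[i:i + n]
            if n == 1 or piece in lexicon:
                tokens.append(piece)
                i += n
                break
    return tokens


# Hypothetical stand-in for a vocabulary drawn from Thai Wikipedia.
global_words = {"ฉัน", "กิน", "ข้าว"}
# Hypothetical tweet whose hashtag supplies a word missing from the global set.
tweets = ["อร่อยมาก #ต้มยำ"]

lexicon = global_words | local_terms(tweets)
print(segment("ฉันกินต้มยำ", lexicon))  # ['ฉัน', 'กิน', 'ต้มยำ']
```

With only `global_words`, the last word would fall apart into single characters, which illustrates why a lexicon enriched from the tweets themselves can help on short, noisy messages.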