Análise de Técnicas de Aprendizado de Máquina para Classificar Notícias para Gerenciamento de Informação no Mercado de Café (Analysis of Machine Learning Techniques to Classify News for Information Management in Coffee Market)

Paulo Oliveira Lima Júnior (plima@nepomuceno.cefetmg.br)1, Luiz Gonzaga de Castro Júnior (gonzaga.ufla@gmail.com)2, André Luiz Zambalde (zamba@dcc.ufla.br)2


1Centro Federal de Educação Tecnológica de Minas Gerais
2Universidade Federal de Lavras

This paper appears in: Revista IEEE América Latina

Publication Date: July 2015
Volume: 13, Issue: 7
ISSN: 1548-0992


Abstract:
This paper presents an empirical study of machine learning techniques for text categorization. Specifically, it aims to classify news about the coffee market according to categories from the coffee supply chain. The objective is to measure the performance of three types of algorithms: Naïve Bayes-based, tree-based, and Support Vector Machine (SVM). A database of news collected from the web and labeled by human expert analysts is used in the learning phase. News extracted from the web is then classified automatically, following the same steps and terms as the human analysts, according to its relevance to each learned category. Tests on a real database show that Naïve Bayes-based algorithms perform better in this specific case.
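
As an illustration of the learning and classification phases described in the abstract, the following is a minimal sketch of one Naïve Bayes variant (multinomial, with add-one smoothing) in plain Python. The abstract does not say which variants or tools the authors used, so this is an assumption made for illustration only. The headlines, the category names "production" and "trade", and the function names are all hypothetical and do not come from the paper's database.

```python
import math
from collections import Counter, defaultdict

# Hypothetical toy data: these headlines and category labels are invented
# for illustration and are not taken from the paper's news database.
TRAIN = [
    ("frost damages arabica crop in minas gerais", "production"),
    ("harvest of arabica crop delayed by rain", "production"),
    ("coffee exports rise as port shipments grow", "trade"),
    ("exports to europe fall as shipments slow", "trade"),
]


def tokenize(text):
    """Lowercase and split on whitespace (a deliberately simple tokenizer)."""
    return text.lower().split()


def train(examples):
    """Learning phase: count documents per label and words per label."""
    doc_counts = Counter()
    word_counts = defaultdict(Counter)
    vocab = set()
    for text, label in examples:
        doc_counts[label] += 1
        for tok in tokenize(text):
            word_counts[label][tok] += 1
            vocab.add(tok)
    return doc_counts, word_counts, vocab


def classify(text, model):
    """Return the label with the highest log posterior for the given text."""
    doc_counts, word_counts, vocab = model
    total_docs = sum(doc_counts.values())
    best_label, best_score = None, -math.inf
    for label in doc_counts:
        # Log prior: share of training documents carrying this label.
        score = math.log(doc_counts[label] / total_docs)
        total_words = sum(word_counts[label].values())
        for tok in tokenize(text):
            if tok not in vocab:
                continue  # ignore words never seen during training
            # Add-one (Laplace) smoothing avoids zero probabilities.
            count = word_counts[label][tok] + 1
            score += math.log(count / (total_words + len(vocab)))
        if score > best_score:
            best_label, best_score = label, score
    return best_label


model = train(TRAIN)
print(classify("rain delays arabica harvest", model))
print(classify("shipments of coffee exports grow", model))
```

A comparison like the one the abstract reports would train tree-based and SVM classifiers on the same labeled news and score all three on held-out items; that part is not sketched here.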

Index Terms:
Information Management, Text Categorization, Machine Learning

