ABSTRACT
This research applies Latent Dirichlet Allocation on United States Automated Manifest System Bill of Lading data. We define a "bag of word" where each Harmonized tariff code represents a document, each shipper name be a token and count of shipments to be element of matrix. The result shows that topic model is able to classify some shippers of the same industries.
Recommendations
Joint sentiment/topic model for sentiment analysis
CIKM '09: Proceedings of the 18th ACM conference on Information and knowledge managementSentiment analysis or opinion mining aims to use automated tools to detect subjective information such as opinions, attitudes, and feelings expressed in text. This paper proposes a novel probabilistic modeling framework based on Latent Dirichlet ...
Topic modelling for qualitative studies
Qualitative studies, such as sociological research, opinion analysis and media studies, can benefit greatly from automated topic mining provided by topic models such as latent Dirichlet allocation LDA. However, examples of qualitative studies that ...
Topic models with power-law using Pitman-Yor process
KDD '10: Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data miningOne important approach for knowledge discovery and data mining is to estimate unobserved variables because latent variables can indicate hidden specific properties of observed data. The latent factor model assumes that each item in a record has a latent ...
Comments