Session chair: Mega Subramaniam
Document Representation and Clustering Models for Bilingual Documents Clustering
Shutian Ma & Chengzhi Zhang
Currently, the Internet has created many documents in languages other than English. People face challenges when seeking and using information, for example, non-native English-speaking students tend to have problems when utilizing libraries in North American universities. To help people efficiently organize information, bilingual documents clustering has advantages for practical utilization, it can divide documents into groups with the same topic and there is no need for a training dataset. Document representation and clustering models are two important parts in clustering. This paper compares four popular representation methods, Vector Space Model (VSM), Latent Semantic Indexing (LSI), Latent Dirichlet Allocation (LDA) and Doc2Vec (D2V), together with four different types of clustering algorithms, K-means++, BIRCH, DBSCAN and Affinity Propagation (AP) to identify appropriate combinations for bilingual documents clustering. Parallel corpus and comparable corpus are all used for the bilingual datasets. Experimental results show that, clustering performance varies when combining different representation methods with clustering algorithms. It’s important to make good choice of models for better documents organization.
Electronic commerce and consumption: semiotic scenario and information architecture in online stores
Maria Aparecida Moura & Cássia Adriana Gomes
This work discusses changes in the consumption patterns with the spread of electronic commerce, which started to demand new informational competences in commercial relations. We highlight the concepts of semiotic scenario and information architecture, analyzing the implications of such concepts in the structure of Websites dedicated to electronic commerce. In addition, we describe a system for informational organization of the Brazilian e-stores, analyzing the interactions between the e-customer with the Website’s information architecture, along with the subjacent semiotic scenario to the business model.
Web Search Selection Criteria of Foreign-Language Searchers
Victor K. Ondego & Anita Komlodi
Searchers today are often faced with searching in a language that is not their native language. Searching in a foreign language will necessarily differ from searching in a native language. In this paper we examine one step of the foreign-language searching process: selection decisions about web search results. We report the results of an analysis of participant statements about why a certain web result was selected by 15 Hungarian and 15 Chinese native language speakers who searched in English. Our results confirm criteria identified by prior research that indicates that information seekers pay attention to topic, source, past experience and format to a slightly lesser degree.
Information Seeking Among Latinos in the Midwestern United States
Denice Adkins & Heather Moulaison Sandy
This paper is the first to present a study of information behavior of Latinos in the Midwestern U.S. It begins addressing gaps in knowledge about Latinos, particularly immigrants, and their information seeking and use of the mobile, social web in rural areas of the U.S. Midwest. Interviews were conducted in 2016 and 2017 with 20 Mid-western Latinos — 15 immigrants to the United States and five native-born. We find that interviewees use many well-known social media platforms such as Facebook and YouTube as well as online resources to meet their information needs. Many prefer to consult trusted sources, making social media access desirable. Mid-western Latinos have adopted a number of the same information seeking practices as other immigrants and marginalized peoples, which may be different from practices of dominant populations in rural areas. Given their preferences, Latinos we interviewed will be best served by access to trusted information in the languages they speak, often presented orally such as in video format. For this, a reliable internet connection and internet-enabled device are key.
The Scholarly Discourse and Topical Interconnections within the Realm of Information Behavior Research
Rong Tang & Chaoqun Ni
In response to the lack of clarity concerning both the conceptual boundaries of information behavior (IB) and the interrelationship between IB and multiple related topics, this study attempts to establish an intellectual profile of IB scholarship by examining publications of IB and 13 other related topics. Results show a similar growth pattern of IB publications and authors over past 45 years. Multi-Dimensional Scaling analysis based on top frequent terms of the 14 topics revealed a configuration of three clusters and one individual entity (information ecology). IB is found to cluster with topics of seek, need, use, practice, and experience. As the very first study investigating the underlying conceptual connection between IB and 13 other related topics, results of the study help to outline the intellectual affinity of various concepts used in IB related research, add further insights into identifying points of convergence and departure as well as scholarship configuration within the realm of information research.