• English
  • Deutsch
  • Log In
    Password Login
    or
  • Research Outputs
  • Projects
  • Researchers
  • Institutes
  • Statistics
Repository logo
Fraunhofer-Gesellschaft
  1. Home
  2. Fraunhofer-Gesellschaft
  3. Artikel
  4. Linguistic feature classifying and tracing
 
  • Details
  • Full
Options
2017
Journal Article
Titel

Linguistic feature classifying and tracing

Abstract
We investigate the identification and analysis of linguistic (lexico-grammatical) features used by articles of a specific year of publication. Linguistic features differ from shallow features because they represent authors' lexico-grammatical writing styles and do not consider the bag-of-words model. Current literature uses shallow features for analyzing articles rather than using linguistic features. In contrast to related work, a new methodology is provided that identifies linguistic features that occur non-randomly for a specific year and that traces the development of the features over time based on training data. The features are created by applying a semantic structure using latent semantic indexing (LSI). This is also in contrast to related work. A linguistic feature-based prediction model is built that enables an automated assignment of articles to their years of publication. Test data are used to evaluate the prediction model. In a case study, the proposed methodology is applied to articles of the Springer book series 'Communications in Computer and Information Science' published from 2009 to 2013. They are assigned to their years of publication by the prediction model. Case study results show that the performance of the model outperforms the baseline.
Author(s)
Moohebat, Mohammadreza
University of Malaya, Department of Artificial Intelligence, Faculty of Computer Science and Information Technology, Kuala Lumpur, Malaysia
Raj, Ram Gopal
University of Malaya, Department of Artificial Intelligence, Faculty of Computer Science and Information Technology, Kuala Lumpur, Malaysia
Thorleuchter, Dirk
Fraunhofer-Institut für Naturwissenschaftlich-Technische Trendanalysen INT
Kareem, Sameem Binti Abdul
University of Malaya, Department of Artificial Intelligence, Faculty of Computer Science and Information Technology, Kuala Lumpur, Malaysia
Zeitschrift
Malaysian journal of computer science
Thumbnail Image
DOI
10.22452/mjcs.vol30no2.1
Externer Link
Externer Link
Language
English
google-scholar
Fraunhofer-Institut für Naturwissenschaftlich-Technische Trendanalysen INT
Tags
  • linguistic

  • Latent Semantic Index...

  • text mining

  • classification

  • Cookie settings
  • Imprint
  • Privacy policy
  • Api
  • Send Feedback
© 2022