Multilingual analysis of twitter news in support of mass emergency events
Social media are increasingly becoming a source for event-based early warning systems in the sense that they can help to detect natural disasters and support crisis management during or after disasters. In this article the authors study the problems of analyzing multilingual twitter feeds for emergency events. Specifically, they consider tsunami and earthquakes as one possible originating cause of tsunami. Twitter messages provide testified information and help to obtain a better picture of the actual situation. Generally, local civil protection authorities and the population are likely to respond in their native language. Therefore, the present work focuses on English as ""lingua franca"" and on under-resourced Mediterranean languages in endangered zones, particularly Turkey, Greece, and Romania. The authors investigated ten earthquake events and defined four language-specific classifiers that can be used to detect earthquakes by filtering out irrelevant messages that do not relate to the event. The final goal is to extend this work to more Mediterranean languages and to classify and extract relevant information from tweets, translating the main keywords into English. Preliminary results indicate that such a filter has the potential to confirm forecast parameters of tsunami affecting coastal areas where no tide gauges exist and could be integrated into seismographic sensor networks.