Foundations and Trends® in Finance > Vol 8 > Issue 3

Text and Context: Language Analytics in Finance

By Sanjiv Ranjan Das, Santa Clara University, Leavey School of Business, USA, srdas@scu.edu

 
Suggested Citation
Sanjiv Ranjan Das (2014), "Text and Context: Language Analytics in Finance", Foundations and Trends® in Finance: Vol. 8: No. 3, pp 145-261. http://dx.doi.org/10.1561/0500000045

Publication Date: 20 Nov 2014
© 2014 S. R. Das
 
Subjects
Financial markets,  Event studies/market efficiency studies,  Computational problems,  Financial econometrics
 
Keywords
G00 Finance
Text analyticsText miningFinancial analysis
 

Free Preview:

Download extract

Share

Download article
In this article:
1. What is Text Mining? 
2. Text Extraction 
3. Basic Text Analytics 
4. Text Classification 
5. Metrics 
6. Applications and Empirics 
7. Text Analytics – The Future 
Acknowledgements 
A. Sample text from Bloomberg for summarization 
References 

Abstract

This monograph surveys the technology and empirics of text analytics in finance. I present various tools of information extraction and basic text analytics. I survey a range of techniques of classification and predictive analytics, and metrics used to assess the performance of text analytics algorithms. I then review the literature on text mining and predictive analytics in finance, and its connection to networks, covering a wide range of text sources such as blogs, news, web posts, corporate filings, etc. I end with textual content presenting forecasts and predictions about future directions.

DOI:10.1561/0500000045
ISBN: 978-1-60198-910-9
133 pp. $90.00
Buy book (pb)
 
ISBN: 978-1-60198-911-6
133 pp. $120.00
Buy E-book (.pdf)
Table of contents:
1. What is Text Mining?
2. Text Extraction
3. Basic Text Analytics
4. Text Classification
5. Metrics
6. Applications and Empirics
7. Text Analytics – The Future
Acknowledgements
A. Sample text from Bloomberg for summarization
References

Text and Context

Text and Context: Language Analytics in Finance describes the current landscape of text analytics in finance. After a brief introduction, Section 2 examines how text is extracted from various web sites and services. Section 3 deals with the basics of text analytics such as dictionaries, lexicons, mood scoring, and summarization of text. This is followed by the analytics of text classification in Section 4. The performance of text analytic algorithms is assessed using a range of metrics in Section 5. A survey of the empirical literature on text mining in finance and the commercialization of textual analytics is discussed in Section 6. Finally, the author takes a look at the future of text analytics in Section 7.

 
FIN-045