i will be speaking at the Big Data Bootcamp on Tuesday, May 22. The topic of my presentation will be "Dominate Your Big Text"
As Big Data marches ahead, more and more of that information is unstructured, from tweets to PDFs, and the percentage of unstructured information stored in NoSQL engines is rising fast. This session explores the options for synthesizing structure in big document sets. How do I impose order on my text? What tools can I use to find my text? How do I leverage corporate knowledge and structure to make my text easier to find? The ubiquity of full- text search makes finding this unstructured information possible. But what is the next step? How do you make it even easier to find your unstructured information? This session also focuses on taxonomies, auto-tagging, and faceted navigation of search results.