Article Details
Retrieved on: 2017-12-26 17:11:15
Excerpt
The SDK also handles unstructured text data and provides stemming, term normalization, vocabulary reduction, creation of a term-document matrix, and concept extraction with latent semantic indexing. It even has built-in facilities to draw a statistically representative sample from an Apache Spark Big ...