Sunday, August 20, 2017

DEAN 690_Week 10

Assignment: Week 10

Share
Book: Text Mining with R
Dataset
This dataset includes the factors Race/Ethnicity, Age, Education, Income, and Gender. The website also uses SAS and provides several tables covering different aspects of the data, including Adults with Diabetes Currently Taking Insulin, How Often Check Blood Sugar, and How Often Check Feet.
Topic: Virginia Diabetes and Prediabetes dataset from 2011 to 2014
Disease description: Diabetes is a chronic disease in which sugar levels in the bloodstream are above normal. It occurs when a person cannot produce (type 1) or properly use (type 2) insulin. Insulin is a hormone that moves sugar out of the bloodstream into the cells to be used for energy. Prediabetes is a condition in which blood sugar is high, but not high enough to be type 2 diabetes.
Questions
23. What is latent semantic indexing? What is it used for? What are the specific limitations of the method?
Latent semantic indexing is a technique in natural language processing that uses a mathematical method to analyze the relationships between terms and concepts.
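To make the idea concrete, here is a minimal sketch of latent semantic indexing in Python with NumPy. It assumes the usual formulation, where a term-document count matrix is reduced with a truncated singular value decomposition (SVD) and documents are then compared in the reduced "concept" space. The toy sentences, the function names, and the choice of k = 2 are all made up for illustration and do not come from the assignment or the dataset.

```python
import numpy as np

# Hypothetical toy corpus, for illustration only.
docs = [
    "insulin lowers blood sugar",
    "diabetes raises blood sugar",
    "insulin treats diabetes",
    "stata runs on mac computers",
    "wordstat runs on stata",
]

def build_term_document_matrix(docs):
    """Return (vocab, A) where A[i, j] counts term i in document j."""
    vocab = sorted({w for d in docs for w in d.split()})
    index = {w: i for i, w in enumerate(vocab)}
    A = np.zeros((len(vocab), len(docs)))
    for j, d in enumerate(docs):
        for w in d.split():
            A[index[w], j] += 1.0
    return vocab, A

def lsi_document_vectors(A, k):
    """Project documents into a k-dimensional concept space via truncated SVD."""
    U, s, Vt = np.linalg.svd(A, full_matrices=False)
    # Keep the k largest singular values; each row of the result is one document.
    return (s[:k, None] * Vt[:k, :]).T

def cosine(u, v):
    """Cosine similarity between two vectors."""
    return float(u @ v / (np.linalg.norm(u) * np.linalg.norm(v)))

vocab, A = build_term_document_matrix(docs)
doc_vecs = lsi_document_vectors(A, k=2)  # k=2 is an arbitrary choice for this toy data

sim_health = cosine(doc_vecs[0], doc_vecs[1])  # two diabetes-related sentences
sim_cross = cosine(doc_vecs[0], doc_vecs[4])   # diabetes sentence vs. software sentence
print(sim_health, sim_cross)
```

In this toy corpus the two diabetes-related sentences should end up much closer to each other in the concept space than either is to the software sentences, even though they share only some of their words. Real text would normally need tokenization, stop-word removal, and term weighting first, and the number of concepts k has to be chosen by hand.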
News
This news item announces the release of WordStat for Stata for Mac. It also introduces the software and its functions.
Topic: WordStat for Stata Now Available for Use on Mac Computers


Project Idea
Purpose:
Data visualization (Create more graphs)
Semantic Analysis
Learning:
Continue working through the tutorial: Machine Learning in Natural Language Processing using R

Class 4: https://ufal.mff.cuni.cz/~hladka/2013/docs/day-4.posted.pdf
