'분류 전체보기' 카테고리의 글 목록 (57 Page)

분류 전체보기 337

normalization, WordNetLemmatizer, PorterStemmer, LancasterStemmer, Storword

normalization Integrate different words to make them the same word-such as US is same as USA integrate them as US. 1. WordNetLemmatizer If words have different forms, find the root word-such as the root of 'am, are, is' is 'be'. from nltk.stem import WordNetLemmatizer lemmatizer=WordNetLemmatizer() words=[ 'have', 'going', 'loves', 'lives', 'flies', 'dies', 'watched', 'has', 'starting'] print('b..

Deep Learning 2021.03.05

gensim, Scikit-learn, NLTK, TreebankWordTokenizer, WordPunctTokenizer, sent_tokenize, pos_tag, word_tokenize, NLP, text_to_word_sequence, Corpus

Corpus Natural Language Data NLP - Natural Language Processing gensim - It is an open source library for unsupervised topic modeling and natural language processing, using modern statistical machine learning. Scikit-learn - SciPy Toolkit. It features various classification, regression and clustering algorithms including support vector machines. NLTK - The Natural Language ToolKit, is a suite of ..

Deep Learning/Tensorflow 2021.03.05

pandas-1. Series, reindex, isnull, notnull, fillna, drop, dropna, randn, describe, nan, value_counts, map, apply, concat

pandas It is a software library written for the Python programming language for data manipulation and analysis. In particular, it offers data structures and operations for manipulating numerical tables and time series. 1. Series One-dimensional array with values and index can be granted to each values. import pandas as pd sr=pd.Series([1000,2000,3000,4000],index=['aaa','bbb','ccc','ddd']) sr >>>..

Analyze Data/Python Libraries 2021.03.05

cosine similarity - A measurement that quantifies the similarity between two or more vectors. - It is the cosine of the angle between vectors. - The cosine similarity is described mathematically as the division between the dot product of vectors and the product of the euclidean norms or magnitude of each vector. - Reference : towardsdatascience.com/understanding-cosine-similarity-and-its-applica..

Analyze Data/Measure of similarity 2021.03.03

Euclidean distance - It is the length of a line segment between the two points. - The distance between two objects that are not points is usually defined to be the smallest distance among pairs of points from the two objects. - Smaller, Closer. In three dimensions, for points given by their Cartesian coordinates, the distance is Reference : en.wikipedia.org/wiki/Euclidean_distance def distance(x..

Analyze Data/Measure of similarity 2021.03.03

1 ··· 54 55 56 57 58 59 60 ··· 68

randn, kafka, Sigmoid function, classmethod, Step Function, batch size, d3js, Regular Expression, yield from, docker-compose, selectall, forward propagation, abstractmethod, axis, nvidia-smi, zeros, Filter, textdistance, global variable, cross-entropy,

일	월	화	수	목	금	토
		1	2	3	4	5
6	7	8	9	10	11	12
13	14	15	16	17	18	19
20	21	22	23	24	25	26
27	28	29	30

내 블로그 - 관리자 홈 전환	`Q` `Q`
새 글 쓰기	`W` `W`

글 수정 (권한 있는 경우)	`E` `E`
댓글 영역으로 이동	`C` `C`

이 페이지의 URL 복사	`S` `S`
맨 위로 이동	`T` `T`
티스토리 홈 이동	`H` `H`
단축키 안내	`Shift` + `/` `⇧` + `/`

¡Hola, Mundo!

분류 전체보기 337

티스토리툴바

개인정보

단축키

내 블로그

블로그 게시글

모든 영역