study📚 31

pytube error
After installing the pytube library and running the code, the following two errors occurred.
cipher.py error
Location of the cipher.py file to edit:
Windows: C:/ProgramData\Anaconda3\Lib\site-packages/pytube
Mac: search with Spotlight, or open the folder Users/[Username]/opt/anaconda3/lib/python3.9/site-packages/pytube
See https://github.com/pytube/pytube/issues/1281
# original code
nfunc=function_match.group(1))
# changed code
nfunc=re.escape(function_match.group(1)))
# original code
nfunc=function_match.group(1)..
2022. 10. 30.
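The change quoted in this post wraps the captured name in `re.escape`. The sketch below shows why that can matter, assuming the captured name may contain regex metacharacters such as `$`. The name `$abc`, the JavaScript string, and the helper `find_definition` are made up for illustration and are not taken from pytube.

```python
import re

# Hypothetical function name containing a regex metacharacter ('$').
func_name = "$abc"

# Hypothetical JavaScript source in which that function is defined.
js = "var x=1;$abc=function(a){return a};"


def find_definition(name_pattern: str, source: str):
    """Search for '<name>=function(' with the name used as a regex pattern."""
    return re.search(name_pattern + r"=function\(", source)


# Unescaped: '$' is treated as an end-of-string anchor, so nothing matches.
unescaped = find_definition(func_name, js)

# Escaped: re.escape makes the name match as literal text.
escaped = find_definition(re.escape(func_name), js)

print(unescaped)         # None
print(escaped.group(0))  # $abc=function(
```

The same pattern silently fails or succeeds depending only on whether the name is escaped, which is the kind of failure the one-line patch addresses.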
ROUGE score
ROUGE (Recall-Oriented Understudy for Gisting Evaluation) is a performance metric for text summarization models. It is used to evaluate natural language generation models such as automatic text summarization and machine translation: the summary or translation produced by the model is compared against a reference prepared in advance by a human, and a score is computed from the comparison.
Suppose we have:
System summary (generated by the model): the cat was found under the bed
Reference summary (gold standard, usually written by a human): the cat was under the bed
The system summary and the reference summary share 6 overlapping words in total. However, for direct use as a performance metric, this number 6 is..
2022. 10. 30.
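The overlap count in this example can be checked in a few lines. The sketch below uses the standard unigram (ROUGE-1) definitions: recall divides the overlap by the reference length, and precision divides it by the system summary length. The function name `unigram_overlap` and whitespace tokenization are assumptions made for illustration.

```python
from collections import Counter

system_summary = "the cat was found under the bed"
reference_summary = "the cat was under the bed"


def unigram_overlap(system: str, reference: str) -> int:
    """Count shared words, clipping each word at its count in the other text."""
    sys_counts = Counter(system.split())
    ref_counts = Counter(reference.split())
    # Counter intersection keeps the minimum count of each word.
    return sum((sys_counts & ref_counts).values())


overlap = unigram_overlap(system_summary, reference_summary)
recall = overlap / len(reference_summary.split())   # overlap / reference length
precision = overlap / len(system_summary.split())   # overlap / system length

print(overlap, recall, round(precision, 3))  # 6 1.0 0.857
```

Normalizing by length is what turns the raw count of 6 into values that can be compared across summaries of different sizes.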
[Python/Machine Learning] Predicting insurance premiums - preprocessing
https://www.kaggle.com/datasets/mirichoi0218/insurance
# required Python libraries
import pandas as pd
import numpy as np
import seaborn as sns
import missingno
from sklearn.model_selection import train_test_split, GridSearchCV
from sklearn.impute import SimpleImputer
import matplotlib.pyplot as plt
from sklearn.preprocessing import MinMaxScaler
from sklearn.preprocessing import StandardScale..
2022. 10. 21.
[Machine Learning] scikit-learn - data preprocessing (data encoding, feature scaling and normalization)
Data Preprocessing: data cleansing; handling missing values (Null/NaN); data encoding (label and one-hot encoding); data scaling; outlier removal; feature selection, extraction, and engineering.
Data preprocessing is as important as the ML algorithm itself. First, missing values, that is Null/NaN values, are not allowed. Such Null values must be converted to some other fixed value. There are several ways to handle them: when only a few values of a feature are Null, they can be replaced with the feature's mean, and when most of the values are Null, it is better to drop the feature altogether. The hardest decision is when the share of Null values is above a certain level: if the feature is highly important and the Null values are..
2022. 9. 4.
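The two clear-cut cases described in this teaser (mean-fill when few values are missing, drop when most are missing) can be sketched as follows. This is written in plain Python rather than scikit-learn or pandas so it runs without extra packages. The function `handle_missing` and the `drop_ratio` cutoff of 0.8 are hypothetical; the post does not give a threshold, and the in-between case it goes on to discuss is not covered here.

```python
import math
from statistics import mean


def handle_missing(column, drop_ratio=0.8):
    """Return the column with NaNs mean-filled, or None if it should be dropped.

    drop_ratio is a hypothetical cutoff: at or above this share of missing
    values the feature is treated as "mostly Null" and dropped.
    """
    present = [v for v in column if not math.isnan(v)]
    if not present:
        return None  # nothing to compute a mean from
    missing_ratio = (len(column) - len(present)) / len(column)
    if missing_ratio >= drop_ratio:
        return None
    fill = mean(present)
    return [fill if math.isnan(v) else v for v in column]


nan = float("nan")
few_missing = handle_missing([20.0, nan, 40.0])    # NaN replaced by the mean
mostly_missing = handle_missing([nan] * 9 + [5.0])  # 90% missing: dropped

print(few_missing)     # [20.0, 30.0, 40.0]
print(mostly_missing)  # None
```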