๋ณธ๋ฌธ ๋ฐ”๋กœ๊ฐ€๊ธฐ

์ „์ฒด ๊ธ€36

[python/ํŒŒ์ด์ฌ] ๊ทธ๋ฃนํ™” - groupby(), agg(), set_index(), reset_index() ๊ทธ๋ฃนํ™” import pandas as pd df1 = pd.DataFrame({'ID' : [1, 2, 3, 4, 5], '๊ฐ€์ž…์ผ' : ['2021-01-02', '2021-01-04', '2021-01-10', '2021-02-10', '2021-02-24'], '์„ฑ๋ณ„' : ['F', 'M', 'F', 'M', 'M']}) df2 = pd.DataFrame({'๊ตฌ๋งค์ˆœ์„œ' : [1, 2, 3, 4, 5], 'ID' : [1, 1, 2, 4, 1], '๊ตฌ๋งค์›”' : [1, 1, 2, 2, 3], '๊ธˆ์•ก' : [1000, 1500, 2000, 3000, 4000]}) -๋ฌธ์ œ : df1 ์€ ํšŒ์›์˜ ์ •๋ณด๋ฅผ ์ €์žฅํ•œ ๋ฐ์ดํ„ฐ ํ”„๋ ˆ์ž„์ด๋ฉฐ, df2 ๋Š” ๊ฐ ํšŒ์›์˜ ๊ตฌ๋งค ๋‚ด์—ญ์„ ์ €์žฅํ•œ ๋ฐ์ดํ„ฐ ํ”„๋ ˆ์ž„์ด๋‹ค. ๊ฐ ํšŒ์›์˜ ์ •๋ณด์™€ ๊ตฌ.. 2022. 7. 29.
[python/ํŒŒ์ด์ฌ] ๋ฐ์ดํ„ฐ ํ”„๋ ˆ์ž„ ๊ฒฐํ•ฉ - ์ƒํ•˜ ๊ฒฐํ•ฉ, ์ขŒ์šฐ ๊ฒฐํ•ฉ ๋ฐ์ดํ„ฐ ํ”„๋ ˆ์ž„ ๊ฒฐํ•ฉ 1. ์ƒํ•˜ ๊ฒฐํ•ฉ pd.concat([df1, df2], axis = 0) : ๋™์ผํ•œ column ๊ธฐ์ค€์œผ๋กœ ์œ„/์•„๋ž˜๋กœ ํ•ฉ์น˜๊ธฐ, ํ–‰ ๊ธฐ์ค€์œผ๋กœ ๋ฐ์ดํ„ฐ ํ”„๋ ˆ์ž„ ๊ฒฐํ•ฉ import pandas as pd df1 = pd.DataFrame({'A' : [1, 2, 3], 'B' : [11, 12, 13], 'C' : [21, 22, 23]}) df2 = pd.DataFrame({'A' : [4, 5, 6], 'B' : [14, 15, 16], 'C' : [24, 25, 26]}) pd.concat([df1, df2]) pd.concat([df2, df1]) #index ์ดˆ๊ธฐํ™”๋ฅผ ์œ„ํ•ด์„œ ignore_index = True pd.concat([df1, df2], ignore_index = True #.. 2022. 7. 28.
[python/ํŒŒ์ด์ฌ] ๋ฐ์ดํ„ฐ ์ „์ฒ˜๋ฆฌ - apply, map ์„ ํ™œ์šฉํ•œ ๋ฐ์ดํ„ฐ ๋ณ€ํ™˜ apply, map ์„ ํ™œ์šฉํ•œ ๋ฐ์ดํ„ฐ ๋ณ€ํ™˜ import pandas as pd df = pd.DataFrame({'a' : [1, 2, 3, 4, 5]}) -๋ฌธ์ œ : a ๊ฐ€ 2 ๋ณด๋‹ค ์ž‘์œผ๋ฉด '2 ๋ฏธ๋งŒ', 4 ๋ณด๋‹ค ์ž‘์œผ๋ฉด '4 ๋ฏธ๋งŒ', 4 ๋ณด๋‹ค ํฌ๋ฉด '4 ์ด์ƒ' ์ด ์ €์žฅ๋œ b ์นผ๋Ÿผ์„ ์ถ”๊ฐ€ํ•˜๊ธฐ df df['b'] = 0 df a = df[df['a] = 2) & (df['a'] = 4 df['.. 2022. 7. 27.
[python/ํŒŒ์ด์ฌ] ๋ฐ์ดํ„ฐ ์ „์ฒ˜๋ฆฌ - ๋ ˆ์ฝ”๋“œ, ์นผ๋Ÿผ ์ถ”๊ฐ€ / ์‚ญ์ œ ๋ ˆ์ฝ”๋“œ, ์นผ๋Ÿผ ์ถ”๊ฐ€ / ์‚ญ์ œ import pandas as pd df = pd.DataFrame({'a' : [1, 1, 3, 4, 5], 'b' : [2, 3, 2, 3, 4], 'c' : [3, 4, 7, 6, 4]}) df 1. ์นผ๋Ÿผ(column) ์ถ”๊ฐ€ [] ์—ฐ์‚ฐ์ž๋ฅผ ์‚ฌ์šฉํ•˜์—ฌ ๊ธฐ์กด DataFrame์— ์ƒˆ ์—ด ์ถ”๊ฐ€ ํ•  ์ˆ˜ ์žˆ๋‹ค -๋ฌธ์ œ : 1, 3, 6, 4. 8 ๋กœ ์ด๋ฃจ์–ด์ง„ d ์นผ๋Ÿผ์„ ์ถ”๊ฐ€ํ•˜๊ธฐ df['d'] = [1, 3, 6, 4, 8] df -๋ฌธ์ œ : 1๋กœ ์ด๋ฃจ์–ด์ง„ e ์นผ๋Ÿผ ์ถ”๊ฐ€ํ•˜๊ธฐ df['e'] = [1, 1, 1, 1, 1] df ์ด๋ ‡๊ฒŒ ํ•˜๋Š” ๋ฐฉ๋ฒ•๋„ ์žˆ์ง€๋งŒ ์กฐ๊ธˆ ๋” ์‰ฝ๊ฒŒ ํ•˜๋Š” ๋ฐฉ๋ฒ•์ด ์žˆ๋‹ค. df['e'] = 1 df df.dtypes a int64 b int64 c int64 d int64 e.. 2022. 7. 26.