๐Ÿ“Š Data Analysis/๐ŸŽฏ Project

๋ฏธ๋‹ˆ ํ”„๋กœ์ ํŠธ : ์‚ฌํšŒ ์ดˆ๋…„์ƒ์˜ ์ „์…‹์ง‘ ๊ตฌํ•˜๊ธฐ (3)

ny:D 2024. 5. 4. 22:49

 

๋ฏธ๋‹ˆ ํ”„๋กœ์ ํŠธ : ์‚ฌํšŒ ์ดˆ๋…„์ƒ์˜ ์ „์…‹์ง‘ ๊ตฌํ•˜๊ธฐ (2)

๋ฏธ๋‹ˆ ํ”„๋กœ์ ํŠธ : ์‚ฌํšŒ ์ดˆ๋…„์ƒ์˜ ์ „์…‹์ง‘ ๊ตฌํ•˜๊ธฐ (1)๋ฏธ๋‹ˆ ํ”„๋กœ์ ํŠธ : ์‚ฌํšŒ ์ดˆ๋…„์ƒ์˜ ์ „์…‹์ง‘ ๊ตฌํ•˜๊ธฐ๋ถ„์„ ๋ชฉ์  : ์„œ์šธ์— ์ƒ๊ฒฝํ•œ ์‚ฌํšŒ ์ดˆ๋…„์ƒ์ด ์‚ด ์ˆ˜ ์žˆ๋Š” ๊ฐ€์žฅ ํ•ฉ๋ฆฌ์ ์ธ ์ „์…‹์ง‘์„ ์ฐพ๋Š” ๊ฒƒ.๋ฐ์ดํ„ฐ ์ˆ˜

archivenyc.tistory.com

 

๊ฐ€์„ค ์ˆ˜๋ฆฝ ๋‹จ๊ณ„์—์„œ ํ™œ์šฉํ•˜๊ธฐ ์ข‹์€ ์ž๋ฃŒ

→ ์„œ์šธ์—ฐ๊ตฌ๋ฐ์ดํ„ฐ ์„œ๋น„์Šค์— ๋”ฐ๋ฅด๋ฉด, ๊ฐ•๋‚จ๊ตฌ๊ฐ€ ์œ ์ž… ํ†ต๊ทผ์ž๊ฐ€ ๊ฐ€์žฅ ๋งŽ์Œ.  (๊ด‘์—ญ์ƒํ™œ๊ถŒ ๊ฐ€๊ตฌ ํ˜„ํ™ฉ๊ณผ ํŠน์„ฑ | ์„œ์šธ์—ฐ๊ตฌ๋ฐ์ดํ„ฐ์„œ๋น„์Šค)

 

→ ๋ถ€๋™์‚ฐ ์ค‘๊ฐœ ํ”Œ๋žซํผ ๋‹ค๋ฐฉ์— (์œ ํ˜• : ์˜คํ”ผ์Šคํ…”, ์•„ํŒŒํŠธ / ์ „์„ธ/ ๋ณด์ฆ๊ธˆ 2์–ต/ 7ํ‰์ด์ƒ) ๋งค๋ฌผ ๊ฒ€์ƒ‰ ๊ฒฐ๊ณผ ์œ„ ์‚ฌ์ง„๊ณผ ๊ฐ™์€ ๊ฒฐ๊ณผ ๋„์ถœ. (๊ด€์•…๊ตฌ์— ๋งค๋ฌผ์ด ๊ฐ€์žฅ ๋งŽ์„ ๊ฒƒ์ด๋‹ค.)

๐Ÿ‘€ ๋ฐ์ดํ„ฐ ์‹œ๊ฐํ™”

๊ฑด๋ฌผ ์šฉ๋„ ๋ณ„ ๋งค๋ฌผ ์ˆ˜

๋”๋ณด๊ธฐ

๐Ÿฃ code

SELECT *,
์˜คํ”ผ์Šคํ…” + ์—ฐ๋ฆฝ๋‹ค์„ธ๋Œ€ + ์•„ํŒŒํŠธ tot
FROM (
SELECT ์ž์น˜๊ตฌ๋ช…,
max(if(๊ฑด๋ฌผ์šฉ๋„ = "์˜คํ”ผ์Šคํ…”", ๋งค๋ฌผ์ˆ˜, NULL)) "์˜คํ”ผ์Šคํ…”",
max(if(๊ฑด๋ฌผ์šฉ๋„ = "์—ฐ๋ฆฝ๋‹ค์„ธ๋Œ€", ๋งค๋ฌผ์ˆ˜, NULL)) "์—ฐ๋ฆฝ๋‹ค์„ธ๋Œ€",
max(if(๊ฑด๋ฌผ์šฉ๋„ = "์•„ํŒŒํŠธ", ๋งค๋ฌผ์ˆ˜ , NULL)) "์•„ํŒŒํŠธ"
FROM (SELECT ์ž์น˜๊ตฌ๋ช…, ๊ฑด๋ฌผ์šฉ๋„, COUNT(*) ๋งค๋ฌผ์ˆ˜
FROM Rent_2022
WHERE ์ „์›”์„ธ๊ตฌ๋ถ„ = "์ „์„ธ"
AND ์ž์น˜๊ตฌ๋ช… in ('๋™๋Œ€๋ฌธ๊ตฌ', '๊ด‘์ง„๊ตฌ', '์†กํŒŒ๊ตฌ', '์„œ์ดˆ๊ตฌ', '์šฉ์‚ฐ๊ตฌ', '์ค‘๊ตฌ', '์„ฑ๋™๊ตฌ', '๊ฐ•๋‚จ๊ตฌ', '๊ด€์•…๊ตฌ')
AND ์ธต >=3 AND ์ž„๋Œ€๋ฉด์  >= 24 AND `๋ณด์ฆ๊ธˆ(๋งŒ์›)` <= 20000
AND (์ ‘์ˆ˜๋…„๋„ - ๊ฑด์ถ•๋…„๋„) < 10
GROUP BY 1, 2)t
GROUP BY 1)t2
ORDER BY 5 DESC

  • ์ „์ฒด ๋งค๋ฌผ์ˆ˜๊ฐ€ ๊ฐ€์žฅ ๋งŽ์€ ์ง€์—ญ์€ ๊ฐ•๋‚จ๊ตฌ(339)๋กœ ์•„ํŒŒํŠธ(130)์™€ ์˜คํ”ผ์Šคํ…”(192) ๋งค๋ฌผ๋„ ๊ฐ€์žฅ ๋งŽ๋‹ค.
  • ๊ด€์•…๊ตฌ์˜ ๊ฒฝ์šฐ ์—ฐ๋ฆฝ ๋‹ค์„ธ๋Œ€๊ฐ€ ์ „์ฒด ๋งค๋ฌผ์˜ ์•ฝ 84%๋ฅผ ์ฐจ์ง€ํ•œ๋‹ค. (๋”ฐ๋ผ์„œ, ์•„ํŒŒํŠธ์™€ ์˜คํ”ผ์Šคํ…”์ด๋ผ๋Š” ์กฐ๊ฑด ํ•˜์—์„œ๋Š” ๋งค๋ฌผ ์ˆ˜๊ฐ€ ๋งŽ์ง€ ์•Š๋‹ค.)

ํ‰๊ท  ํ‰๋‹น๊ฐ€

๋”๋ณด๊ธฐ

๐Ÿฃ Code

SELECT ์ž์น˜๊ตฌ๋ช…, 
		ROUND(AVG(`ํ‰๋‹น๊ฐ€(๋งŒ์›)`),0) "ํ‰๊ท ํ‰๋‹น๊ฐ€(๋งŒ์›)", ROUND(AVG(`๋ณด์ฆ๊ธˆ(๋งŒ์›)`),0) "ํ‰๊ท ์ „์„ธ๊ธˆ(๋งŒ์›)", ROUND(AVG(์ž„๋Œ€๋ฉด์ ),1) "ํ‰๊ท ๋ฉด์ ",  COUNT(*) ๋งค๋ฌผ์ˆ˜
FROM (SELECT *, `๋ณด์ฆ๊ธˆ(๋งŒ์›)`/ ์ž„๋Œ€๋ฉด์  * 3.306 as "ํ‰๋‹น๊ฐ€(๋งŒ์›)" FROM Rent_2022
	   		WHERE ์ „์›”์„ธ๊ตฌ๋ถ„ = "์ „์„ธ" 
	   		 	  AND ์ž์น˜๊ตฌ๋ช… in ('๋™๋Œ€๋ฌธ๊ตฌ', '๊ด‘์ง„๊ตฌ', '์†กํŒŒ๊ตฌ', '์„œ์ดˆ๊ตฌ', '์šฉ์‚ฐ๊ตฌ', '์ค‘๊ตฌ', '์„ฑ๋™๊ตฌ', '๊ฐ•๋‚จ๊ตฌ', '๊ด€์•…๊ตฌ')
	   		 	  AND ์ธต >=3
	   		 	  AND ๊ฑด๋ฌผ์šฉ๋„ in ("์˜คํ”ผ์Šคํ…”","์•„ํŒŒํŠธ") 
	   	   		  AND ์ž„๋Œ€๋ฉด์  >= 24
	   		 	  AND `๋ณด์ฆ๊ธˆ(๋งŒ์›)` <= 20000
	   		 	  AND (2022 - ๊ฑด์ถ•๋…„๋„) < 10) r 
GROUP BY 1 
ORDER BY 3

  • 1 ์ œ๊ณฑ๋ฏธํ„ฐ๋‹น ๋ณด์ฆ๊ธˆ์ด ๊ฐ€์žฅ ์ €๋ ดํ•œ ์ง€์—ญ์€ ์„œ์ดˆ๊ตฌ(1154๋งŒ์›) ๊ทธ๋Ÿฌ๋‚˜ ๋งค๋ฌผ ์ˆ˜๋Š” 35๊ฐœ(4์œ„)๋กœ ์ ์€ ํŽธ
  • ๋ฐ˜๋ฉด ๋งค๋ฌผ ์ˆ˜๊ฐ€ ๊ฐ€์žฅ ๋งŽ์€ ๊ฐ•๋‚จ๊ตฌ(322๊ฐœ)์˜ ๊ฒฝ์šฐ ํ‰๋‹น ๊ฐ€๊ฒฉ์€ 1715๋งŒ์›์œผ๋กœ 9๊ฐœ ์ง€์—ญ์ค‘ 5์œ„์ด๋‹ค.

์ง€์—ญ ๋ณ„ ์ธํ”„๋ผ ๋น„๊ต

๋”๋ณด๊ธฐ

๐Ÿฃ Code

SELECT *, 
	   ๋™๋ฌผ๋ณ‘์› + ์Šคํฌ์ธ ํด๋Ÿฝ + ์Šคํฌ์ธ ๊ฐ•์Šต tot
FROM (
SELECT ์‹œ๊ตฐ๊ตฌ๋ช…,
 	   max(if(์—…์ข… = "๋™๋ฌผ๋ณ‘์›", ์ ํฌ์ˆ˜, NULL)) "๋™๋ฌผ๋ณ‘์›",
 	   max(if(์—…์ข… = "์Šคํฌ์ธ ํด๋Ÿฝ", ์ ํฌ์ˆ˜, NULL)) "์Šคํฌ์ธ ํด๋Ÿฝ",
 	   max(if(์—…์ข… =  "์Šคํฌ์ธ  ๊ฐ•์Šต", ์ ํฌ์ˆ˜, NULL)) "์Šคํฌ์ธ ๊ฐ•์Šต"
FROM (SELECT ์‹œ๊ตฐ๊ตฌ๋ช…, ์„œ๋น„์Šค_์—…์ข…_์ฝ”๋“œ_๋ช… ์—…์ข…, sum(์ ํฌ_์ˆ˜) ์ ํฌ์ˆ˜
	  		FROM shop s INNER JOIN addr a ON s.ํ–‰์ •๋™_์ฝ”๋“œ*100 =a.ํ–‰์ •๋™์ฝ”๋“œ
		    WHERE ๊ธฐ์ค€_๋…„๋ถ„๊ธฐ_์ฝ”๋“œ = 20221 and (์„œ๋น„์Šค_์—…์ข…_์ฝ”๋“œ_๋ช… = "๋™๋ฌผ๋ณ‘์›" or ์„œ๋น„์Šค_์—…์ข…_์ฝ”๋“œ_๋ช… like '%์Šคํฌ์ธ %')
		    	  and ์‹œ๊ตฐ๊ตฌ๋ช… in ('๋™๋Œ€๋ฌธ๊ตฌ', '๊ด‘์ง„๊ตฌ', '์†กํŒŒ๊ตฌ', '์„œ์ดˆ๊ตฌ', '์šฉ์‚ฐ๊ตฌ', '์ค‘๊ตฌ', '์„ฑ๋™๊ตฌ', '๊ฐ•๋‚จ๊ตฌ', '๊ด€์•…๊ตฌ') 
		    GROUP BY 1,2)t
WHERE ์‹œ๊ตฐ๊ตฌ๋ช… in ('๋™๋Œ€๋ฌธ๊ตฌ', '๊ด‘์ง„๊ตฌ', '์†กํŒŒ๊ตฌ', '์„œ์ดˆ๊ตฌ', '์šฉ์‚ฐ๊ตฌ', '์ค‘๊ตฌ', '์„ฑ๋™๊ตฌ', '๊ฐ•๋‚จ๊ตฌ', '๊ด€์•…๊ตฌ') 
GROUP BY 1)t2
ORDER BY 5 DESC

  • ์ธํ”„๋ผ ์‹œ์„ค์ด ๊ฐ€์žฅ ๋งŽ์€ ์ง€์—ญ์€ ๊ฐ•๋‚จ๊ตฌ(973๊ฐœ)

์ƒ๊ด€๊ด€๊ณ„

1. ํ‰๋‹น๊ฐ€ - ์—ฐ์‹

  • ํ‰๋‹น๊ฐ€(๋งŒ์›)๊ณผ ์—ฐ์‹์€ ์Œ์˜ ์ƒ๊ด€๊ด€๊ณ„๋กœ ์—ฐ์‹์ด ์ž‘์„ ์ˆ˜๋ก ํ‰๋‹น๊ฐ€(๋งŒ์›)๊ฐ€ ๋†’์Œ
  • ๊ทธ๋Ÿฌ๋‚˜ ๊ฒฐ์ •๊ณ„์ˆ˜(R์ œ๊ณฑ) ๊ฐ’์ด 0.0274๋กœ 0์— ๊ฐ€๊น๊ธฐ ๋•Œ๋ฌธ์—, ์œ ์˜๋ฏธํ•œ ๊ด€๊ณ„๋ผ๊ณ  ๋ณด๊ธฐ๋Š” ์–ด๋ ค์›€.

 

2. ํ‰๋‹น๊ฐ€ - ์ธํ”„๋ผ์ˆ˜

  • ํ‰๋‹น๊ฐ€(๋งŒ์›)์™€ ์ธํ”„๋ผ ์ˆ˜๋Š” ์–‘์˜ ์ƒ๊ด€๊ด€๊ณ„๋กœ ํ‰๋‹น๊ฐ€(๋งŒ์›)์ด ๋†’์„์ˆ˜๋ก ์ธํ”„๋ผ ์ˆ˜๊ฐ€ ๋งŽ๋‹ค.
  • ๊ทธ๋Ÿฌ๋‚˜ ๊ฒฐ์ •๊ณ„์ˆ˜๊ฐ’(R^2)์ด 0.0979๋กœ 0์— ๊ฐ€๊น๊ธฐ ๋•Œ๋ฌธ์—, ์œ ์˜๋ฏธํ•œ ๊ด€๊ณ„๋ผ๊ณ  ๋ณด๊ธฐ๋Š” ์–ด๋ ค์›€.