level2-cv-datacentric-cv-07's Introduction

CV Team 07 BIG-I 👁️

image

  • Project name : Text Detection Project
  • Overall project period (2 weeks) : 2024.01.24 ~ 2024.02.01 19:00

Project Overview

  • OCR (Optical Character Recognition) is a representative computer vision technology that lets computers recognize the text in images; this project addresses the text detection task
  • The goal is to build a model that accurately detects text regions in a dataset of medical bill receipt images. However, in keeping with the Data-Centric theme, the model provided in the baseline code must be used as-is
  • In this competition, evaluation is carried out by submitting an output.csv file in UFO format generated by the model. The file contains the coordinates of the bounding boxes detected as text regions, and it is scored with the DetEval method
  • The baseline uses an EAST (An Efficient and Accurate Scene Text Detector) model tuned to find small text more reliably
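As a rough illustration of the bounding-box information mentioned above, the sketch below reads word boxes from a UFO-style annotation. The nesting used here (`images` → file name → `words` → word id → `points`) is an assumption about the UFO layout, and the file name, word ids, coordinates, and the helper `load_boxes` are hypothetical.

```python
import json

# Hypothetical minimal annotation in the ASSUMED UFO layout:
# images -> file name -> words -> word id -> points (four [x, y] corners).
ufo_text = """
{"images": {"receipt_001.jpg": {"words": {
    "0001": {"points": [[10, 10], [110, 10], [110, 40], [10, 40]]},
    "0002": {"points": [[10, 60], [90, 60], [90, 85], [10, 85]]}
}}}}
"""


def load_boxes(ufo):
    """Return {file name: list of quadrilaterals} from a UFO-style dict."""
    return {
        name: [word["points"] for word in entry.get("words", {}).values()]
        for name, entry in ufo["images"].items()
    }


boxes = load_boxes(json.loads(ufo_text))
for name, quads in boxes.items():
    print(name, len(quads), "boxes")
```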

Evaluation Method

  • Evaluated with the DetEval method
  • Area Recall and Area Precision are computed in advance for every ground-truth/predicted box pair
  • All ground-truth boxes and predicted boxes are traversed, and each pair is checked for a match so that correctness is measured at the box level
  • A pair is considered for matching when its Area Recall and Area Precision are greater than 0, and a box counts as correct when Area Recall is at least 0.8 and Area Precision is at least 0.4
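The thresholds above can be sketched as follows. This is a simplified illustration, not the competition's scoring code: it assumes axis-aligned boxes written as `(x1, y1, x2, y2)` and checks only pairwise matches, whereas the full DetEval protocol also handles one-to-many and many-to-one matches. All function names and sample boxes are hypothetical.

```python
def area(box):
    """Area of an axis-aligned box (x1, y1, x2, y2); 0 if the box is empty."""
    x1, y1, x2, y2 = box
    return max(0.0, x2 - x1) * max(0.0, y2 - y1)


def intersection(a, b):
    """Overlap area of two axis-aligned boxes."""
    return area((max(a[0], b[0]), max(a[1], b[1]),
                 min(a[2], b[2]), min(a[3], b[3])))


def pairwise_matches(gt_boxes, pred_boxes, recall_thr=0.8, precision_thr=0.4):
    """Return (gt index, pred index) pairs that pass both area thresholds."""
    matches = []
    for i, gt in enumerate(gt_boxes):
        for j, pred in enumerate(pred_boxes):
            inter = intersection(gt, pred)
            if inter <= 0:
                continue  # no overlap, so the pair is never considered
            area_recall = inter / area(gt)       # share of the GT box covered
            area_precision = inter / area(pred)  # share of the prediction on GT
            if area_recall >= recall_thr and area_precision >= precision_thr:
                matches.append((i, j))
    return matches


def box_level_scores(gt_boxes, pred_boxes):
    """Box-level recall, precision, and their harmonic mean."""
    matches = pairwise_matches(gt_boxes, pred_boxes)
    recall = len({i for i, _ in matches}) / len(gt_boxes) if gt_boxes else 0.0
    precision = len({j for _, j in matches}) / len(pred_boxes) if pred_boxes else 0.0
    total = recall + precision
    hmean = 2 * recall * precision / total if total else 0.0
    return recall, precision, hmean


gt = [(0, 0, 100, 20), (0, 40, 100, 60)]
pred = [(0, 0, 90, 20), (200, 200, 220, 210)]
scores = box_level_scores(gt, pred)
print(scores)
```

Here the first prediction covers 90% of the first ground-truth box and lies entirely inside it, so it passes both thresholds; the second ground-truth box and the second prediction stay unmatched.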

Team Members and Roles

김한규 민하은 이하연 심유승 안채연 강동기
  • 강동기: JSON file authoring, Data Labeling, EDA
  • 김한규: Baseline model analysis, EDA, Data Labeling
  • 민하은: Data Labeling, EDA, Dataset comparison experiments
  • 심유승: Hypothesis setting and experiment design, Dataset creation, Data Labeling
  • 안채연: Dataset comparison experiments, Data Labeling, EDA
  • 이하연: Data Labeling, server environment setup, EDA, retraining code

Project Procedure and Results

  1. Baseline model analysis
    • In about 16% of the images, noise such as stains was misrecognized as text
    • In about 33% of the images, the title area at the top of the document was incorrectly recognized as text
    • In about 20% of the images, part of the QR code was misrecognized as text
    • In about 19% of the images, the vertical text next to the QR code was not recognized well

image

  2. Hypothesis setting
    • Because the train dataset contains no noise, the model did not properly learn to handle noise
    • For the title at the top of the document and the area around the QR code, we focused on the fact that these parts take up a very small share of the document (most annotations are for the small text written inside tables)
  3. Train dataset creation
    • Dataset A : 50 images with added noise and 50 images without noise, 100 images in total
    • Dataset B : 60 images were created in which the document's top title and the QR code take up a larger share, and 0, 40, 60, 80, and 100 images from Dataset A were added to them to form 5 sub-datasets
  4. Experiments and results

image

image
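The five Dataset B sub-datasets described in the train dataset step can be sketched as below. The README does not say how the Dataset A images were chosen for each subset, so the seeded random sampling here is an assumption, and the file names and the helper `build_sub_datasets` are hypothetical.

```python
import random


def build_sub_datasets(dataset_a, dataset_b,
                       added_counts=(0, 40, 60, 80, 100), seed=0):
    """Combine all of Dataset B with n images drawn from Dataset A.

    ASSUMPTION: images are drawn uniformly at random without replacement;
    the original selection rule is not documented.
    """
    rng = random.Random(seed)
    return {n: list(dataset_b) + rng.sample(dataset_a, n) for n in added_counts}


# Hypothetical file lists with the sizes given in the README.
dataset_a = [f"a_{i:03d}.jpg" for i in range(100)]
dataset_b = [f"b_{i:03d}.jpg" for i in range(60)]

subsets = build_sub_datasets(dataset_a, dataset_b)
sizes = {n: len(files) for n, files in subsets.items()}
print(sizes)
```

Fixing the seed keeps each subset reproducible, which makes comparisons between the five training runs easier to interpret.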

level2-cv-datacentric-cv-07's Issues

:sparkles: feat : Add augmentation code

Description

  • Added code to augment the labeled dataset
  • Four augmentation methods were used: GaussNoise, RandomFog, ColorJitter1, and ColorJitter2
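To make the first of these augmentations concrete, here is a minimal standard-library sketch of additive Gaussian noise on a grayscale image stored as nested lists. The names in the issue resemble transform classes in the Albumentations library, but the issue does not state which library was used, so this is an illustration of the technique rather than the team's code; `add_gaussian_noise` and its `sigma` parameter are hypothetical.

```python
import random


def add_gaussian_noise(image, sigma=10.0, seed=None):
    """Add zero-mean Gaussian noise to each pixel and clamp to 0..255.

    `image` is a list of rows of integer gray levels.
    """
    rng = random.Random(seed)
    return [
        [min(255, max(0, round(pixel + rng.gauss(0.0, sigma)))) for pixel in row]
        for row in image
    ]


clean = [[128] * 4 for _ in range(3)]           # 3x4 mid-gray test image
noisy = add_gaussian_noise(clean, sigma=10.0, seed=0)
for row in noisy:
    print(row)
```

A larger `sigma` gives stronger noise; in practice the value would be tuned so that the text stays legible while stains and scan artifacts are imitated.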
