Giter Site home page Giter Site logo

generative-ai-reading-list-and-dataset's Introduction

generative-ai-reading-list-and-dataset's People

Contributors

ji1kang avatar

Watchers

 avatar

generative-ai-reading-list-and-dataset's Issues

23๋…„ 8์›” 2์ผ: ๋กคํ”Œ๋ ˆ์ด ๊ด€๋ จ ๋ฐ์ดํ„ฐ

  1. ๋กคํ”Œ๋ ˆ์ด ํ…์ŠคํŠธ ๋ฐ์ดํ„ฐ (์˜๋ฌธ, with NSFW)
  1. ๋‚˜๋ฌด์œ„ํ‚ค ๋คํ”„ ๋ฐ์ดํ„ฐ (ํ•œ๊ธ€)
  1. Ployglot-ko์—์„œ ์‚ฌ์šฉํ–ˆ๋˜ ์ „์ฒ˜๋ฆฌ ์ฝ”๋“œ (pyspark) ๊ธฐ๋ฐ˜

23๋…„ 7์›” 25์ผ: ํ”„๋กฌํ”„ํŠธ์…‹ ์ƒ์„ฑ ๋ฐฉ๋ฒ•๋“ค, ์˜คํ”ˆ์†Œ์Šค LLM์˜ ๋งน์ 

1. Question Decomposition Improves the Faithfulness of Model-Generated Reasoning

  • ๋ฐฐ๊ฒฝ: CoT ๊ฐœ์„ ์„ ์œ„ํ•œ ๋ฐฉ๋ฒ• ์ œ์•ˆ. ์ด์ „ ์—ฐ๊ตฌ๋“ค์—์„œ CoT๊ฐ€ ์ตœ์ข… ๋‹ต๋ณ€์„ ์ž‘์„ฑํ•  ๋•Œ ๋‹ต๋ณ€๋ถ€์˜ ์ถ”๋ก ๋‹จ๊ณ„๋ถ€ํ„ฐ ์˜ค๋ฅ˜๊ฐ€ ๋‚  ๊ฒฝ์šฐ ์„ฑ๋Šฅ์ €ํ•˜ ๋ฌธ์ œ ์žˆ์Œ.
  • ์ œ์•ˆ: ์ด๋ฅผ ๊ฐœ์„ ํ•˜๊ธฐ ์œ„ํ•ด (1) ์—ฌ๋Ÿฌ๊ฐœ์˜ ์งˆ๋ฌธ์œผ๋กœ ์ชผ๊ฐ ๋‹ค์Œ์— ๋‹ต์„ ๋‚ด๋ฆฌ๋Š” ๋ฐฉ๋ฒ•, (2) ์ชผ๊ฐ  ์งˆ๋ฌธ์„ ์—ฌ๋Ÿฌ๋ฒˆ ์ƒ์„ฑํ•˜๋Š” ๋ฐฉ๋ฒ• (๋‘ ๋ฐฉ๋ฒ• ๋ชจ๋‘ ์งˆ๋ฌธ์„ ์ชผ๊ฐœ์ง€๋งŒ ํ•œ๋ฒˆ์— ์ชผ๊ฐœ๋Š๋ƒ, ๋‚˜๋ˆ ์„œ ์ชผ๊ฐœ๋Š๋ƒ๊ฐ€ ์ฐจ์ด)์ด ์žˆ์Œ
  • ์ฝ”๋ฉ˜ํŠธ: ์„ฑ๋Šฅ์—์„œ ๊ฒฐ๊ตญ CoT๋ฅผ ์ด๊ธฐ์ง€ ๋ชปํ•จ, ํ…Œ์Šคํฌ๋„ ๊ฐ๊ด€์‹ ๋ฌธ์ œ ๋งž์ถ”๊ธฐ๋ผ ๋‹ค๋ฅธ ํ…Œ์Šคํฌ์—๋„ ์ ์šฉ๋˜๋Š”์ง€ ํ™•์ธ ํ•„์š”
  • tag: Instruction Prompt

2. EmotionPrompt: Leveraging Psychology for Large Language Models Enhancement via Emotional Stimulus

  • ์ œ์•ˆ: ์‹ฌ๋ฆฌํ•™ ์ด๋ก (Social Identity theory, Social Cognition theory, Cognitive Emotion Regulation theory)์„ ํ™œ์šฉํ•ด ๋งŒ๋“  1-2๋ฌธ์žฅ์„ ๊ธฐ์กด ํ”„๋กฌํ”„ํŠธ์— ์ถ”๊ฐ€ํ•˜๋Š” ๊ฒƒ๋งŒ์œผ๋กœ๋„ zero-hot, few-shot์—์„œ ์„ฑ๋Šฅ์„ ํ–ฅ์ƒ๋œ๋‹ค๋Š” ๋…ผ๋ฌธ
  • ์ฝ”๋ฉ˜ํŠธ: ์žฌ๋ฐŒ๋Š” ์ ‘๊ทผ ๋ฐฉ์‹์œผ๋กœ ํ•œ๋ฒˆ ํ•ด๋ด๋„ ์ข‹์„๋“ฏ. LLM์ด ์‚ฌ๋žŒํ–‰๋™์„ ์‹œ๋ฎฌ๋ ˆ์ด์…˜ ํ•˜๋Š” ๋…ผ๋ฌธ๋“ค์ด ์žˆ๋‹ค๋Š” ๊ฑธ ๊ณ ๋ คํ–ˆ์„๋•Œ, ์‹ฌ๋ฆฌํ•™ ์ด๋ก ๋Œ€๋กœ ํ–‰๋™ํ•œ๋‹ค๋Š” ๊ฒƒ๋„ ์œ ์˜๋ฏธํ•œ ์–˜๊ธฐ์ธ๋“ฏ.
  • tag: Instruction Prompt
  1. Multiple character and novel Object based interaction Estimation
  • ์ œ์•ˆ: TRPG ๋ฐ์ดํ„ฐ์…‹์„ ํ™œ์šฉํ•˜์—ฌ ํ”Œ๋ ˆ์ด์–ด ํ”Œ๋ ˆ์ด ๋กœ๊ทธ๋ฅผ ๋ณด๊ณ  ๋‹ค์Œํ„ด์—์„œ (1) ์–ด๋–ค ํ”Œ๋ ˆ์ด์–ด๊ฐ€ (2) ์–ด๋–ค ์Šคํ‚ฌ์„ ํ• ์ง€ ์˜ˆ์ธกํ•˜๋Š” ํ…Œ์Šคํฌ ์ œ์•ˆ
  • ํ‰๊ฐ€๋Š” ๋ถ„๋ฅ˜ ํ…Œ์Šคํฌ์ฒ˜๋Ÿผ ์บ๋ฆญํ„ฐ์™€ ์Šคํ‚ฌ์— ๋Œ€ํ•œ precision, recall, f1์„ ์ธก์ •ํ•จ
  • tag: Multi-turn Dataset
  1. LLaMA ๊ฐ™์€ ์˜คํ”ˆ์†Œ์Šค LLM์„ ์‚ฌ์šฉ ํ•  ์ˆ˜ ์—†๋Š” ์ด์œ 
  • ์˜คํ”ˆ์†Œ์Šค LLM์ด ํฌ๊ฒŒ ๋ฐœ์ „ํ•œ๋‹ค๊ณ  ํ•ด๋„, ๋Œ€๋ถ€๋ถ„์˜ ๊ฒฝ์šฐ์—” Cloud, SaaS ํ˜• LLM์„ ์‚ฌ์šฉํ•˜๋Š” ๊ฒƒ์ด ์ด๋“์ผ ๊ฒƒ. ๋Œ€๋ถ€๋ถ„์˜ IT ํšŒ์‚ฌ๊ฐ€ ์„œ๋ฒ„๋ฅผ ์ž์ฒด๊ตฌ์ถ•ํ•ด์„œ ์‚ฌ์šฉํ•˜์ง€ ์•Š๊ณ , ํด๋ผ์šฐ๋“œ ์„œ๋น„์Šค๋ฅผ ์‚ฌ์šฉํ•˜๋Š” ๊ฒƒ๊ณผ ์œ ์‚ฌํ•จ.
  • ์‚ฌ์šฉ๋Ÿ‰์ด ์ผ์ •ํ•œ ๋Œ€๋Ÿ‰์˜ ์ง€์†์ ์ธ ๋ฐฐ์น˜ ์ž‘์—…์— ์‚ฌ์šฉํ•˜๊ฑฐ๋‚˜, ์ž„๋ฒ ๋”ฉ ๋ชจ๋ธ๋กœ ์‚ฌ์šฉํ•˜๋Š” ๊ฒฝ์šฐ๋Š” ์ž์ฒด ๊ตฌ์ถ•ํ•˜๋Š” ๊ฒƒ์ด ์ด๋“์ด ์žˆ์Œ.

23๋…„ 7์›” 30์ผ

1. DialogStudio: Towards Richest and Most Diverse Unified Dataset Collection and Instruction-Aware Models for Conversational AI

2. Challenges and Applications of Large Language Models

  • ์ž‘์—…์ค‘์ธ ์„œ๋ฒ ์ด ๋…ผ๋ฌธ

3. Skill-it! A Data-Driven Skills Framework for Understanding and Training Language Models

  • ์—ฌ๋Ÿฌ ๋ฐ์ดํ„ฐ์…‹์œผ๋กœ LM ํ›ˆ๋ จ์‹œ ํ•™์Šต ์ˆœ์„œ๊ฐ€ ์ค‘์š”ํ•  ์ˆ˜๋„ ์žˆ๊ฒ ๋‹ค... ํ•˜๋Š” ๋…ผ๋ฌธ

23๋…„ 8์›” 7์ผ: Llama2 Guideline, DeepSpeed-Chat, Personalization ๊ด€๋ จ ์„œ๋ฒ ์ด

  1. Llama2 Guideline
    https://ai.meta.com/llama/responsible-use-guide/

  2. DeepSpeed-Chat
    https://github.com/microsoft/DeepSpeed/tree/master/blogs/deepspeed-chat

  3. When Large Language Models Meet Personalization: Perspectives of Challenges and Opportunities
    https://arxiv.org/abs/2307.16376

  4. PEFT๋กœ LoRA Checkpoint ๋กœ๋“œ์‹œ size mismatch ํ•ด๊ฒฐ๋ฒ•
    https://junbuml.ee/lora-ckpt-size-mismatch

23๋…„ 7์›” 26์ผ: ๋กœ๋ผ ํ—ˆ๋ธŒ

  1. LoraHub: Efficient Cross-Task Generalization via Dynamic LoRA Composition
  • ํ—ˆ๊น…ํŽ˜์ด์Šค ๋ชจ๋ธํ—ˆ๋ธŒ์ฒ˜๋Ÿผ LoRA ๋ชจ๋ธ์„ ๊ฐˆ์•„๋ผ๋Š” ํ—ˆ๋ธŒ๋ฅผ ์ œ์•ˆํ•˜๋Š”๋“ฏ?
  • ์˜ค๋Š˜ ๊ธฐ์ค€์œผ๋กœ ์•„์ง๊นŒ์ง€ ์ฝ”๋“œ๋Š” ์—…๋ฐ์ดํŠธ ์•ˆ๋จ

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.