
w-henderson / projectpositivevibes


🤝 Using large language models to seamlessly help content moderators make better decisions, faster.

Home Page: https://vibes.whenderson.dev

HTML 6.91% JavaScript 1.02% SCSS 26.67% TypeScript 57.77% Python 7.63%
content-moderation gpt-3 trust-and-safety


Project Positive Vibes


(Built during START Hack 2023 by Dron Hazra, William Henderson and Sam Yun Yap)

Project Positive Vibes introduces a new approach to content moderation: harnessing the power of modern large language models (LLMs) while staying true to the core philosophy that the human moderator is the decision-maker.

On average, Supercell’s moderators take 12 seconds to make a judgement about a report after being presented with a huge assortment of noisy signals. Our idea is simple: by minimising the noise, we can maximise the productivity of the moderation teams while improving their accuracy and access to information, which is necessary for less clear-cut cases.

Behind our sleek and intuitive user interface – which supports a broad array of keyboard shortcuts, already saving moderators a few seconds for each report – lies the heart of our project: LLMs. With research in the field growing rapidly year on year and recent developments such as GPT-4 taking the world by storm, exploring their potential applications to content moderation seemed the obvious approach.

A shortfall of the existing content moderation interface is that to get the whole picture of a situation, the moderator may have to read up to 100 messages per report. Using LLMs, we can identify the key themes of a conversation and present these alongside the messages, providing invaluable context without requiring everything to be read. In addition, there may be many simultaneous conversations going on, so a substantial amount of the text provided to the moderator is irrelevant. Another model allows us to identify and isolate these individual conversations, further reducing noise and accelerating the moderation process.
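As a rough illustration of the theme-identification step, here is a minimal sketch. It assumes a hypothetical `complete` callable that wraps whichever LLM is in use; the prompt wording, the function names and the `- ` reply format are all assumptions for illustration, not the project's actual implementation.

```python
from typing import Callable, Sequence


def build_theme_prompt(messages: Sequence[str], max_themes: int = 3) -> str:
    """Format a chat log into a prompt asking for its key themes (hypothetical wording)."""
    numbered = "\n".join(f"{i}. {m}" for i, m in enumerate(messages, start=1))
    return (
        f"List up to {max_themes} key themes of the conversation below, "
        "one per line, each starting with '- '.\n\n" + numbered
    )


def extract_themes(
    messages: Sequence[str],
    complete: Callable[[str], str],  # assumed wrapper around an LLM call
    max_themes: int = 3,
) -> list[str]:
    """Ask the model for themes and keep only the lines that follow the requested format."""
    reply = complete(build_theme_prompt(messages, max_themes))
    themes = [line[2:].strip() for line in reply.splitlines() if line.startswith("- ")]
    return themes[:max_themes]
```

Passing the model in as a plain callable keeps the parsing logic testable without network access, and makes it easy to swap one LLM for another.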

Currently, once a report is dealt with, the work the moderator put into it no longer has any value. With multiple teams of moderators working around the clock, this adds up to a lot of wasted work, which could instead be used to inform future decisions. Our solution is to provide insights into how similar reports were dealt with in the past, allowing moderators to reach their conclusions more effectively. To detect these similarities, we trained a text embedding model on reported messages, which can be connected to a database of prior reports to show how those similar to the one in question were judged.
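The similar-report lookup can be sketched as a nearest-neighbour search over embeddings. This is only an illustration under stated assumptions: `embed` below is a toy hashed bag-of-words stand-in for the trained embedding model (which the write-up does not describe in detail), the in-memory list stands in for the database of prior reports, and `PriorReport`, `most_similar` and the verdict labels are hypothetical names.

```python
import math
import zlib
from dataclasses import dataclass


@dataclass
class PriorReport:
    text: str
    verdict: str  # hypothetical label, e.g. "action taken" or "dismissed"


def embed(text: str, dims: int = 256) -> list[float]:
    """Toy stand-in for the trained embedding model: a hashed bag of words."""
    vec = [0.0] * dims
    for word in text.lower().split():
        vec[zlib.crc32(word.encode("utf-8")) % dims] += 1.0
    return vec


def cosine(a: list[float], b: list[float]) -> float:
    """Cosine similarity; returns 0.0 if either vector is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def most_similar(
    query: str, history: list[PriorReport], k: int = 3
) -> list[tuple[float, PriorReport]]:
    """Return the k prior reports most similar to the query, best first."""
    q = embed(query)
    scored = [(cosine(q, embed(report.text)), report) for report in history]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return scored[:k]
```

The verdicts attached to the returned reports are what would be surfaced to the moderator as context; the decision itself stays with the human.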

Of course, it is of the utmost importance to both Supercell and ourselves that at the end of the day, the human is the one making the decision about the future of a player within the game. While large language models can show moderators very useful information, both quantitative and qualitative, to help them understand the full context of the report, we have been careful to ensure that these insights remain only insights, and that the interface provides an easy way for the moderator to dig into the data themselves in cases where the models can’t grasp the full picture.

In conclusion, Project Positive Vibes uses insights provided by LLMs, displayed in a sleek and intuitive user interface, to allow content moderators to make better decisions, faster.
