Giter Site home page Giter Site logo

convsumm's Introduction

convsumm's People

Contributors

jasonwu0731 avatar jimjag avatar likicode avatar svc-scm avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar

convsumm's Issues

Can you please let me know, how to run the SOTA CODS model mentioned in the paper.

Amazing work. I read the paper and it says, CODS can actually control the number of sentences in the summary. I think you have mentioned the generation pipeline under the topic of CODS: Salesforce/cods-bart-large-xsum-samsum.

I am interested in generating 1 sentence summaries. But I noticed sometimes, it does not generate TLDR, where it throws an error saying list out of index (due to this split(" TLDR ")[1]). Is there a specific reason to this?

When the number of words in conversation is less than the maximum output length (400), it says manually reduce the max_length parameter. Why are you using the parameter, 400 for this? Is it a hyperparameter? If I remove it will it generate dynamic length outputs?

Error occurs when load trained summarization models using the huggingface library.

thanks for sharing this code!
it seems there are some errors when using the trained model:

libc++abi.dylib: terminating with uncaught exception of type c10::Error: owning_ptr == NullType::singleton() || owning_ptr->refcount_.load() > 0 INTERNAL ASSERT FAILED at ../c10/util/intrusive_ptr.h:348, please report a bug to PyTorch. intrusive_ptr: Can only intrusive_ptr::reclaim() owning pointers that were created using intrusive_ptr::release(). (reclaim at ../c10/util/intrusive_ptr.h:348)

I use the code below(which is provided in README):
from transformers import pipeline
summarizer = pipeline("summarization", model="Salesforce/bart-large-xsum-samsum", device=0)
text = "<s> {}".format(" <s> ".join(conv))
summary = summarizer(text, min_length=10, max_length=100, num_beams=4)[0]["summary_text"]

how can I solve this problem?
thanks!

Training segment predictor

Hello salesforce,

I am interested in your project but have a question.
When training the segment predictor, where comes the golden label?
The problem is, when I am running the code "train_segment_predictor", data loader only loads datas with the label "segment", but the preprocessed data does not have it.

Thanks for you kind reply.

Pre-trained dialogue segmentation model

Dear authors,
Could you please provide an access to the pre-trained segmentation model? I suppose that it may be really useful for other related tasks as well.
Thanks.

Samsum corpus for finetuning

I wanted to know if you have used samsum corpus for fine-tuning these two models (Salesforce/bart-large-xsum-samsum and Salesforce/cods-bart-large-xsum-samsum) ?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.