salesforce / convsumm
License: BSD 3-Clause "New" or "Revised" License
Thanks for sharing this code!
I get the following error when using the trained model:
libc++abi.dylib: terminating with uncaught exception of type c10::Error: owning_ptr == NullType::singleton() || owning_ptr->refcount_.load() > 0 INTERNAL ASSERT FAILED at ../c10/util/intrusive_ptr.h:348, please report a bug to PyTorch. intrusive_ptr: Can only intrusive_ptr::reclaim() owning pointers that were created using intrusive_ptr::release(). (reclaim at ../c10/util/intrusive_ptr.h:348)
I used the code below (provided in the README):
from transformers import pipeline
summarizer = pipeline("summarization", model="Salesforce/bart-large-xsum-samsum", device=0)
text = "<s> {}".format(" <s> ".join(conv))
summary = summarizer(text, min_length=10, max_length=100, num_beams=4)[0]["summary_text"]
How can I solve this problem?
Thanks!
I wanted to know whether you used the SAMSum corpus to fine-tune these two models (Salesforce/bart-large-xsum-samsum and Salesforce/cods-bart-large-xsum-samsum).
Amazing work. I read the paper, and it says that CODS can control the number of sentences in the summary. I believe the generation pipeline is the one described in the README under the CODS topic: Salesforce/cods-bart-large-xsum-samsum.
I am interested in generating one-sentence summaries. However, I noticed that the model sometimes does not generate the TLDR marker, and the code then throws a list index out of range error (because of split(" TLDR ")[1]). Is there a specific reason for this?
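For anyone hitting the same IndexError, a defensive way to read the output is sketched below. It assumes only what the split in the question implies: that the summary follows a TLDR marker when the marker is present. The helper name extract_tldr is hypothetical and not part of the repository, and falling back to the full output is an illustrative choice, not the authors' method.

```python
def extract_tldr(generated: str, marker: str = "TLDR") -> str:
    """Return the text after the marker, or the whole output if the marker is missing.

    Hypothetical helper: the output format is assumed from the
    split(" TLDR ")[1] call in the question above.
    """
    # str.partition never raises; sep is "" when the marker is not found.
    head, sep, tail = generated.partition(marker)
    return tail.strip() if sep else generated.strip()


# Marker present: only the part after it is kept.
print(extract_tldr("Amanda cookies Jerry TLDR Amanda baked cookies for Jerry."))
# Marker absent: the full text is returned instead of raising IndexError.
print(extract_tldr("Amanda baked cookies for Jerry."))
```

Returning the full output keeps a batch run going, but it hides the cases where the marker is missing, so logging those cases may be more useful when investigating why they occur.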
When the number of words in the conversation is less than the maximum output length (400), a warning tells me to manually reduce the max_length parameter. Why do you set this parameter to 400? Is it a hyperparameter? If I remove it, will the model generate outputs of dynamic length?
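Until the authors answer, one possible workaround for the warning is to cap max_length by the input length on each call. The sketch below is an assumption, not the repository's behavior: pick_max_length is a hypothetical helper, 400 is simply the value mentioned in the question, and the default min_length of 10 is taken from the README snippet quoted earlier on this page.

```python
def pick_max_length(num_input_tokens: int, cap: int = 400, min_length: int = 10) -> int:
    """Choose a max_length that never exceeds the input length or the cap.

    Hypothetical helper; cap=400 is the value from the question and is not
    confirmed to be a tuned hyperparameter.
    """
    # Never go below min_length, otherwise the generation bounds would conflict.
    return max(min_length, min(cap, num_input_tokens))


# The token count would typically come from the model's tokenizer, e.g.
# len(tokenizer(text)["input_ids"]); fixed numbers are used here for illustration.
for n in (3, 50, 1000):
    print(n, pick_max_length(n))
```

The returned value can then be passed as max_length in the summarizer call, together with the same min_length.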
Hello salesforce,
I am interested in your project but have a question.
When training the segment predictor, where do the gold labels come from?
The problem is that when I run the "train_segment_predictor" code, the data loader only loads data with the label "segment", but the preprocessed data does not have it.
Thanks for your kind reply.
Dear authors,
Could you please provide access to the pre-trained segmentation model? I expect it would be very useful for other related tasks as well.
Thanks.