Comments (4)
@extricator thank you for the compliment. Currently, it only accepts a URL, not an HTML document. I'm considering adding this in the next release. It's true that in some cases we already have the HTML content and don't need to send another request to retrieve it.
from article-extractor.
It's true that in some cases we already have the HTML content and don't need to send another request to retrieve it.
What about file URIs pointing to HTML files? Does it support those?
@rpgdev I'm not sure. It depends on how node-fetch handles URLs. This lib uses node-fetch to retrieve the HTML content from the given URL.
@extricator this feature has been available since v4. Please try it and let me know if you have any feedback. Thanks.
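For readers with HTML files on disk, one way to sidestep the open question of whether node-fetch handles file URIs is to read the file yourself and hand the HTML string to the extractor. The sketch below is only an illustration under assumptions: `loadInput` and `extractFromAnywhere` are hypothetical helper names, not part of article-extractor, and `extract` is passed in by the caller on the assumption (based on the comment above) that the v4+ function accepts an HTML string as well as a URL.

```javascript
const { readFile } = require('node:fs/promises');
const { fileURLToPath } = require('node:url');

// Hypothetical helper (not part of article-extractor): if the input is a
// file: URI, read the file and return its HTML; otherwise return the input
// unchanged so the extractor can treat it as a URL or HTML string.
async function loadInput(input) {
  if (typeof input === 'string' && input.startsWith('file:')) {
    return readFile(fileURLToPath(input), 'utf8');
  }
  return input;
}

// `extract` is injected by the caller. It is assumed to accept either a URL
// or an HTML string, as described for v4 and later in this thread.
async function extractFromAnywhere(input, extract) {
  return extract(await loadInput(input));
}
```

A call would then look like `extractFromAnywhere('file:///path/to/page.html', extract)`, where `extract` is whatever the installed version of the library exports. Injecting the function keeps the sketch independent of any particular release.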