osmosis .follow('a.result', function(context, data, nextPage) { // can I get t

Support for this is coming in the next version. By setting the <code class="notranslat

As of Osmosis 0.0.9 if keep_data is set to <code clas

Looks like it should be: <div class="snippet-clipboard-content notranslate positio

How to get the HTML body string after the Osmosis followed some url? about node-osmosis HOT 5 CLOSED

rchipka commented on September 24, 2024

How to get the HTML body string after the Osmosis followed some url?

from node-osmosis.

Comments (5)

rchipka commented on September 24, 2024

If you mean just the context of the HTML <body>, then context.get('body').toString() should work.

If you mean get the entire HTML document as a string, one solution would be to call context.toString(), however it's not the original string and could cause issues because it doesn't always perfectly serialize HTML (see: libxmljs/libxmljs#213).

The reason Osmosis doesn't retain the original HTML document string/buffer is to keep memory usage to a minimum. In a future release an option could be added that will cause Osmosis to store the original string in the context object. For example, calling .get(url, opts, cb, { keepOriginal: true }) could make the context.original string available.

from node-osmosis.

zzyymaggie commented on September 24, 2024

@rc0x03 💝

from node-osmosis.

rchipka commented on September 24, 2024

Support for this is coming in the next version. By setting the keep_data option to true, the original XML/HTML data is stored in context.response.data.

osmosis
.config('keep_data', true)
.get('www.craigslist.org/about/sites')
.then(function(context, data, next) {
  console.log(context.response.data);
})

from node-osmosis.

rchipka commented on September 24, 2024

As of Osmosis 0.0.9 if keep_data is set to true then the response data will be stored in context.response.data, which is accessible from .then().

from node-osmosis.

pingzh commented on September 24, 2024

Looks like it should be:

osmosis.config('keep_data', true)

osmosis
.get('www.craigslist.org/about/sites')
.then(function(context, data, next) {
  console.log(context.response.data);
})

the version that I am using is 1.1.4

from node-osmosis.

Recommend Projects

How to get the HTML body string after the Osmosis followed some url? about node-osmosis HOT 5 CLOSED

Comments (5)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent