Comments (6)
I'm not sure if I understand exactly which REST call returns 404. You use scenario where river gets list of id's from main indexer REST call, and then do separate REST call for each item to get details? And 404 is returned for some items there?
from elasticsearch-river-remote.
Well, this can occur for all APIs in those cases where the item is deleted shortly after the river is trying to index the item.
For our API we are listing deleted items so that other systems can delete them from their indices without doing a complete reindex. Four our case it would have been nice if the river could delete 404 items from Elasticsearch – but a graceful handle where the river just ignores 404 items will suffice :)
from elasticsearch-river-remote.
OK, so I'll improve river to ignore 404 returned from calls to the URL configured in remote/urlGetDocumentDetails
and continue in indexing, not to fail whole indexing in this case.
But 404 returned for calls to URL's defined in remote/urlGetDocuments
or remote/urlGetSpaces
wil be treated as indexing error still.
from elasticsearch-river-remote.
That would be perfect @velias. Thank you!
from elasticsearch-river-remote.
OK, code is updated. I'll release new version of plugin in few days.
from elasticsearch-river-remote.
released
from elasticsearch-river-remote.
Related Issues (20)
- Upgrade structured-content-tools to 1.3.7
- feature for management - add wild card to show state of all rivers
- is there a way to easily implement deletes HOT 15
- How to distinguish between incremental and full on remote side HOT 5
- how to disable initial full update HOT 2
- Update structured-content-tools to 1.3.8
- RSS dates parsing depends on the system default locale and it should be US always.
- LDAP Authentication HOT 1
- Problems with indexFullUpdateCronExpression HOT 3
- Rivers are deprecated HOT 3
- Make the river capable of indexing content from Splunk REST API
- Support for getting values outside of the defined getDocsResFieldDocuments
- Make it possible to specify remote_field_updated field format.
- Allow injecting hidden values into URL
- Implement throttling ability
- Implement handling of enforced wait for the next API call.
- spacesIndexed parameter improvements
- typo in readme HOT 1
- minGetDocumentsDelay parameter doesn't work correctly when spacesIndexed is used
- structured-content-tools update to 1.3.10
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from elasticsearch-river-remote.