Comments (8)
@showcasefloyd whenever you submit an image to be added to the index by Pastec, the file is processed to extract features (signature) that allow searching for it later. This index of signatures is stored in memory by the server while it is running.
If you shut down the server, the index is lost, and when you start it again you must submit all images so that the index is built again.
Saving the index dumps all the information contained in it to a file on disk. This allows you to start the server again and load that index from a file, without needing to submit each image again.
Clearing the index removes all currently indexed images from Pastec; that is, it empties the in-memory index.
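To make the save / load / clear cycle described above concrete, here is a minimal sketch that only builds the corresponding HTTP requests without sending them. The host and port come from the curl example later in this thread. The `/index/io` endpoint, the `WRITE` / `LOAD` / `CLEAR` type values and the `index_path` field are assumptions based on the Pastec API documentation, so check them against your server version. `index_io_request` and `backup.dat` are made-up names for illustration.

```python
import json
import urllib.request

# Host and port taken from the curl example in this thread.
BASE_URL = "http://127.0.0.1:4212"


def index_io_request(action, index_path=None):
    """Build (but do not send) a request for an index I/O operation.

    ASSUMPTION: the /index/io endpoint and the "type" / "index_path"
    fields match the Pastec API documentation; verify before relying on it.
    """
    payload = {"type": action}
    if index_path is not None:
        payload["index_path"] = index_path
    return urllib.request.Request(
        BASE_URL + "/index/io",
        data=json.dumps(payload).encode("utf-8"),
        method="POST",
    )


# Dump the in-memory index to a file on disk.
save_req = index_io_request("WRITE", "backup.dat")
# Rebuild the in-memory index from that file after a restart.
load_req = index_io_request("LOAD", "backup.dat")
# Drop every image from the in-memory index.
clear_req = index_io_request("CLEAR")
```

To actually run one of these against a live server, pass it to `urllib.request.urlopen(...)` and read the response.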
from pastec.
@chrishein First, thanks so much for your answer. This helps me a ton. So it sounds like this is Pastec's way of storing its data. As a rule, do you save the index every time a new image is submitted to it? Also, are there best practices for how to use the index and, if possible, can you give me a scenario where I would need to clear it?
Floyd
@showcasefloyd Regarding rules about saving the index, it all depends on the usage scenario. Saving the index to disk, especially a large one, takes time and resources, and it will probably block access to the service while it runs. So doing it very often can be a problem.
Clearing the index can be useful when you need to start from scratch, during development and testing, or when loading a different index because the usage scenario changes. Again, it all depends on your specific use case.
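One way to avoid saving after every single submission is to batch saves. The sketch below is only an illustration of that idea, not something Pastec provides: `SavePolicy`, `max_pending` and `max_age` are hypothetical names and thresholds that you would tune for your own index size and traffic.

```python
import time


class SavePolicy:
    """Decide when to save: after `max_pending` additions or `max_age` seconds.

    Both thresholds are hypothetical tuning knobs, not Pastec settings.
    """

    def __init__(self, max_pending=100, max_age=600.0, clock=time.monotonic):
        self.max_pending = max_pending
        self.max_age = max_age
        self.clock = clock
        self.pending = 0
        self.last_save = clock()

    def record_addition(self):
        """Call after each image is successfully added to the index."""
        self.pending += 1

    def should_save(self):
        """True when there are unsaved additions and a threshold is reached."""
        if self.pending == 0:
            return False
        too_many = self.pending >= self.max_pending
        too_old = self.clock() - self.last_save >= self.max_age
        return too_many or too_old

    def mark_saved(self):
        """Call after the index has been written to disk."""
        self.pending = 0
        self.last_save = self.clock()
```

In an application you would call `record_addition()` after each successful upload, check `should_save()`, and only then trigger the (potentially blocking) index write followed by `mark_saved()`.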
@chrishein Okay, thanks again. I think I understand now. By the way, are there any limitations or performance issues I need to think about in terms of the index? For example, if I have a million images indexed, will there be a huge performance hit when I search against it? If so, are there techniques to consider when designing my app?
I have another noob question. Is there a way to actually list / query what's in the index? Basically, if we have to know if an image has already been added to the index, is there a way to find it?
Floyd
On 31/03/2015 16:11, showcasefloyd wrote:
> I have another noob question. Is there a way to actually list / query
> what's in the index? Basically, if we have to know if an image has
> already been added to the index, is there a way to find it?
There is currently no API call that lists the images in the
index. However, that would be easy to implement.
Adrien Maglo, Ph.D.
Pastec developer, Visualink Founder
http://www.pastec.io
@magwyz - Yes this would be very useful! Thanks so much for all your hard work. Pastec is amazing.
A new API call that lists the image ids in the index was added in commit 748b979.
Example:
~$ curl -X GET http://127.0.0.1:4212/index/imageIds
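For the original question (checking whether an image is already in the index), the response of that call can be tested for membership. This is a rough sketch: it assumes the response body is JSON with the ids in an `image_ids` array, which is not confirmed in this thread, and the `sample` string is made up for illustration rather than real server output. `indexed_ids` and `is_indexed` are hypothetical helper names.

```python
import json


def indexed_ids(response_body):
    """Return the set of image ids found in an /index/imageIds response.

    ASSUMPTION: the body is JSON with the ids in an "image_ids" array.
    """
    data = json.loads(response_body)
    return set(data.get("image_ids", []))


def is_indexed(image_id, response_body):
    """Check whether image_id appears in the listed ids."""
    return image_id in indexed_ids(response_body)


# Made-up sample body, not captured from a real server.
sample = '{"image_ids": [2, 5, 9], "type": "INDEX_IMAGE_IDS"}'
print(is_indexed(5, sample))
print(is_indexed(7, sample))
```

For a large index it is cheaper to fetch the list once and keep the resulting set around than to request it before every upload.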