Comments (8)
Also, I've been using codesearch as part of a webapp at work that does forensic
analysis of crashes (by letting us search through backtraces), and it's amazing
:)
I'm kinda stuck right now because I cannot index some files and I'm thinking
about using a different indexer / search system, but really codesearch is all I
need so if someone can figure out what the problem is that would be awesome.
Thanks !!
Original comment by [email protected]
on 14 Mar 2013 at 2:00
from codesearch.
Also, I have one line that is crazy long: 2245 characters. Maybe the problem is
that the indexer reads line by line and has some hardcoded limit on the number
of chars in a single line ?
Original comment by [email protected]
on 14 Mar 2013 at 2:04
from codesearch.
Try indexing with -verbose and -logskip flags to see if the file is getting
skipped.
The arbitrary limits are in the source so you can always hand edit and tweak
them. I have a version at
http://github.com/junkblocker/codesearch
which I did to specifically add such options.
Original comment by [email protected]
on 14 Mar 2013 at 4:07
from codesearch.
Thanks for the tip. Indeed I've removed those long lines and now everything
works fine. I've seen that your copy of the code has that -maxlinelen that
should be what I need. Now I have to understand how to build a go program ...
Original comment by [email protected]
on 14 Mar 2013 at 5:57
from codesearch.
Alright, I figured it out, thanks.
repro$ awk '{print length($0)}' badfile | sort -n | tail
972
1001
1043
1071
1456
1529
1724
1792
2259
2328
and in index/write.go there's a
maxLineLen = 2000
Original comment by [email protected]
on 14 Mar 2013 at 6:36
from codesearch.
Feel free to close the issue whoever can.
Original comment by [email protected]
on 14 Mar 2013 at 6:39
from codesearch.
I'm going to leave this open until I can get something like -logskip into
the mainline codesearch branch.
Original comment by [email protected]
on 14 Mar 2013 at 2:08
from codesearch.
I don't know how far you guys should go with that, but having those 2 options
to set the maxLineLen and maxFileSize on the command line would also help.
The default behavior could be to print a message like that (with a better
phrasing probably / different options names) when a file got skipped.
=> /tmp/foo wasn't indexed (maxLine too long) / try to reindex with cindex
-maxLineLen 3000
=> /tmp/foo wasn't indexed (file too big) / try to reindex with cindex
-maxFileSize 1M
Original comment by [email protected]
on 14 Mar 2013 at 4:26
from codesearch.
Related Issues (20)
- Line missing in read.go HOT 3
- csearch: Add flag (-g) for grouping output by file (a'la ack --group, or git grep --heading) [PATCH] HOT 4
- Error in read.go comments
- go install fails to find package
- Option to remove a path from index
- Pull request: add support for per-file custom annotations
- "csearch -n" output is 0-based, not 1-based HOT 1
- Files containing ascii8 are not indexed (feature/request) HOT 4
- IndexWriter always writes logs
- How can I tell cindex to ignore certain folders? (such as logs/, *.log, tmp/ etc.) HOT 3
- cindex fails with mmap errors on OpenBSD HOT 1
- mmap_windows.go not updated for latest go version
- .csearchindex is world-readable HOT 1
- cindex needs vast amounts of temporary space HOT 1
- Windows missing conversion HOT 1
- fix bug in siftUp() HOT 3
- Patch for /index/write_test.go HOT 1
- cindex ignores IRC log files HOT 3
- Minor adjustment of Csearch page. HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from codesearch.