libmir / dcv Goto Github PK

Computer Vision Library for D Programming Language

Home Page: http://dcv.dlang.io/

License: Boost Software License 1.0

D 99.81% Shell 0.19%

computer-vision image-processing

dcv's People

Contributors

gitter-badger henrygouk dmitryolshansky rjmcguire 9il strogo andrewbenton ljubobratovicrelja timotheecour carun smietzner topcomma isabella232 rillki inochi2d aferust

dcv's Issues

Raster shape drawing

Implement basic raster shape drawing for dcv.plot module.

Sources:
https://en.wikipedia.org/wiki/Bresenham%27s_line_algorithm
https://en.wikipedia.org/wiki/Xiaolin_Wu%27s_line_algorithm
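
As a starting point for discussion, here is a minimal sketch of the integer, all-octant form of Bresenham's line algorithm described in the first source. The function name `drawLine` and the flat row-major `ubyte[]` image are assumptions made for illustration; they are not part of the existing dcv.plot API.

```d
import std.math : abs;

/// Hypothetical helper: writes `value` to every pixel on the line from
/// (x0, y0) to (x1, y1) in a row-major, single-channel image.
void drawLine(ubyte[] image, int width, int height,
              int x0, int y0, int x1, int y1, ubyte value)
{
    immutable dx = abs(x1 - x0);
    immutable dy = -abs(y1 - y0);
    immutable sx = x0 < x1 ? 1 : -1;
    immutable sy = y0 < y1 ? 1 : -1;
    int err = dx + dy; // combined error term for both axes

    while (true)
    {
        // Pixels outside the image are skipped rather than clipped.
        if (x0 >= 0 && y0 >= 0 && x0 < width && y0 < height)
            image[y0 * width + x0] = value;
        if (x0 == x1 && y0 == y1)
            break;
        immutable e2 = 2 * err;
        if (e2 >= dy) { err += dy; x0 += sx; } // step along x
        if (e2 <= dx) { err += dx; y0 += sy; } // step along y
    }
}

unittest
{
    auto img = new ubyte[5 * 5];
    drawLine(img, 5, 5, 0, 0, 4, 4, 255);
    assert(img[0] == 255 && img[24] == 255);
}
```

Xiaolin Wu's algorithm would additionally need a fractional coverage value per pixel, so it would fit a floating-point or blended output better than this binary version.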

Global Goal

Possible variants:

Library for D. Disadvantage: the D community is really small. Advantage: it can be implemented in any programming style, including GC + OOP. A project to play with D and have fun.
Library written in D. Disadvantage: a nothrow @nogc layer must be provided, and the OOP API should be optional or removed. Advantage: the number of users and contributors is limited only by language bindings such as Python, Julia, Ruby, Rust, and Go (yes, we can build libraries for Rust and Go 😄 I think it is a better way to move forward with D). A professional project that can live for many years.

A library for D is, in 99% of cases, useless as a general-purpose library. A library written in D is a library for D that can also be used from other languages like a common C library.

imwrite should accept lazy slice

lazy image / image with stride!1 != 1 -> buffering (4 KB buffer) -> writer stream
image with stride!1 == 1 -> writer stream
image with ideal strides -> writer stream (single call).

For example void write_png(Writer stream, long w, long h, in ubyte[] data, long tgt_chans = 0) from image formats can be used.

dcv.dlang.io

@ljubobratovicrelja You may want to ask @wilzbach to move DCV site :-)

norm and asType can be deprecated.

Both are single line functions. See #33
normalized single line too.
scaled is not used, if I am not wrong. The user can do slice[] *= scalar / slice[] += scalar, or use ndEach if both * and + are required.
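
To illustrate how little a `scaled`-style helper adds, here is a sketch using D's built-in array operations on a plain `float[]`. The function name `scaleInPlace` is hypothetical, and the sketch deliberately uses built-in arrays rather than ndslice, so it only shows the idea, not the exact ndslice spelling.

```d
/// Hypothetical replacement for a `scaled`-style helper:
/// multiplies every element by `factor`, then adds `offset`.
void scaleInPlace(float[] values, float factor, float offset)
{
    values[] *= factor; // element-wise multiply
    values[] += offset; // element-wise add
}

unittest
{
    float[] v = [1f, 2f, 3f];
    scaleInPlace(v, 2f, 1f);
    assert(v == [3f, 5f, 7f]);
}
```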

Proof of concept for ndslice

dcv is an amazing proof of concept for ndslice. Please add a note that it is based on ndslice to the future forum announcement and the dcv readme.

The site is broken on android

the site is broken on android, it constantly just redirects to itself
from https://twitter.com/WebFreak001/status/765138108417474560

[Docs] Bug in example code paths

Image path strings have the ?raw=true metadata:
https://ljubobratovicrelja.github.io/dcv/?loc=example_filter.html

examples/video

Won't work

# ./video -f ../data/centaur_1.mpg

[mpegvideo @ 0xd9d320] Estimating duration from bitrate, this may be inaccurate
core.exception.AssertError@../../source/dcv/core/image.d(171): Assertion failure

Iteration performance

aSlice[i] requires 1 addition and 1 multiplication
aSlice[i, j] requires 2 additions and 2 multiplications
90% of algorithms can be iterated using ndslice.algorithm (ndMap, ndReduce). The other 10% can use the front!d and popFront!d methods.
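
The cost difference can be sketched with a hand-written strided view. `View2D` below is a hypothetical stand-in, not the ndslice type: indexed access recomputes `i * stride0 + j * stride1` for every element, while the walking version keeps a running offset and only adds a stride per step.

```d
/// Hypothetical strided 2-D view over a flat buffer (not the ndslice Slice).
struct View2D
{
    const(float)[] data;
    size_t rows, cols;
    size_t stride0, stride1; // strides in elements, per dimension

    /// Indexed access: two multiplications and two additions
    /// (offset computation plus the base address).
    float opIndex(size_t i, size_t j) const
    {
        return data[i * stride0 + j * stride1];
    }
}

/// Sums all elements through opIndex.
float sumIndexed(View2D v)
{
    float total = 0;
    foreach (i; 0 .. v.rows)
        foreach (j; 0 .. v.cols)
            total += v[i, j];
    return total;
}

/// Sums all elements by advancing a running offset instead of re-indexing.
float sumWalking(View2D v)
{
    float total = 0;
    foreach (i; 0 .. v.rows)
    {
        size_t offset = i * v.stride0; // one multiplication per row
        foreach (j; 0 .. v.cols)
        {
            total += v.data[offset];
            offset += v.stride1;       // one addition per element
        }
    }
    return total;
}
```

Both functions return the same result; whether the second is measurably faster depends on the compiler and optimization level, which this sketch does not attempt to benchmark.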

~= concatenation should be removed

From RHT module:

    /// Run RHT using non-zero points in image as edge points.
    auto opCall(T)(Slice!(2, T*) image)
    {
        Point[] points;
        foreach (y; 0 .. image.length!0)
            foreach (x; 0 .. image.length!1)
            {
                if (image[y, x] > 0)
                {
                    points ~= Point(cast(int)x, cast(int)y);
                }
            }
        return this.opCall(image, points);
    }

This example has 2 issues. The first is slow indexing. The second is ~= concatenation, which changes complexity from O(n) to O(n^2).
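
One possible way to address both points, sketched with Phobos only: walk the pixels linearly instead of indexing with `[y, x]`, and collect the points with `std.array.appender` and an up-front `reserve`. The `Point` struct, the function name `nonZeroPoints`, and the flat row-major `ubyte` image are assumptions for illustration; the real RHT code works on `Slice!(2, T*)`.

```d
import std.array : appender;

struct Point { int x, y; }

/// Collects the coordinates of all non-zero pixels of a row-major image.
Point[] nonZeroPoints(const(ubyte)[] image, size_t width)
{
    auto points = appender!(Point[])();
    points.reserve(image.length / 8); // rough guess: edge maps tend to be sparse

    size_t x = 0, y = 0;
    foreach (value; image)            // linear walk, no per-pixel index math
    {
        if (value > 0)
            points.put(Point(cast(int) x, cast(int) y));
        if (++x == width) { x = 0; ++y; }
    }
    return points.data;
}

unittest
{
    ubyte[] edges = [0, 1, 0,
                     2, 0, 0];
    assert(nonZeroPoints(edges, 3) == [Point(1, 0), Point(0, 1)]);
}
```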

Note about LDC

That it is required.
ldmd2 should be used with DUB, not ldc2

[Docs] Finish up lucas-kanade example

Double allocation without reason

as!aType.slice.asImage allocates without reason, because asImage allocates data anyway. asImage uses data too. In addition, it should be named something like toImage.

Is it possible to remove background with dcv?

I have a lot of images in JPG format with a black background. Is it possible to find the background and remove it (make it transparent) with the current version of DCV? PNG as output would be enough for me (ideally WebP, but D does not have a native encoder/decoder :( )

If not DCV, are there any better tools for this task?
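
For a plain black background, a simple threshold may already be enough. The sketch below is not a DCV API; it is a hypothetical helper `blackToAlpha` working on packed 8-bit RGB data, and it assumes the image has already been decoded and will be written as RGBA PNG by whatever image I/O is in use. A threshold is used instead of an exact comparison with zero because JPEG compression usually leaves the background slightly off-black.

```d
/// Converts packed RGB to packed RGBA, making near-black pixels transparent.
/// Hypothetical helper; `threshold` is the largest channel value still
/// treated as background.
ubyte[] blackToAlpha(const(ubyte)[] rgb, ubyte threshold = 16)
{
    assert(rgb.length % 3 == 0, "expected packed RGB data");
    auto rgba = new ubyte[rgb.length / 3 * 4];
    foreach (i; 0 .. rgb.length / 3)
    {
        immutable r = rgb[3 * i], g = rgb[3 * i + 1], b = rgb[3 * i + 2];
        rgba[4 * i .. 4 * i + 3] = rgb[3 * i .. 3 * i + 3]; // copy colour
        ubyte alpha = 255;                                   // opaque by default
        if (r <= threshold && g <= threshold && b <= threshold)
            alpha = 0;                                       // background
        rgba[4 * i + 3] = alpha;
    }
    return rgba;
}
```

This also makes dark pixels inside the subject transparent; separating those would need something like a flood fill from the image border, which is outside this sketch.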

Go forward with LLVM and drop DMD BE?

According to libmir/dcompute#7, upcoming OpenCL support can be used for thread and synchronization management instead of druntime on the CPU (not only on GPUs), and kernels can be optimized as well as common CPU code. So the idea is to drop support for DMD. The benefits:

No need to optimise the DMD BE; currently it is 20 times slower for matrix multiplication compared with LDC. In addition, @WalterBright has a lot of other work on the DMD FE.
Simple, fast and nothrow @nogc parallelism using upcoming dcompute.
Code will be simplified.
Less maintenance effort will be required.
Less uncertainty for users.

lockstep is slow

It is very slow (zip is the same)

Canny

Non-maxima suppression implementation is faulty.

See RHT example canny image:
https://github.com/ljubobratovicrelja/dcv/blob/master/examples/rht/result/canny.png

ranged should be deprecated (again)

As soon as mir.image is implemented, we will have good conversions between all formats we use. ranged is a function that stretches the color domain for each channel. If you apply it to a low-contrast image, it will increase the contrast. This implicit filter functionality is bad practice.

Filters with fixed size

Many separable filters are used with fixed size, e.g. 2, 3, 4, 5. It can be done with static foreach.
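
A minimal sketch of what this could look like, assuming a 1-D pass over a plain `float[]` row and a kernel whose size `N` is a template parameter. The name `filterRow` is hypothetical, and only the "valid" region (no border handling) is computed.

```d
/// Applies a 1-D kernel of compile-time size N along a row.
/// Returns only the valid region, i.e. input.length - N + 1 samples.
float[] filterRow(size_t N)(const(float)[] input, const float[N] kernel)
{
    if (input.length < N)
        return null;
    auto output = new float[input.length - N + 1];
    foreach (i; 0 .. output.length)
    {
        float acc = 0;
        // Expanded at compile time into N multiply-add statements.
        static foreach (k; 0 .. N)
            acc += input[i + k] * kernel[k];
        output[i] = acc;
    }
    return output;
}

unittest
{
    float[3] box = [1f, 1f, 1f];
    assert(filterRow!3([1f, 2f, 3f, 4f], box) == [6f, 9f]);
}
```

Because `N` is known at compile time, each instantiation (2, 3, 4, 5, ...) gets its own fully unrolled inner body.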

Filters should be separated

See https://en.wikipedia.org/wiki/Separable_filter
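
As a small worked example of separability: a 2-D kernel is separable when it equals the outer product of a column vector and a row vector, so an N x N convolution (N^2 multiplications per pixel) can be replaced by two 1-D passes (2N multiplications per pixel). The sketch below checks this for the horizontal Sobel kernel under one common sign convention; the helper name `outer` is an assumption for illustration.

```d
/// Builds the 2-D kernel that equals the outer product col * row^T.
float[][] outer(const(float)[] col, const(float)[] row)
{
    auto kernel = new float[][](col.length, row.length);
    foreach (i, c; col)
        foreach (j, r; row)
            kernel[i][j] = c * r;
    return kernel;
}

unittest
{
    // Smoothing along y, differentiation along x.
    auto sobelX = outer([1f, 2f, 1f], [-1f, 0f, 1f]);
    assert(sobelX == [[-1f, 0f, 1f],
                      [-2f, 0f, 2f],
                      [-1f, 0f, 1f]]);
}
```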

Naming conventions

It seems your naming conventions and abbreviations do not give a good experience, IMO. Longer names might be better in some cases.

Like;

Image image = imread("/path/to/image.png"); // read an image from filesystem.

auto slice = image.sliced; // slice image data (calls std.experimental.ndslice.slice.sliced on image data)

slice
    .asType!float[0..$, 0..$, 1] // convert slice data to float, and take the green channel only.
    .conv!symmetric(sobel!float(GradientDirection.DIR_X)) // convolve image with horizontal Sobel kernel.
    .byElement
    .ranged(0, 255).array.sliced(slice.shape[0..2]) // scale values to fit the range between the 0 and 255
    .imshow("Sobel derivatives"); // preview changes on screen.

waitKey();

Something like readImage(), showImage().

You tend to mix styles: for example, byElement() is not consistent with imread(), imshow(), etc.

Edge preserving blur

median
bilateral filtering.

Figure out a proper API for filterNonMaximum

https://github.com/ljubobratovicrelja/dcv/blob/master/source/dcv/imgproc/filter.d#L311

[question] Migration to libmir

@ljubobratovicrelja Are you interested in moving DCV to the libmir project? The copyright and the project name would be preserved as is.

Machine learning algorithm

Does existing or future dcv require ML algorithms? If yes, please open an issue at https://github.com/libmir/mir/issues . Most of ML can be split into data preparation and other computations. The other computations may be added to Mir as building blocks for ML algorithms in dcv.

[Docs] Add links from source to examples and vice versa

Why normalize function use conversion to real when normalizing integers?

Updated Mir version

@9il unit tests are failing - I'm not quite sure what has changed in mir.ndslice.slice.slice, could you help out?

Better testing

Please add tests for LDC and different compiler versions. Windows testing is also very important for users.

[Docs] Move examples to gh-pages

Remove README.md content from examples in the project - not to have doubled example content that requires updating and syncing.

Remove dcv.core.image.Image

Proposition by @9il, started in #62.

Here are the relevant copy/pasted messages:

9il:

Do we really need Image type? Why?

ljubobratovicrelja:

As said in the description of the module in the docs, it is designed mainly to help with image I/O, but also to hold additional image metadata. Since its data type is defined at runtime, it allows reading an image of unknown format. Since the Slice format is statically defined, we would have to expect a certain image format when reading it, and if the image read is not of the expected format, we would have to convert it. Also, Image contains additional metadata, e.g. color format (HSV, YUV, RGB etc.). And, in the future, Image should hold EXIF metadata.

Pipeline in DCV should be:

Image = dcv.core.image.Image
Slice = mir.ndslice.slice.Slice

LoadImage(path) --> dcv.core.image.Image
InspectAndAdoptImageFormat(Image) --> Slice
Processing(Slice) --> Slice
PackSliceToImage(Slice) --> Image
SaveImage(Image, path)

Long story short, we need image container with runtime defined data type, and additional image related metadata.

9il:

These are scripting language idioms. They are not good for D.

If you have processing, then you work with one, two, at most three formats for processing.
They should have their own CT instantiations for performance reasons.
Then, when you want to save something, you can just call a function which accepts Slice, Metadata, and optionally RT/CT format.

The last issue is reading. Yes, when we read something, the image format is unknown. But, as was said above, only image types known beforehand are of interest. So, a user or library should define a mapping, for example:

RT image type1 -> Alg1
RT image type2 -> Alg1
RT image type3 -> Alg2
Other RT image -> Error

It is not possible to eliminate this mapping. But rather than hiding it in different class implementations, it is better to have an explicit way to do it, with library helpers if required.
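
The explicit mapping described above could be sketched as a plain switch from a runtime format tag to compile-time template instantiations. Everything here is hypothetical: `PixelFormat`, `mean` (a stand-in for a real algorithm), and `meanOf` are illustration names, not dcv or Mir APIs.

```d
/// Hypothetical runtime tag produced by an image reader.
enum PixelFormat { u8, u16, f32, other }

/// Stand-in for an algorithm that is instantiated per element type (CT).
double mean(T)(const(T)[] pixels)
{
    double total = 0;
    foreach (p; pixels)
        total += p;
    return pixels.length ? total / pixels.length : 0;
}

/// Explicit RT -> CT mapping: each supported runtime format selects one
/// compile-time instantiation; everything else is an error.
double meanOf(const(void)[] raw, PixelFormat format)
{
    switch (format)
    {
        case PixelFormat.u8:  return mean(cast(const(ubyte)[]) raw);
        case PixelFormat.u16: return mean(cast(const(ushort)[]) raw);
        case PixelFormat.f32: return mean(cast(const(float)[]) raw);
        default: break;
    }
    throw new Exception("unsupported pixel format");
}
```

The mapping lives in one visible place, and no class hierarchy is involved; a library helper could generate such a switch, but the user still decides which formats are supported.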

Please avoid any usage of classes (except in already existing D libs, which can be replaced in the future). Even async I/O can be performed without classes. D users like classes because they are familiar with OOP. But this is bad practice for D. Structural programming is the proper way to move forward with D.

Rename histEqual to histEqualize

Equal is a really weird shortcut for Equalize in Dlang

Image I/O

As discussed in #48, maybe we should start planning on how to enrich the image I/O package of the library. Imageformats library is good (especially because it is purely written in D), but format coverage is poor (especially for encoders).

Use C libraries

So first idea is to build minimal bindings (or use existing ones) to popular C libraries:

Pros:

industry-wide used and tested libraries
minimal pain for maximal gain :)

Cons:

more C dependencies

Translate libraries to D

Some people have already translated some of the popular encoders/decoders to D. I feel that's not that easy to do, and I'd personally much rather focus on DCV's core, but if we decide to take this step, no problem.

Using FFmpeg

@henrygouk suggested we could use ffmpeg to encode/decode image formats. This also seems like a great choice since a lot of formats are supported, and ffmpeg-d is already a dependency.

Custom image I/O library's synergy with dcv:core

There was also discussion that users should be free to use 3rd party libraries for image I/O with DCV. I believe this is already achievable in DCV - e.g. if a user is working with gtkd and dcv:core, he/she can load a pixbuf from a file, then slice its data and work with dcv algorithms. So, I believe we're OK here, except maybe we should make an example on this topic to show it to people.

Separation of Image I/O from Video

We also discussed whether image I/O should be separated from video - in #48 we defined the dcv:io sub-package, where we could have defined dcv:ioimage and dcv:iovideo. If we decide to go with the first option (bind C libs), I really think we should do this, since it would be heavily loaded with C libs.

Any comment is welcome.

Library separation

A user may want to use this library only for CV algorithms. Library should not force a user to install any C libraries.
It looks like DCV may be split into

algorithms / image manipulations (DCV)
decoding / codecs
visualization

The current library looks like it is oriented toward the end user. Compared with Python, it is better practice for D to have an API which can be used to build extended functionality, e.g. to be used in other libraries and cross-platform products.

ggplot dependency should be optional and only for figure drawing

glfw can be used for imshow directly

imgproc.color should be deprecated

It can be replaced with a single template which calls ndslice.algorithm and colour transformations from the upcoming Phobos color module.

Dub: http://code.dlang.org/packages/color
Git: https://github.com/TurkeyMan/color
Colour thread at forum: https://forum.dlang.org/thread/[email protected]

In addition, color transformation should use 2D representation where color is packed in the last dimension. This will optimise iteration.

How is linear resizing supposed to work?

When I resize an image in GIMP, I get a completely different result.
For example, in DCV I do this:

// w == 3; The image is a 2MP RGBA photo.

auto slice = image.sliced;
writeln(slice[0, 0, 0 .. $]);
auto thumbnail = slice.resize!linear([w, w]);
auto rgb = thumbnail[0 .. $, 0 .. $, 0 .. 3];
writeln(rgb);

and I got this:

[196, 198, 249, 255]
[[[196, 198, 249], [156, 166, 189], [82, 59, 100]], [[231, 163, 178], [35, 39, 34], [104, 98, 122]], [[141, 120, 119], [26, 36, 44], [169, 141, 130]]]

The first pixel of the downscaled image is the same as the first pixel of original image.
This is not what I expect.
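
One possible explanation, which is an assumption and has not been checked against DCV's source: if linear resizing point-samples the source at `i * scale` and interpolates between the two nearest pixels, then output pixel 0 reads source pixel 0 exactly, however large the downscale factor is. Editors such as GIMP typically average over the whole source area that maps to an output pixel. The 1-D sketch below contrasts the two behaviours; both function names are hypothetical.

```d
/// Point-sampled linear interpolation (top-left aligned mapping).
/// For large downscale factors most source pixels are never read.
float[] resizeLinear(const(float)[] src, size_t n)
{
    assert(n > 0 && src.length > 0);
    auto dst = new float[n];
    immutable scale = cast(double) src.length / n;
    foreach (i; 0 .. n)
    {
        immutable pos = i * scale;
        immutable lo = cast(size_t) pos;
        immutable hi = lo + 1 < src.length ? lo + 1 : lo;
        immutable t = pos - lo; // fractional part, 0 for integer positions
        dst[i] = cast(float)((1 - t) * src[lo] + t * src[hi]);
    }
    return dst;
}

/// Box (area) averaging: every source pixel contributes to the output.
float[] resizeBox(const(float)[] src, size_t n)
{
    assert(n > 0 && n <= src.length);
    auto dst = new float[n];
    foreach (i; 0 .. n)
    {
        immutable begin = i * src.length / n;
        immutable end = (i + 1) * src.length / n;
        float total = 0;
        foreach (v; src[begin .. end])
            total += v;
        dst[i] = total / (end - begin);
    }
    return dst;
}

unittest
{
    float[] row = [0f, 10f, 20f, 30f, 40f, 50f];
    assert(resizeLinear(row, 3) == [0f, 20f, 40f]); // first sample unchanged
    assert(resizeBox(row, 3) == [5f, 25f, 45f]);    // local averages
}
```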

migration to ndslice.algorithm and Mir

The common pattern for dcv is aSlice.byElement.rangedFunction. It has a few performance issues:

It cannot be vectorized
It requires additional computations because of the range interface

So, you may want to switch from dcv.core.algorithm to ndslice.algorithm.

The documentation for ndslice.algorithm can be found here. ndslice.algorithm is currently available only in Mir.

So, switching to Mir is a good option. It provides recent and upcoming ndslice changes for both DMD and LDC. Also, Mir will migrate to Phobos's ndslice after 2-3 DMD releases and will provide deprecation imports with aliasing. ndslice.algorithm is not the last module in the ndslice package; ndslice.concatenation will also be added this year.

Please ask me questions if you have any.

debug

Hi again. A CV library is always complex, so I want to join development, but some internals are hard to understand. I think we need to provide some debug output (to help with bug reports) in a "debug build" or under some version definition
like USE_DCV_DEBUG
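
A minimal sketch of how such a switch could look using D's `version` conditions. The helper name `dcvTrace` is hypothetical; `USE_DCV_DEBUG` is the identifier suggested above and would be enabled with `-version=USE_DCV_DEBUG` (dmd), `-d-version=USE_DCV_DEBUG` (ldc2), or a `versions` entry in the dub configuration.

```d
import std.stdio : stderr;

/// Hypothetical tracing helper. It prints the call site and the arguments
/// to stderr when USE_DCV_DEBUG is defined, and does nothing otherwise.
void dcvTrace(string file = __FILE__, size_t line = __LINE__, Args...)(Args args)
{
    version (USE_DCV_DEBUG)
        stderr.writeln("[dcv] ", file, "(", line, "): ", args);
}

unittest
{
    // Without USE_DCV_DEBUG this call produces no output.
    dcvTrace("resize: target size ", 3, "x", 3);
}
```

D's built-in `debug` statement, enabled with the `-debug` compiler flag, is an alternative if a dedicated version identifier is not wanted.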

GPU acceleration

Nicholas Wilson merged dcompute to libmir organization. It is not ready yet, but we can figure out what DCV algorithms can be expressed as GPU kernels, and what GPU subroutines required in DCV should be implemented in Mir.

uninitializedSlice

dlang/phobos#4780 will be available in 2.072. As soon as LDC has it too, the code can be simplified.

Use std.color based images type of `Slice!(2, XXX*)`

Benefits:

Faster iterations (the color dimension becomes a CT loop).
An explicit image type system is less buggy for devs and users.
A single conversion shell based on std.color can be used instead of a set of conversions.

std.color can be improved to support DCV if it is required. In addition, we can add fastmath to std.color if we want.

Code clean and optimization: remove std.range

I have reviewed a set of files. std.range and std.array are still used frequently. The reasons to remove them:

Less template bloat with iota - iotaSlice uses size_t only and it is faster
Many use cases can be improved with ndslice primitives.

ndslice does not require std.range and incorporates its functionality.
Maybe a few cases with std.array can still be useful. But most of them are used for slice allocation.

Basic implementation
CLAHE

Optional: Implement Otsu's thresholding method.

Parallelisation should be optional

Support 32 bit compilation

As noted in #19, ulong is used for the size type, which should be fixed.
Surely there is other stuff to be discovered that breaks 32-bit compilation...
