merge romote #2

ranjiewwen · 2017-06-11T09:10:59Z

This pullrequest changes

…vide parallel version of both Wu's and Grana's algorithms (using TBB library)

Extended parallel version to all frameworks supported by OpenCV; Added some documentation notes in modules/imgproc/include/opencv2/imgproc.hpp;

…h getNumberOfCPUs

…rmap

…into UserColormap

ICV_HLINE is split into several specific cases, according to pixel_size, to optimize memory copies of the same color components along the line.

On MacOS and iOS, the unused opencvBigToHost32 is a warning for buildbot

…g_performance # Conflicts: # modules/imgproc/src/drawing.cpp

added 64b optimization for 3 channels case not added 64b optimization for 4 channels case since timings did not show any improvement split ICV_HLINE cases into inline functions instead of macro for code size reduction, without significand speed drawback at first sight

added ICV_HLINE custom implementations for element sizes up to 32 but timings show that it is not very relevant for sizes >= 12

I took the subScalar.cu code and changed the inner operation

* avoid link error (move the implementation of software version to header) * make getConvertFuncFp16 local (move from precomp.hpp to convert.hpp) * fix error on 32bit x86

…flow

C4189: 'clImageUV' : local variable is initialized but not referenced

Fixed snprintf for VS 2013 (#8816) * Fixed snprintf for VS 2013 * snprintf: removed declaration from header, changed implementation * cv_snprintf corrected according to comments * update snprintf patch

There is no cast to wide integer type: std::numeric_limits<ST>::max() * std::numeric_limits<ST>::max()

[GSOC] Speeding-up AKAZE, part #2 (opencv#8951) * feature2d: instrument more functions used in AKAZE * rework Compute_Determinant_Hessian_Response * this takes 84% of time of Feature_Detection * run everything in parallel * compute Scharr kernels just once * compute sigma more efficiently * allocate all matrices in evolution without zeroing * features2d: add one bigger image to tests * now test have images: 600x768, 900x600 and 1385x700 to cover different resolutions * explicitly zero Lx and Ly * add Lflow and Lstep to evolution as in original AKAZE code * reworked computing keypoints orientation integrated faster function from https://github.com/h2suzuki/fast_akaze * use standard fastAtan2 instead of getAngle * compute keypoints orientation in parallel * fix visual studio warnings * replace some wrapped functions with direct calls to OpenCV functions * improved readability for people familiar with opencv * do not same image twice in base level * rework diffusity stencil * use one pass stencil for diffusity from https://github.com/h2suzuki/fast_akaze * improve locality in Create_Scale_Space * always compute determinat od hessian and spacial derivatives * this needs to be computed always as we need derivatives while computing descriptors * fixed tests of AKAZE with KAZE descriptors which have been affected by this Currently it computes all first and second order derivatives together and the determiant of the hessian. For descriptors it would be enough to compute just first order derivates, but it is not probably worth it optimize for scenario where descriptors and keypoints are computed separately, since it is already very inefficient. When computing keypoint and descriptors together it is faster to do it the current way (preserves locality). * parallelize non linear diffusion computation * do multiplication right in the nlp diffusity kernel * rework kfactor computation * get rid of sharing buffers when creating scale space pyramid, the performace impact is neglegible * features2d: initialize TBB scheduler in perf tests * ensures more stable output * more reasonable profiles, since the first call of parallel_for_ is not getting big performace hit * compute_kfactor: interleave finding of maximum and computing distance * no need to go twice through the data * start to use UMats in AKAZE to leverage OpenCl in the future * fixed bug that prevented computing determinant for scale pyramid of size 1 (just the base image) * all descriptors now support writing to uninitialized memory * use InputArray and OutputArray for input image and descriptors, allows to make use UMAt that user passes to us * enable use of all existing ocl paths in AKAZE * all parts that uses ocl-enabled functions should use ocl by now * imgproc: fix dispatching of IPP version when OCL is disabled * when OCL is disabled IPP version should be always prefered (even when the dst is UMat) * get rid of copy in DeterminantHessian response * this slows CPU version considerably * do no run in parallel when running with OCL * store derivations as UMat in pyramid * enables OCL path computing of determint hessian * will allow to compute descriptors on GPU in the future * port diffusivity to OCL * diffusivity itself is not a blocker, but this saves us downloading and uploading derivations * implement kernel for nonlinear scalar diffusion step * download the pyramid from GPU just once we don't want to downlaod matrices ad hoc from gpu when the function in AKAZE needs it. There is a HUGE mapping overhead and without shared memory support a LOT of unnecessary transfers. This maps/downloads matrices just once. * fix bug with uninitialized values in non linear diffusion * this was causing spurious segfaults in stitching tests due to propagation of NaNs * added new test, which checks for NaNs (added new debug asserts for NaNs) * valgrind now says everything is ok * add nonlinear diffusion step OCL implementation * Lt in pyramid changed to UMat, it will be downlaoded from GPU along with Lx, Ly * fix bug in pm_g2 kernel. OpenCV mangles dimensions passed to OpenCL, so we need to check for boundaries in each OCL kernel. * port computing of determinant to OCL * computing of determinant is not a blocker, but with this change we don't need to download all spatial derivatives to CPU, we only download determinant * make Ldet in the pyramid UMat, download it from CPU together with the other parts of the pyramid * add profiling macros * fix visual studio warning * instrument non_linear_diffusion * remove changes I have made to TEvolution * TEvolution is used only in KAZE now * Revert "features2d: initialize TBB scheduler in perf tests" This reverts commit ba81e2a.

sturkmen72 and others added 30 commits September 11, 2015 16:41

Update min_enclosing_triangle.cpp

3f3e6ba

Improvement of sequential connected components Wu's algorithm and pro…

0bc9a0d

…vide parallel version of both Wu's and Grana's algorithms (using TBB library)

Fixed unnecessary black spaces;

5b23c0b

Extended parallel version to all frameworks supported by OpenCV; Added some documentation notes in modules/imgproc/include/opencv2/imgproc.hpp;

Fixed _P reserved variable name problem and changed getNumThreads wit…

4b7fc59

…h getNumberOfCPUs

Removed parallel version for CV_16U label type

89a0a46

ApplyColorMap can be used with a user colormap

61b9484

Suppress warning unused parameter

4826d97

mend

d8fdf93

Add sample

5ad02d7

warnings

1f724e2

warning 2

f92c9dd

warnings 2

8415b90

Merge branch 'master' of git://github.com/Opencv/opencv into UserColo…

a2f3692

…rmap

Merge branch 'master' of git://github.com/Opencv/opencv into UserColo…

5e08d58

…rmap

remove new operator

587e9a5

Merge branch 'master' of git://github.com/Opencv/opencv into UserColo…

48e2d38

…rmap

Merge branch 'UserColormap' of https://github.com/LaurentBerger/opencv …

91e06e7

…into UserColormap

optimize ICV_HLINE

af746a9

ICV_HLINE is split into several specific cases, according to pixel_size, to optimize memory copies of the same color components along the line.

do not use GCC_VERSION

d3a15c6

comment unused function

7521bcc

On MacOS and iOS, the unused opencvBigToHost32 is a warning for buildbot

adaptation for iOS buildbot

e19000a

new try to adapt to iOS build bot

16a9407

try to fix Android compilation

91a0270

Merge remote-tracking branch 'origin/drawing_performance' into drawin…

afbcc07

…g_performance # Conflicts: # modules/imgproc/src/drawing.cpp

Merge branch 'master' into master

0d7666a

Modified code to work with universal build.

2d20aa4

more ICV_HLINE specific cases

27cfe31

added ICV_HLINE custom implementations for element sizes up to 32 but timings show that it is not very relevant for sizes >= 12

Added message about synthesize keyword.

10651d4

make cuda::absdiff support multi-channel scalars

6cf4371

I took the subScalar.cu code and changed the inner operation

alalek and others added 28 commits June 3, 2017 16:57

photo(test): fix MergeRobertson test for AARCH64 build

3933958

Merge pull request #8848 from alalek:fix_test_photo_aarch64

ebd98ea

Fixing buildbot's messages.

a113e8f

TBB: fix build on ARM

a426a65

update convertFp16 using CV_CPU_CALL_FP16

e269ef9

* avoid link error (move the implementation of software version to header) * make getConvertFuncFp16 local (move from precomp.hpp to convert.hpp) * fix error on 32bit x86

Merge pull request #8838 from tomoaki0705:dispatchFp16

125abe2

build: fix PCH stub files generation optimization

0e1d65d

java: use module's public headers only

59798b3

Merge pull request #8857 from alalek:fix_pch_stub_regeneration

7b8d107

Modify the pyrlk.cl to support winSize from 8*8 to 24*24 for optical …

cc47ee3

…flow

Merge pull request #8803 from 4ekmah:sgbm_modehh4_SIMD

31c7966

cmake: add ENABLE_BUILD_HARDENING option

1961bb1

build: fix errors for MSVS2010-2013, reduce default softfloat scope

71517a9

Merge pull request #8844 from mshabunin:fix-arm-tbb

fd7e516

Merge pull request #8860 from alalek:fix_java_headers

fc84c48

add tests for videostab;

1887dcb

Merge pull request #8852 from BKNio:testsForVideoStab

515e01e

build: fix "ambiguous call" (MSVS2010)

781515c

build: fix warning

5c0a287

C4189: 'clImageUV' : local variable is initialized but not referenced

Merge pull request #8816 from mshabunin:sprintf-fix

f71ea4d

Fixed snprintf for VS 2013 (#8816) * Fixed snprintf for VS 2013 * snprintf: removed declaration from header, changed implementation * cv_snprintf corrected according to comments * update snprintf patch

Merge pull request #8876 from alalek:fix_build_msvs

e3c0d11

Merge pull request #8863 from LukeZheZhu:pyrlk_small_winsize

ea93bcc

Merge pull request #8868 from alalek:fix_build_softfloat

0213b50

photo: fix integer overflow

e665be1

There is no cast to wide integer type: std::numeric_limits<ST>::max() * std::numeric_limits<ST>::max()

Merge pull request #8877 from alalek:fix_integer_overflow

daf3fab

Merge pull request #8862 from alalek:build_hardening_flag

5b63399

Update doc build instructions for doxygen

47c9bb7

Merge pull request #8888 from lewisjb:docs-build-doxygen

772a818

ranjiewwen merged commit 5432bb2 into DIP-ML-AI:master Jun 11, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

merge romote #2

merge romote #2

Uh oh!

ranjiewwen commented Jun 11, 2017

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

34 participants

merge romote #2

merge romote #2

Uh oh!

Conversation

ranjiewwen commented Jun 11, 2017

This pullrequest changes

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

34 participants