improve performance by AvenSun · Pull Request #6 · sdcb/OpenVINO.NET

AvenSun · 2023-11-28T05:06:42Z

hi @sdcb
sorry for that I didn't noticed they have variety size in last PR.

I thought they already have the same size after padding and resize.

I looked into your implementation of PaddleOcrRecognizer.

OpenVINO.NET/projects/PaddleOCR/Sdcb.OpenVINO.PaddleOCR/PaddleOcrRecognizer.cs

Lines 129 to 144 in 6b99f89

    
           int modelHeight = Model.Shape.Height; 
        
           int maxWidth = StaticShapeWidth ?? (int)Math.Ceiling(srcs.Max(src => 
        
           { 
        
               Size size = src.Size(); 
        
               double width = 1.0 * size.Width / size.Height * modelHeight; 
        
               double padded = 32 * Math.Ceiling(1.0 * width / 32); 
        
               return padded; 
        
           })); 
        
           using Mat final = PrepareAndStackImages(srcs, modelHeight, maxWidth); 
        
           using InferRequest ir = _compiledModel.CreateInferRequest(); 
        
           using (Tensor input = final.StackedAsTensor(srcs.Length)) 
        
           { 
        
               ir.Inputs.Primary = input; 
        
               ir.Run(); 
        
           }

the input of model must have the same shape in one batch.

I think you have no doubt about this.

It should be like the image below after invoked the method PrepareAndStackImages.
four images stack

Am I right? Here's my question.

the implementation of PrepareAndStackImages invoked the method ResizePadding

why not the method of ResizePadding resize and pad image to maxWidth directly?

then we can invoke VConcat immediately.

OpenVINO.NET/projects/PaddleOCR/Sdcb.OpenVINO.PaddleOCR/PaddleOcrRecognizer.cs

Lines 208 to 239 in 6b99f89

    
           private static unsafe Mat PrepareAndStackImages(Mat[] srcs, int modelHeight, int maxWidth) 
        
           { 
        
               Mat[] normalizeds = null!; 
        
               Mat final = new(); 
        
               try 
        
               { 
        
                   normalizeds = srcs 
        
                       .Select(src => 
        
                       { 
        
                           using Mat channel3 = src.Channels() switch 
        
                           { 
        
                               4 => src.CvtColor(ColorConversionCodes.RGBA2RGB), 
        
                               1 => src.CvtColor(ColorConversionCodes.GRAY2RGB), 
        
                               3 => src.FastClone(), 
        
                               var x => throw new Exception($"Unexpect src channel: {x}, allow: (1/3/4)") 
        
                           }; 
        
                           return ResizePadding(channel3, modelHeight, maxWidth); 
        
                       }) 
        
                       .ToArray(); 
        
                   using Mat combined = normalizeds.StackingVertically(modelHeight, maxWidth); 
        
                   combined.ConvertTo(final, MatType.CV_32FC3, 2.0 / 255, -1.0); 
        
               } 
        
               finally 
        
               { 
        
                   foreach (Mat normalized in normalizeds) 
        
                   { 
        
                       normalized.Dispose(); 
        
                   } 
        
               } 
        
               return final; 
        
           }

according to what you mentioned in last PR (#5) , I made this new PR ,

It fixs all problem, you can take a look.

I confirmed It keeps exactly the same OCR result compared with your commit (same input and same model).

meanwhile we also can get some improvement as shown below.


BenchmarkDotNet v0.13.10, Windows 10 (10.0.19044.3693/21H2/November2021Update)
  [Host]     : .NET 7.0.14 (7.0.1423.51910), X64 RyuJIT AVX2
  Job-PARHUX : .NET 7.0.14 (7.0.1423.51910), X64 RyuJIT AVX2
  Job-YLHAAW : .NET 8.0.0 (8.0.23.53103), X64 RyuJIT AVX2

IterationCount=10  LaunchCount=1  WarmupCount=1

Method	Runtime	modelHeight	maxWidth	Mean	Error	StdDev	Ratio	RatioSD	Allocated	Alloc Ratio
StackingVerticallyBySdcb	.NET 7.0	48	320	415.2 μs	15.14 μs	10.01 μs	1.00	0.00	1.93 KB	1.00
StackingVerticallyByAven	.NET 7.0	48	320	252.8 μs	8.97 μs	5.93 μs	0.61	0.02	1.79 KB	0.93

StackingVerticallyBySdcb	.NET 8.0	48	320	408.3 μs	11.43 μs	7.56 μs	1.00	0.00	1.93 KB	1.00
StackingVerticallyByAven	.NET 8.0	48	320	248.5 μs	12.37 μs	8.18 μs	0.61	0.02	1.79 KB	0.93

StackingVerticallyBySdcb	.NET 7.0	48	512	423.3 μs	18.53 μs	12.26 μs	1.00	0.00	1.54 KB	1.00
StackingVerticallyByAven	.NET 7.0	48	512	361.4 μs	14.72 μs	9.74 μs	0.85	0.04	1.62 KB	1.05

StackingVerticallyBySdcb	.NET 8.0	48	512	419.8 μs	14.95 μs	9.89 μs	1.00	0.00	1.54 KB	1.00
StackingVerticallyByAven	.NET 8.0	48	512	360.3 μs	14.88 μs	9.84 μs	0.86	0.03	1.62 KB	1.05

add benchmark for Net 8.0 add more test images

sdcb

Overall speaking, thanks for you PR again, it's really a promising performance improvement result even if you added padding manually, but I requested a few changes, can you check.

sdcb · 2023-11-28T07:10:45Z

I agreed that we can use the method of ResizePadding resize and pad image to maxWidth directly in previous code, but I believe the StackingVertically method should keeps the same behavior of not constraint the input srcs width, which allows input srcs able to be different width.

AvenSun · 2023-11-28T10:51:49Z

hi @sdcb
All suggestions you mentioned are adopted.
please review.

sdcb · 2023-11-28T14:48:11Z

@AvenSun Hi AvenSun, I created a branch and did some necessary work that I believe, can you merge my upstream https://github.com/sdcb/OpenVINO.NET/tree/mat-ext branch into your PR's branch? After that, I'll go ahead and merge this PR.

AvenSun · 2023-11-28T15:39:55Z

hi @sdcb
I just did what you said, you can have a try.

sdcb · 2023-11-29T01:04:43Z

Thanks, merged.

AvenSun added 9 commits November 22, 2023 15:34

initiate benchmark project

45f5ae2

add image stacking benchmark

0e76afc

improve benchmark

af63cbc

add memory diagnoser and multi exporter

0ab3785

add benchmark for Net 8.0 add more test images

benchmark result

4d89d95

improve performance greatly (30% ~ 44%)

3cac1aa

update benchmark

54c37e7

update images and benchmark result

096d7d7

update StackingVertically and ResizePadding

d124f75

sdcb requested changes Nov 28, 2023

View reviewed changes

fix problems

517e182

sdcb added 3 commits November 28, 2023 22:15

add unit test, align code.

771fc48

Merge branch 'master' into mat-ext

abb2d9b

update packages.

4459186

sdcb changed the base branch from master to mat-ext2 November 29, 2023 01:04

sdcb merged commit 7307a63 into sdcb:mat-ext2 Nov 29, 2023

AvenSun deleted the mat-ext branch November 29, 2023 01:17

AvenSun restored the mat-ext branch November 29, 2023 01:20

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

improve performance#6

improve performance#6
sdcb merged 13 commits intosdcb:mat-ext2from
AvenSun:mat-ext

AvenSun commented Nov 28, 2023 •

edited

Loading

Uh oh!

sdcb left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

sdcb commented Nov 28, 2023

Uh oh!

AvenSun commented Nov 28, 2023

Uh oh!

sdcb commented Nov 28, 2023 •

edited

Loading

Uh oh!

AvenSun commented Nov 28, 2023

Uh oh!

sdcb commented Nov 29, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

	int modelHeight = Model.Shape.Height;
	int maxWidth = StaticShapeWidth ?? (int)Math.Ceiling(srcs.Max(src =>
	{
	Size size = src.Size();
	double width = 1.0 * size.Width / size.Height * modelHeight;
	double padded = 32 * Math.Ceiling(1.0 * width / 32);
	return padded;
	}));

	using Mat final = PrepareAndStackImages(srcs, modelHeight, maxWidth);
	using InferRequest ir = _compiledModel.CreateInferRequest();
	using (Tensor input = final.StackedAsTensor(srcs.Length))
	{
	ir.Inputs.Primary = input;
	ir.Run();
	}

	private static unsafe Mat PrepareAndStackImages(Mat[] srcs, int modelHeight, int maxWidth)
	{
	Mat[] normalizeds = null!;
	Mat final = new();
	try
	{
	normalizeds = srcs
	.Select(src =>
	{
	using Mat channel3 = src.Channels() switch
	{
	4 => src.CvtColor(ColorConversionCodes.RGBA2RGB),
	1 => src.CvtColor(ColorConversionCodes.GRAY2RGB),
	3 => src.FastClone(),
	var x => throw new Exception($"Unexpect src channel: {x}, allow: (1/3/4)")
	};
	return ResizePadding(channel3, modelHeight, maxWidth);
	})
	.ToArray();
	using Mat combined = normalizeds.StackingVertically(modelHeight, maxWidth);
	combined.ConvertTo(final, MatType.CV_32FC3, 2.0 / 255, -1.0);
	}
	finally
	{
	foreach (Mat normalized in normalizeds)
	{
	normalized.Dispose();
	}
	}

	return final;
	}

Conversation

AvenSun commented Nov 28, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

sdcb left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

sdcb commented Nov 28, 2023

Uh oh!

AvenSun commented Nov 28, 2023

Uh oh!

sdcb commented Nov 28, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

AvenSun commented Nov 28, 2023

Uh oh!

sdcb commented Nov 29, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

AvenSun commented Nov 28, 2023 •

edited

Loading

sdcb commented Nov 28, 2023 •

edited

Loading