Fuse Convolution + Activation node as a part of graph transformations pipeline by NeiroYT · Pull Request #276 · embedded-dev-research/ITLabAI

NeiroYT · 2026-02-27T14:58:45Z

Closes #272

codecov · 2026-02-27T15:27:40Z

Codecov Report

❌ Patch coverage is 86.06061% with 46 lines in your changes missing coverage. Please review.
✅ Project coverage is 84.95%. Comparing base (028996d) to head (e27d175).
⚠️ Report is 2 commits behind head on main.

Files with missing lines	Patch %	Lines
src/layers_fused/ConvRelu.cpp	79.24%	9 Missing and 13 partials ⚠️
include/layers_fused/ConvRelu.hpp	91.38%	8 Missing and 10 partials ⚠️
...rc/graph_transformations/graph_transformations.cpp	28.57%	3 Missing and 2 partials ⚠️
include/layers/ConvLayer.hpp	85.71%	0 Missing and 1 partial ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #276      +/-   ##
==========================================
+ Coverage   84.42%   84.95%   +0.53%     
==========================================
  Files          57       59       +2     
  Lines        3274     3710     +436     
  Branches     1989     2287     +298     
==========================================
+ Hits         2764     3152     +388     
- Misses        245      271      +26     
- Partials      265      287      +22

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

…_2023 into neiroyt/feature_fuseconvrelu e enter a commit message to explain why this merge is necessary, iall if it merges an updated upstream into a topic branch. starting with '#' will be ignored, and an empty message aborts

aobolensk · 2026-03-07T13:48:16Z

    }
-    std::shared_ptr<Layer> layer = layer_based_shared_copy(layer_to, options);
+    std::shared_ptr<Layer> layer;
+    if (layer_to->getName() == kConvRelu &&


Can't we use layer_based_shared_copy() for kConvRelu case as well? This looks cumbersome

layer_based_shared_copy() copies layer (shared_ptr layer_to in this case), also this func is used in graph.clone().
However any ConvRelu in new graph needs fields from ConvLayer+Relu, so here in general case a new function can be implemented, which constructs layer depending on what we want.

aobolensk · 2026-03-07T13:51:59Z

    start_ = layer->getID();
  }

+  void setInput(Tensor& vec) {


Is it possible to avoid introducing setInput/setOutput just for the one function call?

aobolensk · 2026-03-07T13:54:47Z

 void build_graph_linear(it_lab_ai::Graph& graph, it_lab_ai::Tensor& input,
                        it_lab_ai::Tensor& output, RuntimeOptions options,
-                        bool comments) {
+                        bool comments, bool enable_postops) {


I remember you mentioned that you need ito set enable_postops = false. Where is that?

aobolensk · 2026-03-07T14:04:48Z

+  conv.run(input, output, options);
+  switch (input[0].get_type()) {
+    case Type::kInt: {
+      relu<int>(output[0]);
+      break;
+    }
+    case Type::kFloat: {
+      relu<float>(output[0]);
+      break;
+    }
+    default: {
+      throw std::runtime_error("Unsupported tensor type");
+    }


Now I clearly see why conv + relu exec time is the same as with fused layers. You need to implement fsed implementation here to see the time difference instead of calling two layers

Changes

06c0d3a

NeiroYT requested review from allnes and aobolensk as code owners February 27, 2026 14:58

NeiroYT added 12 commits February 27, 2026 19:04

Fix

4bdbcde

Clang

d0e2d9f

Clang again

e305545

Changes

0ff9504

clang

a0b4d7f

Tidy

cebf82c

Tidy

66b8cc5

Tidy

82a6b00

nodiscard

a9d207b

Test fix

6e0a91b

Tidy again

0cbff4e

aobolensk reviewed Mar 7, 2026

View reviewed changes

NeiroYT added 2 commits March 13, 2026 02:08

Changes

d1a499a

Clang

e27d175

aobolensk approved these changes Mar 13, 2026

View reviewed changes

allnes approved these changes Mar 13, 2026

View reviewed changes

allnes merged commit e581001 into main Mar 13, 2026
23 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fuse Convolution + Activation node as a part of graph transformations pipeline#276

Fuse Convolution + Activation node as a part of graph transformations pipeline#276
allnes merged 15 commits into
mainfrom
neiroyt/feature_fuseconvrelu

NeiroYT commented Feb 27, 2026

Uh oh!

codecov Bot commented Feb 27, 2026 •

edited

Loading

Uh oh!

aobolensk Mar 7, 2026

Uh oh!

NeiroYT Mar 12, 2026

Uh oh!

aobolensk Mar 7, 2026

Uh oh!

NeiroYT Mar 12, 2026

Uh oh!

aobolensk Mar 7, 2026

Uh oh!

aobolensk Mar 7, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

NeiroYT commented Feb 27, 2026

Uh oh!

codecov Bot commented Feb 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

aobolensk Mar 7, 2026

Choose a reason for hiding this comment

Uh oh!

NeiroYT Mar 12, 2026

Choose a reason for hiding this comment

Uh oh!

aobolensk Mar 7, 2026

Choose a reason for hiding this comment

Uh oh!

NeiroYT Mar 12, 2026

Choose a reason for hiding this comment

Uh oh!

aobolensk Mar 7, 2026

Choose a reason for hiding this comment

Uh oh!

aobolensk Mar 7, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

codecov Bot commented Feb 27, 2026 •

edited

Loading