Separate data and instruction #128


Merged: 7 commits into master on Mar 3, 2022

Conversation

@KDr2 (Member) commented Mar 3, 2022

Implement #127 and #97.

The type info task will be in another PR.

@KDr2 (Member, Author) commented Mar 3, 2022

This PR reduces the Integration Test time from 140–160 minutes to about 100 minutes.

@rikhuijzer (Contributor) commented Mar 3, 2022

> This PR reduces the Integration Test time from 140–160 minutes to about 100 minutes.

Tested locally or on the runners?

EDIT: Benchmark below: #128 (comment)

@rikhuijzer (Contributor) left a comment

I don't understand most of the code, but I like that the struct fields have clearer types.

Comment on lines +90 to +94
haskey(tf.bindings, :_1) && (tf.bindings[:_1].val = tf.func)
for i in 1:length(args)
    slot = Symbol("_", i + 1)
    haskey(tf.bindings, slot) && (tf.bindings[slot].val = args[i])
end
Reviewer (Member):

It would be faster if one could avoid looking up every key twice, once in haskey and once in setindex!.

One alternative would be to use get with a default value such as a NoKeyFound() singleton instead of haskey (nothing seems ambiguous for the arguments, unfortunately). Another alternative would be to iterate over the keys, but this would only work, or at least be efficient, if most keys are :_i with i <= length(args) + 1.
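
A minimal sketch of the single-lookup idea, assuming a mutable binding cell with a val field; Binding, NoKeyFound, and bind_args! are hypothetical names for illustration, not Libtask's actual API:

# Hypothetical sketch: replace the haskey-then-setindex! pair with a single
# lookup via get and a sentinel default value. All names here are illustrative.
struct NoKeyFound end

mutable struct Binding
    val::Any
end

function bind_args!(bindings::Dict{Symbol,Binding}, func, args...)
    b = get(bindings, :_1, NoKeyFound())
    b isa NoKeyFound || (b.val = func)
    for i in 1:length(args)
        b = get(bindings, Symbol("_", i + 1), NoKeyFound())
        b isa NoKeyFound || (b.val = args[i])
    end
    return bindings
end

With this shape each slot symbol is hashed once per assignment instead of twice.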

@KDr2 (Member, Author):

We need an Option or Maybe type in Julia 😄

Here most keys in the Dict will not be :_i.

Reviewer (Member):

The plan is to replace tf.bindings::Dict with a NamedTuple in the next PR, so it probably won’t matter.
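
A rough illustration of that direction, assuming the bindings become a NamedTuple of Ref cells keyed by the slot symbols; the layout and the set_slot! helper are assumptions for illustration, not the actual plan:

# Illustrative only: slot access on a NamedTuple avoids Dict hashing, so there
# is no haskey/setindex! double lookup; Ref cells keep the stored values mutable.
bindings = (_1 = Ref{Any}(nothing), _2 = Ref{Any}(nothing))

function set_slot!(bindings::NamedTuple, slot::Symbol, value)
    haskey(bindings, slot) || return bindings
    bindings[slot][] = value
    return bindings
end

set_slot!(bindings, :_2, 42)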

@KDr2 (Member, Author) commented Mar 3, 2022

> > This PR reduces the Integration Test time from 140–160 minutes to about 100 minutes.
>
> Tested locally or on the runners?

I simply read the times from the GitHub Actions runs, but they may not be very accurate due to job concurrency and queueing.

@KDr2 (Member, Author) commented Mar 3, 2022

You can see the time the Integration Test consumed here: https://github.com/TuringLang/Libtask.jl/actions/workflows/IntegrationTest.yml

@rikhuijzer (Contributor) commented Mar 3, 2022

> This PR reduces the Integration Test time from 140–160 minutes to about 100 minutes.

Nice. I can confirm that this PR is good for performance. I ran the following subset of the Turing tests (Turing.jl/test/runtests.jl) on my system:

[...]
include(pkgdir(Turing)*"/test/test_utils/AllUtils.jl")

using Logging
Logging.disable_logging(Logging.Info)

@testset "Turing" begin
    @time include("inference/AdvancedSMC.jl")
    @time include("inference/gibbs.jl")
    @time include("contrib/inference/sghmc.jl")
    @time include("stdlib/RandomMeasures.jl")
end

master branch

 43.139966 seconds (87.17 M allocations: 5.339 GiB, 3.38% gc time, 94.57% compilation time)
443.315047 seconds (1.18 G allocations: 77.239 GiB, 3.21% gc time, 18.66% compilation time)
 13.570206 seconds (39.32 M allocations: 2.376 GiB, 4.77% gc time, 98.35% compilation time)
  9.087712 seconds (17.42 M allocations: 1.047 GiB, 4.29% gc time, 73.16% compilation time)

sep-data branch (this PR)

 41.664786 seconds (84.94 M allocations: 5.297 GiB, 3.60% gc time, 94.48% compilation time)
390.978920 seconds (842.42 M allocations: 69.394 GiB, 3.04% gc time, 20.74% compilation time)
 13.259697 seconds (39.32 M allocations: 2.376 GiB, 4.87% gc time, 98.34% compilation time)
  8.926231 seconds (17.57 M allocations: 1.072 GiB, 4.81% gc time, 67.26% compilation time)

The number of allocations either decreased or stayed roughly the same.

@rikhuijzer (Contributor):
> > > This PR reduces the Integration Test time from 140–160 minutes to about 100 minutes.
> >
> > Tested locally or on the runners?
>
> I simply read the times from the GitHub Actions runs, but they may not be very accurate due to job concurrency and queueing.

Mostly due to different CPUs: not all runners use the same CPU, and the difference in running time can be as big as 40%.

@KDr2 (Member, Author) commented Mar 3, 2022

> Nice. I can confirm that this PR is good for performance.

I think the main reason is that we now only need to copy the data (the variables), and no longer need to copy the instructions.
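
A hypothetical sketch of that idea, where the instruction list is shared and only the per-task bindings are duplicated; TapeSketch, InstructionSketch, and fork are illustrative names, not Libtask's actual types:

# Hypothetical sketch, not Libtask's API: keep the instruction list (the code)
# separate from the bindings (the data), so copying a task only duplicates the
# bindings while the instructions are shared by reference.
struct InstructionSketch
    f::Function
    inputs::Vector{Symbol}
    output::Symbol
end

struct TapeSketch
    instructions::Vector{InstructionSketch}  # shared, never copied
    bindings::Dict{Symbol,Any}               # per-task state, copied on fork
end

fork(t::TapeSketch) = TapeSketch(t.instructions, copy(t.bindings))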

@yebai (Member) left a comment

Thanks @KDr2 - nice improvements!


abstract type AbstractInstruction end
abstract type Taped end
Reviewer (Member):

Maybe keep this type for now - it’ll become useful again in future work.

@KDr2 (Member, Author):

At the moment it only has one subtype, TapedFunction, so I removed it. I think it won't be hard to add it back when we need it.

But I'm OK with bringing it back now if you insist 😄

@yebai (Member) commented Mar 3, 2022

I think it’s ready to merge when the CI passes.

@yebai merged commit f4e64d5 into master on Mar 3, 2022
The delete-merged-branch bot deleted the sep-data branch on March 3, 2022 at 15:44.

Successfully merging this pull request may close these issues.

Remove the dependency on MacroTools
Separate code and data in instructions