Replacing DB names with compilation DBs before network call #1
base: 1.5.latest-fork
Conversation
mainly questions
@@ -45,6 +45,7 @@
 from dbt.events.types import AdapterEventWarning
 from dbt.ui import line_wrap_message, warning_tag

+from dbt.adapters.snowflake.compilation_db_replacer import CompilationDBReplacer
If we leave it here, we should probably add a comment that it's imported from a different project (though I think even copying it to live here would be better than this weird import cycle). Maybe we can put the compilation_db_replacer in fd_common and then just add it as a regular project dependency, and to pass the correct DB names around we can use a file or something.
Wouldn't it cause a cyclic dependency between packages? Python will compile, since it is not a cyclic dependency between Python files, but it will cause an architectural nightmare later. Anyway, ildis suggested creating a new dependency package that will hold only compilation_db_replacer.py.
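For reference, a minimal sketch of what such a standalone compilation_db_replacer module could look like; only the class name and customer_db_to_compilation_db() are taken from this PR's diff, and the suffix-based naming rule is purely an assumption:

```python
# Hypothetical sketch of compilation_db_replacer.py as its own small package.
# Only the class name and customer_db_to_compilation_db() appear in the diff;
# the "_COMPILATION" suffix convention below is an assumption for illustration.

class CompilationDBReplacer:
    COMPILATION_SUFFIX = "_COMPILATION"  # assumed naming convention

    @classmethod
    def customer_db_to_compilation_db(cls, database):
        """Map a customer database name to its compilation database name."""
        if not database or database.upper().endswith(cls.COMPILATION_SUFFIX):
            # Unset or already a compilation DB: leave untouched.
            return database
        return f"{database}{cls.COMPILATION_SUFFIX}"
```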
-database=creds.database,
+database=compilation_db,
cool!
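For context, the surrounding connection-opening code presumably ends up looking roughly like this (a sketch; only the database=compilation_db keyword comes from the diff, and the rest of the snowflake.connector.connect() call is illustrative):

```python
# Sketch of the open-connection path in connections.py with the replacement
# applied before the network call. Only database=compilation_db comes from
# the diff; the other connect() arguments are illustrative.
import snowflake.connector

from dbt.adapters.snowflake.compilation_db_replacer import CompilationDBReplacer


def open_snowflake_connection(creds):
    # Swap the customer DB for its compilation DB before connecting.
    compilation_db = CompilationDBReplacer.customer_db_to_compilation_db(creds.database)
    return snowflake.connector.connect(
        account=creds.account,
        user=creds.user,
        warehouse=creds.warehouse,
        database=compilation_db,  # was: database=creds.database
    )
```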
if "already exists" in repr(e): | ||
connection, cursor = self._add_begin_commit_only_queries( |
when does this happen, and why is the fix to start a new transaction? why did replacing the database name cause this?
I don't know why some queries run twice now; it is a bit worrying. This happens when you try to create a seed table twice.
'commit' is just a random query that always succeeds and allows me to get a cursor, but you are right, it might have adverse effects.
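Roughly what that fallback looks like, assuming _add_begin_commit_only_queries just issues a harmless 'commit' to obtain a fresh (connection, cursor) pair; only the "already exists" check and the method name come from the diff:

```python
# Minimal sketch of the "already exists" fallback; the real dbt adapter
# plumbing is omitted. Only the "already exists" check and the
# _add_begin_commit_only_queries name mirror the diff.
class ConnectionsSketch:
    def __init__(self, connection):
        self.connection = connection  # assumed: an open snowflake.connector connection

    def add_query(self, sql):
        cursor = self.connection.cursor()
        cursor.execute(sql)
        return self.connection, cursor

    def add_query_tolerating_duplicates(self, sql):
        try:
            return self.add_query(sql)
        except Exception as e:
            # Some seed-table creates are issued twice; the second attempt
            # fails with "already exists", so swallow it but still return a
            # usable (connection, cursor) pair.
            if "already exists" in repr(e):
                return self._add_begin_commit_only_queries()
            raise

    def _add_begin_commit_only_queries(self):
        # 'commit' always succeeds, so it is used here only to obtain a cursor.
        return self.add_query("commit")
```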
-results = self.execute_macro(LIST_SCHEMAS_MACRO_NAME, kwargs={"database": database})
+compilation_db = CompilationDBReplacer.customer_db_to_compilation_db(database)
+results = self.execute_macro(LIST_SCHEMAS_MACRO_NAME, kwargs={"database": compilation_db})
hmm, the name of the database in the request should already be replaced by the code you added in the Snowflake adapter, no..? why do you need to replace it here too? (same question for all the changes in this file)
All these functions are called directly from the 'dbt' package, in addition to dbt calling the functions in connections.py.
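In other words, methods in impl.py are separate entry points that dbt core calls with the customer database name, so the mapping has to be repeated there; something like the following (a sketch; only the customer_db_to_compilation_db() / execute_macro() pair mirrors the diff):

```python
# Sketch of an impl.py entry point that dbt core calls directly with the
# customer database name, bypassing the substitution done in connections.py.
# Only the customer_db_to_compilation_db() / execute_macro() pair mirrors the
# diff; the surrounding class is a stand-in.
from dbt.adapters.snowflake.compilation_db_replacer import CompilationDBReplacer

LIST_SCHEMAS_MACRO_NAME = "list_schemas"


class SnowflakeAdapterSketch:
    def execute_macro(self, macro_name, kwargs):
        raise NotImplementedError  # stand-in for dbt's macro machinery

    def list_schemas(self, database):
        # dbt core passes the customer DB, so map it before the macro runs.
        compilation_db = CompilationDBReplacer.customer_db_to_compilation_db(database)
        return self.execute_macro(LIST_SCHEMAS_MACRO_NAME, kwargs={"database": compilation_db})
```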
dbt/adapters/snowflake/impl.py
Outdated
@@ -254,3 +260,33 @@ def submit_python_job(self, parsed_model: dict, compiled_code: str):

     def valid_incremental_strategies(self):
         return ["append", "merge", "delete+insert"]
+
+    def drop_relation(self, relation):
+        with RelationDBSubstitution(relation):
why is this implemented with a context manager that mutates the relation, and not just copy(relation) and replace the relevant field? that way it won't need to deal with cleanup at all.
It at least needs to be a deep copy, since I am substituting a second-level field (relation.path.database), so who knows what complexities that would cause. My solution is simple and works. Do you expect issues with it? Concurrency could be an issue in theory, but in practice the class uses a single Snowflake adapter, which is not thread safe anyway.
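For comparison, the two approaches under discussion sketched side by side, assuming the relation objects here are mutable; RelationDBSubstitution's actual implementation is not shown in the diff, so both versions below are assumptions:

```python
# Two hypothetical ways to substitute relation.path.database; neither is the
# PR's actual implementation, which is not shown in the diff.
import copy
from contextlib import contextmanager

from dbt.adapters.snowflake.compilation_db_replacer import CompilationDBReplacer


# Approach 1 (assumed shape of RelationDBSubstitution): mutate in place and
# restore the original database on exit, hence the cleanup step.
@contextmanager
def relation_db_substitution(relation):
    original_db = relation.path.database
    relation.path.database = CompilationDBReplacer.customer_db_to_compilation_db(original_db)
    try:
        yield relation
    finally:
        relation.path.database = original_db  # the cleanup the reviewer is asking about


# Approach 2 (reviewer's suggestion): deep-copy and replace the nested field,
# so nothing shared is mutated and no cleanup is needed.
def with_compilation_db(relation):
    substituted = copy.deepcopy(relation)  # deep copy because path.database is nested
    substituted.path.database = CompilationDBReplacer.customer_db_to_compilation_db(
        substituted.path.database
    )
    return substituted
```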
Force-pushed from b72b5a1 to e0b87ac (compare).
Force-pushed from e0b87ac to a96b443 (compare).
No description provided.