Suggest to create an extended statistics entry about a functional dependency between columns

After some discussion over here https://github.com/ankane/dexter/issues/32 I'v following suggestion:

Optimization assumes that columns are independent, so a query plan can go wrong if there are functional dependencies (FD) between columns which he doesn't know. Real world datasets often have FDs between columns because of denormalization or just because the data is like this. 

Actually, finding FDs for all columns could become a rather hard problem to solve. But hopefully this challenge becomes feasible if applied only to columns as part of given slow queries.

**Expected behaviour:**

PostgreSQL offers CREATE STATISTICS in order to inform the planner about FDs. So, looking e.g. at the freely available database "ATP tennis tour", and given a slow query involving tourney_id, tourney_name, pg_plan_advsr could make a specific suggestion like this:

````
create statistics atp_matches_2019_tourney_id_tourney_name (dependencies) 
	on tourney_id, tourney_name 
	from atp_matches_2019;
````

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Suggest to create an extended statistics entry about a functional dependency between columns #7

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Suggest to create an extended statistics entry about a functional dependency between columns #7

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions