Add order Hints for Bulk Copy operations #2701

divang · 2025-07-09T09:28:41Z

Description

This PR adds support for specifying order hints during Bulk Copy operations in the Microsoft JDBC Driver for SQL Server. Order hints can be used to optimize data loading performance by informing SQL Server about the order of the incoming data, potentially improving index maintenance and query execution plans.

Changes

Introduced a new API to allow clients to specify one or more order hints for Bulk Copy operations.
Updated the Bulk Copy implementation to pass order hints to the underlying SQL Server command.
Added validation and documentation for supported order hint values.
Extended relevant tests to cover order hint scenarios.

Motivation

Enabling order hints helps users optimize large data transfers when loading sorted data into SQL Server tables, improving performance in ETL and data warehousing scenarios.

Testing

New unit and integration tests have been added to verify order hint handling.
Manual testing performed with large datasets to confirm improved performance and correctness.

Notes

Only supported on SQL Server versions that accept order hints in Bulk Copy.
Invalid or unsupported hints will result in an exception.

Test Code

    public static void demonstrateOrderHintUsage() {
        System.out.println("\n--- Demonstration: ASC/DESC Order Hints Usage ---");
        
        String tableName = "OrderHintDemo_" + System.currentTimeMillis();
        
        try (Connection conn = DriverManager.getConnection(CONNECTION_URL)) {
            
            // Create a table with a clustered index on (id ASC, timestamp DESC)
            try (Statement stmt = conn.createStatement()) {
                stmt.execute("IF OBJECT_ID('" + tableName + "', 'U') IS NOT NULL DROP TABLE " + tableName);
                
                String createSQL = "CREATE TABLE " + tableName + " (" +
                                 "id INT, " +
                                 "timestamp DATETIME, " +
                                 "data NVARCHAR(100), " +
                                 "INDEX CI_" + tableName + " CLUSTERED (id ASC, timestamp DESC)" +
                                 ")";
                stmt.execute(createSQL);
                System.out.println("Created table with clustered index: (id ASC, timestamp DESC)");
            }
            
            // Scenario 1: Data matches clustered index order - OPTIMAL
            System.out.println("\n--- Scenario 1: Data matches clustered index order ---");
            try (SQLServerBulkCopy bulkCopy = new SQLServerBulkCopy(conn)) {
                bulkCopy.setDestinationTableName(tableName);
                
                // Add column mappings
                bulkCopy.addColumnMapping("id", "id");
                bulkCopy.addColumnMapping("timestamp", "timestamp");
                bulkCopy.addColumnMapping("data", "data");
                
                // Add order hints that match the clustered index
                bulkCopy.addColumnOrderHint("id", SQLServerSortOrder.ASCENDING);      // Matches clustered index
                bulkCopy.addColumnOrderHint("timestamp", SQLServerSortOrder.DESCENDING); // Matches clustered index
                
                System.out.println(" Added order hints: id ASC, timestamp DESC (matches clustered index)");
                System.out.println(" This should provide optimal performance as data order matches index order");
                
                // Use sorted test data
                SortedTestData sortedData = new SortedTestData();
                bulkCopy.writeToServer(sortedData);
                
                System.out.println(" Bulk copy completed with matching order hints");
            }
            
            // Clear the table for next test
            try (Statement stmt = conn.createStatement()) {
                stmt.execute("TRUNCATE TABLE " + tableName);
            }
            
            // Scenario 2: Data doesn't match clustered index order - SUBOPTIMAL
            System.out.println("\n--- Scenario 2: Data doesn't match clustered index order ---");
            try (SQLServerBulkCopy bulkCopy = new SQLServerBulkCopy(conn)) {
                bulkCopy.setDestinationTableName(tableName);
                
                // Add column mappings
                bulkCopy.addColumnMapping("id", "id");
                bulkCopy.addColumnMapping("timestamp", "timestamp");
                bulkCopy.addColumnMapping("data", "data");
                
                // Add order hints that DON'T match the clustered index
                bulkCopy.addColumnOrderHint("id", SQLServerSortOrder.DESCENDING);   // Opposite of clustered index
                bulkCopy.addColumnOrderHint("timestamp", SQLServerSortOrder.ASCENDING);  // Opposite of clustered index
                
                System.out.println(" Added order hints: id DESC, timestamp ASC (opposite of clustered index)");
                System.out.println(" This may cause SQL Server to perform additional sorting operations");
                
                // Use reverse sorted test data
                ReverseSortedTestData reverseSortedData = new ReverseSortedTestData();
                bulkCopy.writeToServer(reverseSortedData);
                
                System.out.println(" Bulk copy completed with non-matching order hints");
            }
            
            // Verify data in both scenarios
            verifyData(conn, tableName);
            
        } catch (Exception e) {
            System.out.println("✗ Order hint demonstration failed: " + e.getMessage());
            e.printStackTrace();
        } finally {
            dropTestTable(tableName);
        }
        
    }

Performance result

FINAL 10-MINUTE PERFORMANCE RESULTS
Total iterations: 20
Rows per iteration: 25000
Total rows processed: 1500000

COMPREHENSIVE PERFORMANCE STATISTICS
Baseline (No Hints) : avg=8241.9 ms, min=8042 ms, max=9339 ms, stddev=300.9 ms, samples=20
Optimal (Correct Hints): avg=8173.9 ms, min=7998 ms, max=8746 ms, stddev=196.4 ms, samples=20

PERFORMANCE COMPARISON
Optimal is 0.8% FASTER than baseline

…ethod

codecov · 2025-07-09T09:47:14Z

Codecov Report

Attention: Patch coverage is 79.06977% with 9 lines in your changes missing coverage. Please review.

Project coverage is 51.55%. Comparing base (87f0553) to head (c6508ee).

Files with missing lines	Patch %	Lines
...om/microsoft/sqlserver/jdbc/SQLServerBulkCopy.java	79.06%	6 Missing and 3 partials ⚠️

Additional details and impacted files

@@             Coverage Diff              @@
##               main    #2701      +/-   ##
============================================
+ Coverage     51.50%   51.55%   +0.05%     
- Complexity     4050     4064      +14     
============================================
  Files           149      149              
  Lines         34136    34177      +41     
  Branches       5700     5707       +7     
============================================
+ Hits          17581    17620      +39     
- Misses        14076    14086      +10     
+ Partials       2479     2471       -8

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

mobilebilly added 30 commits July 7, 2024 10:18

Add order Hints for Bulk Copy operations (#1481)

442c816

Correct the wrong javadoc in SQLServerBulkCopy.addColumnOrderHint

1629062

Escape the column name in the order hints with square bracket correctly

8da2481

Add overload addColumn which accept custom column name

e35d099

Escape the Close escape character correctly in the escapeIdentifier m…

dcead0d

…ethod

Add unit test cases to the column order hints enhancement

e3c765b

Merge branch 'microsoft:main' into main

31b5d00

Merge branch 'microsoft:main' into main

e6a3005

Merge branch 'microsoft:main' into main

9b195c2

Merge branch 'microsoft:main' into main

f8827a2

Merge branch 'microsoft:main' into main

c4777de

Merge branch 'microsoft:main' into main

86e8f98

Merge branch 'microsoft:main' into main

971c2ec

Merge branch 'microsoft:main' into main

433da04

Merge branch 'microsoft:main' into main

761060b

Merge branch 'microsoft:main' into main

c071e5f

Merge branch 'microsoft:main' into main

a3cb495

Merge branch 'microsoft:main' into main

13c683e

Merge branch 'microsoft:main' into main

fab83aa

Merge branch 'microsoft:main' into main

87a0bd5

Merge branch 'microsoft:main' into main

ad6b8a4

Merge branch 'microsoft:main' into main

c8e872a

Merge branch 'microsoft:main' into main

02d7758

Merge branch 'microsoft:main' into main

39cc70e

Merge branch 'microsoft:main' into main

0c3fe65

Merge branch 'microsoft:main' into main

45d9060

Merge branch 'microsoft:main' into main

45daa81

Merge branch 'microsoft:main' into main

bf03f21

Merge branch 'microsoft:main' into main

11fd746

Merge branch 'microsoft:main' into main

8bede18

Merge branch 'microsoft:main' into main

c6508ee

divang added this to the 13.1.1 milestone Jul 9, 2025

divang requested review from David-Engel, Ananya2, muskan124947 and machavan July 9, 2025 13:32

machavan approved these changes Jul 10, 2025

View reviewed changes

muskan124947 approved these changes Jul 11, 2025

View reviewed changes

Ananya2 approved these changes Jul 11, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add order Hints for Bulk Copy operations #2701

Add order Hints for Bulk Copy operations #2701

Uh oh!

divang commented Jul 9, 2025 •

edited

Loading

Uh oh!

codecov bot commented Jul 9, 2025 •

edited

Loading

Uh oh!

Uh oh!

Add order Hints for Bulk Copy operations #2701

Are you sure you want to change the base?

Add order Hints for Bulk Copy operations #2701

Uh oh!

Conversation

divang commented Jul 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Changes

Motivation

Testing

Notes

Test Code

Performance result

Uh oh!

codecov bot commented Jul 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

Uh oh!

divang commented Jul 9, 2025 •

edited

Loading

codecov bot commented Jul 9, 2025 •

edited

Loading