[Bug]: Spans associated with experiment items are collected under incorrect traces #4483

### What component(s) are affected?

- [x] Opik Python SDK
- [ ] Opik Typescript SDK
- [ ] Opik Agent Optimizer SDK
- [ ] Opik UI
- [x] Opik Server
- [ ] Documentation

### Opik version

- Opik version: 1.9.48

### Describe the problem

I'm using `opik.evaluation.evaluator.evaluate` together with `opik.evaluation.metrics.ragas_metric.RagasMetricWrapper` and `track=True` to evaluate my RAG application and compute Ragas metrics.

**Expected behavior**

After evaluation, each experiment item should have exactly one associated trace, and that trace should contain the spans showing the steps used to calculate the metrics for that item.


**Actual behavior**

After evaluation, each experiment item does have one associated trace. However:

- Some traces are empty and contain no spans.
- Other traces contain spans that belong to other traces.

I also noticed that the number of non-empty traces is equal to the number of task threads.
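
I have not confirmed the cause, but the match with the thread count is what one would see if the "current trace" were stored per worker thread and not reset between experiment items. The sketch below is a toy model of that pattern only. It does not use Opik, and every name in it (`run_toy_evaluation`, `current_trace`, the span strings) is hypothetical. The stale-state bug is introduced on purpose to show the symptom.

```python
import threading
from concurrent.futures import ThreadPoolExecutor


def run_toy_evaluation(num_items=6, task_threads=2):
    """Toy model (not Opik code): spans attach to a stale per-thread trace."""
    state = threading.local()   # per-thread "current trace" holder
    lock = threading.Lock()     # protects the shared dict and set below
    spans_by_trace = {}         # trace_id -> list of span names
    threads_used = set()        # identifiers of worker threads that ran a task

    def evaluate_item(item_id):
        trace_id = f"trace-{item_id}"
        with lock:
            spans_by_trace[trace_id] = []   # one trace per experiment item
            threads_used.add(threading.get_ident())
        # Deliberate bug: only the first trace seen by a thread becomes
        # "current"; later items handled by the same thread never replace it.
        if not hasattr(state, "current_trace"):
            state.current_trace = trace_id
        with lock:
            spans_by_trace[state.current_trace].append(f"span-for-item-{item_id}")

    with ThreadPoolExecutor(max_workers=task_threads) as pool:
        list(pool.map(evaluate_item, range(num_items)))
    return spans_by_trace, threads_used


if __name__ == "__main__":
    spans_by_trace, threads_used = run_toy_evaluation()
    non_empty = [t for t, spans in spans_by_trace.items() if spans]
    print("traces:", len(spans_by_trace))
    print("non-empty traces:", len(non_empty))
    print("worker threads used:", len(threads_used))
```

In this model every item still gets its own trace, but the number of non-empty traces always equals the number of worker threads that handled at least one item. That matches the symptom described above. Whether the SDK behaves this way internally is an assumption that needs checking.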

**Screenshots**

An empty trace:

<img width="2553" height="1252" alt="Image" src="https://github.com/user-attachments/assets/a41115f7-e4b0-4614-93dd-a48298e62203" />

A trace with unrelated spans:

<img width="2560" height="1254" alt="Image" src="https://github.com/user-attachments/assets/80bc3c25-9a97-4867-a203-7371a4680310" />

### Reproduction steps and code snippets

Code snippet that I'm using to evaluate the RAG application:

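```python
import opik
from opik.evaluation.evaluator import evaluate
from opik.evaluation.metrics.ragas_metric import RagasMetricWrapper


def get_ragas_metrics():
    # Returns the list of Ragas metrics to compute (body omitted).
    ...


def run_rag(input_data):
    # Evaluation task: runs the RAG application on one dataset item (body omitted).
    ...


opik_client = opik.Opik()
ragas_metrics = get_ragas_metrics()
# track=True: the metric computation is expected to show up as spans in the item's trace.
scoring_metrics = [RagasMetricWrapper(metric, track=True) for metric in ragas_metrics]
num_task_threads = 2

evaluation_result = evaluate(
    opik_client.get_dataset("Dataset Name"),
    run_rag,
    scoring_metrics=scoring_metrics,
    experiment_name=None,
    verbose=2,
    task_threads=num_task_threads,
)
```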

### Error logs or stack trace

_No response_

### Healthcheck results

_No response_
