Can dataclass ordering avoid tuple wrappers for single field comparisons?

Proposal and Rationale
--------------

When `order=True`, the generated rich comparison methods wrap the target field in a tuple.  This makes sense when multiple fields are being compared. However for a single field, it adds unnecessary extra work:  two extra tuple allocations and deallocations, a tuple compare, and superfluous equality check on the field.

For example, the [heapq docs](https://docs.python.org/3/library/heapq.html#priority-queue-implementation-notes) suggest making the following class:

```python
from dataclasses import dataclass, field
from typing import Any

@dataclass(order=True)
class PrioritizedItem:
    priority: int
    item: Any=field(compare=False)
```

The generated code for less-than operation is:

```python
 def __lt__(self,other):
  if other.__class__ is self.__class__:
   return (self.priority,)<(other.priority,)
  return NotImplemented
```

Instead, it would be nicer to generate:

```python
 def __lt__(self,other):
  if other.__class__ is self.__class__:
   return self.priority<other.priority
  return NotImplemented
```

Behavior change
------------

When the tuple wrapper was removed from `__eq__` in favor of individual field comparisons linked by `and`, it resulted in faster code that matched what people would write by hand. However, it also caused issues where users had been relying on the identity checks inside the tuple comparison. That affects us here as well:

```python
from math import nan

>>> nan <= nan
False
>>> (nan,) <= (nan,)
True
```

Unlike the impact of the `__eq__` change, users of ordering operations are far less likely to be relying on identity-implies-equality.

One reason is that `__eq__` is used with hashing but ordering operations are not.  So the prior change affected `dict()` and `set()` while the proposed change does not.

Another reason is that it only affects `__le__` and `__ge__` but not `__eq__`, `__ne__`, `__lt__`, and `__ge__`, so it doesn't show-up 100% of the  time.  

Also Python's ordering tools (`sorted`, `min`, `max`, `nsmallest`, `nlargest`, `merge`, `bisect`, and `heapq`) do not use  `__le__` and `__ge__`. So they are entirely unaffected.

The upside of the behavior change is that it fixes an existing inconsistency between '<', '==', and '<=':

```python
>>> PrioritizedItem(nan, 'x') < PrioritizedItem(nan, 'y')
False
>>> PrioritizedItem(nan, 'x') == PrioritizedItem(nan, 'y')
False 

# Inconsistent with the previous two calls
>>> PrioritizedItem(nan, 'x') <= PrioritizedItem(nan, 'y')  
True
```

This inconsistency is unexpected because it does not show up in hand-written code:

```python
>>> nan < nan
False
>>> nan == nan
False
>>> nan <= nan
False
```




### Linked PRs
* gh-144222

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Can dataclass ordering avoid tuple wrappers for single field comparisons? #144191

Proposal and Rationale

Behavior change

Linked PRs

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Uh oh!

Can dataclass ordering avoid tuple wrappers for single field comparisons? #144191

Description

Proposal and Rationale

Behavior change

Linked PRs

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions