
Combining operators

Sort, Project, and Unique

Project early

If we are sorting on [a, b, c] and then projecting on the same columns, then why not project early?

No need to carry around all the other columns that are just going to be eliminated.

So, generalize Sort:

  • Add an argument specifying projection columns.

  • Do the projection in the first pass, while producing the initial sorted runs, so every later merge pass handles narrower rows.
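
The generalized Sort described above can be sketched roughly as follows. This is a minimal illustration, not the course's actual operator: the function name `sort_with_projection`, its parameters, and the `run_size` limit are all hypothetical, rows are assumed to be dicts, and in-memory lists stand in for the on-disk runs a real external sort would write. It also assumes every sort column appears among the projection columns.

```python
import heapq
from itertools import islice


def sort_with_projection(rows, sort_cols, project_cols, run_size=2):
    """Hypothetical sketch: external sort that projects during the first pass."""
    # Sort keys are looked up in the projected row layout,
    # so each sort column must also be a projection column (assumption).
    key_idx = [project_cols.index(c) for c in sort_cols]

    def key(row):
        return tuple(row[i] for i in key_idx)

    runs = []  # each list stands in for one sorted run on disk
    it = iter(rows)
    while True:
        chunk = list(islice(it, run_size))
        if not chunk:
            break
        # First pass: drop the unneeded columns before sorting the chunk,
        # so the run already contains only the projected columns.
        projected = [tuple(r[c] for c in project_cols) for r in chunk]
        projected.sort(key=key)
        runs.append(projected)

    # Later pass: merge the narrow, already-sorted runs.
    return list(heapq.merge(*runs, key=key))


rows = [
    {"a": 3, "b": 1, "c": 0, "d": "x"},
    {"a": 1, "b": 2, "c": 0, "d": "y"},
    {"a": 2, "b": 0, "c": 5, "d": "z"},
]
print(sort_with_projection(rows, ["a", "b", "c"], ["a", "b", "c"]))
```

Column `d` is discarded as soon as each chunk is read, so neither the runs nor the merge step ever carry it.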
