I have two files and I am getting few columns based on one common column.. but I want the unique records not based on the key column.
right now I am using sort stage and passing all the column name as key column in the hash partition. is this the right approach or do we have any better approach to get unique records from all the column
get unique columns
Moderators: chulett, rschirm, roy