What is Name Normalization and meta data aggregation?
You can apply custom aggregation rules to meta data for assignees, inventors and IPCs to have them considered as one during the similarity calculation and clustering process.
Setting up Name Normalization
- Create an aggregation rule file in .xlsx, .tsv, or .txt format 
- Upload the file under More… → Name Normalization 
- Select the aggregation rule on Options tab before starting the analysis 
Creating an Aggregation Rule
Matching Methods
MATCHING METHOD              IDENTIFIER
Perfect match                            full
Partial match                             partial
Forward match                          forward
Backward match                       backward
- Create an Excel file starting with the name you want to appear in the analysis 
- In the next column, enter the matching method 
- In the following columns enter the text to be used for aggregation 
- Start a new row to enter multiple rules in one file 
If using .tsv or .txt, follow the same steps and separate each section with a tab
Format
Aggregated word 1[TAB]matching method identifier[TAB]aggregation target[TAB]aggregation target…
- Applicant: abc[TAB]partial[TAB]abc corporation[TAB]ABC co 
- Inventor: John Smith[TAB]full[TAB]JOHN SMITH[TAB]John SMITH[TAB]john smith 
- IPC: G06N[TAB]forward[TAB]G06N 5 
For example: BOEING Company[TAB]partial[TAB]Boeing[TAB]BOEING[TAB]The Boeing. This will aggregate names such as Boeing, Inc., BOEING Inc, The Boeing Corporation USA, etc. in the dataset into BOEING Company.
