What are Name Normalization and Metadata Aggregation?

You can apply custom aggregation rules to the metadata for assignees, inventors, and IPCs so that variant spellings or related entries are treated as a single entity during the similarity calculation and clustering process.

Setting up Name Normalization

  • Create an aggregation rule file in .xlsx, .tsv, or .txt format

  • Upload the file under More…Name Normalization

  • Select the aggregation rule on the Options tab before starting the analysis

Creating an Aggregation Rule

Matching Methods

MATCHING METHOD     IDENTIFIER
Perfect match       full
Partial match       partial
Forward match       forward
Backward match      backward
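
The four identifiers can be read as simple string tests. The sketch below is an illustration only: it assumes that full means exact equality, partial means the target appears anywhere in the name, forward means the name starts with the target, and backward means the name ends with the target, and that matching is case-sensitive. The names MATCHERS and matches are hypothetical and not part of the product.

```python
# Assumed semantics for each identifier; the tool's actual behavior may differ.
# Matching is assumed to be case-sensitive.
MATCHERS = {
    "full": lambda target, name: name == target,               # perfect match
    "partial": lambda target, name: target in name,            # target anywhere in name
    "forward": lambda target, name: name.startswith(target),   # target at the start
    "backward": lambda target, name: name.endswith(target),    # target at the end
}


def matches(method: str, target: str, name: str) -> bool:
    """Return True if `name` satisfies `method` for the given aggregation target."""
    return MATCHERS[method](target, name)


print(matches("forward", "G06N 5", "G06N 5/02"))    # True
print(matches("full", "JOHN SMITH", "John Smith"))  # False under the case-sensitive assumption
```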

  • Create an Excel file and, in the first column, enter the name you want to appear in the analysis

  • In the next column, enter the matching method identifier

  • In the following columns, enter the text to be used for aggregation, one aggregation target per column

  • Start a new row for each additional rule to enter multiple rules in one file

If using .tsv or .txt, follow the same steps and separate each field with a tab
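
A rule file in tab-separated form can also be generated with a short script rather than typed by hand. The following is a minimal sketch using Python's standard csv module. The file name aggregation_rules.tsv, the helper write_rules, and the UTF-8 encoding are assumptions made for illustration; the rows reuse the sample rules from this article.

```python
import csv

# Each row: aggregated name, matching method identifier, then one or more targets.
rules = [
    ["abc", "partial", "abc corporation", "ABC co"],
    ["John Smith", "full", "JOHN SMITH", "John SMITH", "john smith"],
    ["G06N", "forward", "G06N 5"],
]


def write_rules(path, rows):
    """Write one rule per line with fields separated by tabs (UTF-8 is an assumption)."""
    with open(path, "w", newline="", encoding="utf-8") as f:
        writer = csv.writer(f, delimiter="\t", lineterminator="\n")
        writer.writerows(rows)


write_rules("aggregation_rules.tsv", rules)  # hypothetical file name
```

Rows may have different lengths, since each rule can list any number of aggregation targets.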


Format

Aggregated word 1[TAB]matching method identifier[TAB]aggregation target[TAB]aggregation target…

  • Applicant: abc[TAB]partial[TAB]abc corporation[TAB]ABC co

  • Inventor: John Smith[TAB]full[TAB]JOHN SMITH[TAB]John SMITH[TAB]john smith

  • IPC: G06N[TAB]forward[TAB]G06N 5

For example:

BOEING Company[TAB]partial[TAB]Boeing[TAB]BOEING[TAB]The Boeing

This rule aggregates names such as Boeing, Inc., BOEING Inc, and The Boeing Corporation USA in the dataset into BOEING Company.
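
To check a rule before uploading it, its effect can be approximated locally. The sketch below parses the Boeing rule and applies it to a few sample names. It is not the product's implementation: it assumes case-sensitive matching, the matching semantics described in the comments, and that the first matching rule wins. The names parse_rule, normalize, and MATCHERS are hypothetical, and "Airbus SE" is an invented sample value used only to show a non-matching name.

```python
# Assumed, case-sensitive semantics for the four identifiers.
MATCHERS = {
    "full": lambda target, name: name == target,
    "partial": lambda target, name: target in name,
    "forward": lambda target, name: name.startswith(target),
    "backward": lambda target, name: name.endswith(target),
}


def parse_rule(line):
    """Split one tab-separated rule line into (aggregated name, method, targets)."""
    aggregated, method, *targets = line.rstrip("\n").split("\t")
    return aggregated, method, targets


def normalize(raw_name, rules):
    """Return the aggregated name of the first matching rule, else the raw name."""
    for aggregated, method, targets in rules:
        if any(MATCHERS[method](target, raw_name) for target in targets):
            return aggregated
    return raw_name


rules = [parse_rule("BOEING Company\tpartial\tBoeing\tBOEING\tThe Boeing")]

for raw in ["Boeing, Inc.", "BOEING Inc", "The Boeing Corporation USA", "Airbus SE"]:
    print(raw, "->", normalize(raw, rules))
# Boeing, Inc. -> BOEING Company
# BOEING Inc -> BOEING Company
# The Boeing Corporation USA -> BOEING Company
# Airbus SE -> Airbus SE
```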