OpenSourceRisk
diff --git a/‎Tools/PythonTools/Readme.md‎
Lines changed: 48 additions & 21 deletions b/‎Tools/PythonTools/Readme.md‎
Lines changed: 48 additions & 21 deletions
@@ -2,17 +2,14 @@
 
 The file *comparison_config.json* holds the default configuration that is used for comparing expected and generated csv, json and text files during the regression and Example test runs.
 
-Under the field `csv_settings` and `files`, there is a collection of objects. Each object has a key that is equal to either a file name or a regular expression that should ultimately match a file name. This configuration is read into an `OrderedDict` in Python and the files that are to be compared during a test run are compared against the keys to determine the comparison configuration that they should use. For this reason, keys with exact names should be placed before regular expression keys in the configuration so that they will be found first.
+Under the field `csv_settings`, `json_settings` and `files`, there is a collection of objects. Each object has a key that is equal to either a file name or a regular expression that should ultimately match a file name. This configuration is read into an `OrderedDict` in Python and the files that are to be compared during a test run are compared against the keys to determine the comparison configuration that they should use. For this reason, keys with exact names should be placed before regular expression keys in the configuration so that they will be found first.
 
-Each object under a given key, `file_name`, has the following format:
+Each object under a given key, `file_name`, has the following format (note that the formats of `csv_settings` and `json_settings` are different):
   ```js
   {
-    "csv_settings":
-    {
-      "files":
-      {
-        "file_name":
-        {
+    "csv_settings": {
+      "files": {
+        "file_name": {
           "keys": [
             "a",
             "b"
@@ -25,8 +22,7 @@ Each object under a given key, `file_name`, has the following format:
           "optional_cols": [
             "col4"
           ]
-          "rename_cols":
-          {
+          "rename_cols": {
             "A": "a",
             "B": "b",
           },
@@ -58,20 +54,51 @@ Each object under a given key, `file_name`, has the following format:
           ]
         }
       }
+    },
+    "json_settings": {
+      "files": {
+        "file_name": {
+          "ignore_keys": [
+            "key1",
+            "key2",
+            "key3/subkey1"
+          ],
+          "settings": [
+            {
+              "names": [
+                "key1/subkey1",
+                "key2/subkey1/subkey2"
+              ],
+              "abs_tol": 0.01,
+              "rel_tol": 0.001
+            }
+          ]
+        },
+        "all_string_file_name": {}
+      }
     }
   }
   ```
 
-- The `keys` specify which columns will be used as keys for the comparison. The comparison fails if all of these keys are not in both files to be compared.
-- The `use_cols` specifies on which columns the actual comparisons are evaluated.
-- The `optional_cols`, as with `use_cols` above, specifies columns on which comparisons are evaluated, but these columns are only included if they are present in both files. If they are not present in either file or if present in one file nad not the other, the corresponding comparison defined in `column_settings` below will not be evaluated for the missing column/s.
-- The `rename_cols` object specifies columns that should be renamed before the comparison is performed. In the example above, `A` would be renamed to `a` etc.
-- The `col_types` object allows you to explicitly specify the type of a given set of columns if necessary.
-- The `drop_rows` object allows you to specify a threshold for the values in a given set of columns. If the absolute value for a given row, in one of the specified columns, is below the threshold, the row is dropped from the comparison.
-- The `column_settings` object allows you to specify an absolute and/or a relative tolerance that should be used for a group of columns when comparing their values. There can be multiple groupings used in the `column_settings` array with different values of absolute and relative tolerance.
+For `csv_settings`:
+  - The `keys` specify which columns will be used as keys for the comparison. The comparison fails if all of these keys are not in both files to be compared.
+  - The `use_cols` specifies on which columns the actual comparisons are evaluated.
+  - The `optional_cols`, as with `use_cols` above, specifies columns on which comparisons are evaluated, but these columns are only included if they are present in both files. If they are not present in either file or if present in one file nad not the other, the corresponding comparison defined in `column_settings` below will not be evaluated for the missing column/s.
+  - The `rename_cols` object specifies columns that should be renamed before the comparison is performed. In the example above, `A` would be renamed to `a` etc.
+  - The `col_types` object allows you to explicitly specify the type of a given set of columns if necessary.
+  - The `drop_rows` object allows you to specify a threshold for the values in a given set of columns. If the absolute value for a given row, in one of the specified columns, is below the threshold, the row is dropped from the comparison.
+  - The `column_settings` object allows you to specify an absolute and/or a relative tolerance that should be used for a group of columns when comparing their values. There can be multiple groupings used in the `column_settings` array with different values of absolute and relative tolerance.
+
+  For new files that would require the same comparison config as another standard file, e.g "simm_additional.csv" from a SIMM Impact calc would be the same as a "simm.csv" report from a SIMM calc, they can copy that file's comparison config:
+
+  For the regression tests under the *RegressionTests* directory and the Example tests under *Examples* and *ExamplesPlus*, each test may have its own specific comparison configuration file following this format. If a test specific comparison configuration file is present, it is merged with this default comparison configuration file to give the final comparison configuration file used for the test. The merge function is in the file *merge_comparison_configs.py* and it uses the following logic:
+  - The test specific file is used as the starting point for the final merged configuration `OrderedDict`.
+  - Any file names in the default comparison configuration file that are not in the test specific comparison configuration file, are added *at the end* of the merged configuration `OrderedDict`. They will therefore only be used during comparison if there is not a match in the test specific file.
 
-For new files that would require the same comparison config as another standard file, e.g "simm_additional.csv" from a SIMM Impact calc would be the same as a "simm.csv" report from a SIMM calc, they can copy that file's comparison config:
+For `json_settings`:
+  - Any key value (i.e. in `ignore_keys`, `settings.names`, etc.) must include the parent, if any. Using the sample comparison_config.json template above, we would ignore "key1" and "key2" at the top level in a JSON file comparison, and "subkey1" only if it appears inside of "key3".
+  - The `ignore_keys` is an array of strings, each string being a key in the JSON file. If the key is found in one or both of the files, any diffs will be ignored for this key and its children (i.e. if the value is itself a nested object).
+  - The `settings` object works the same as the `column_settings` file in `csv_settings`, except that the keys must include the parent in the JSON `settings`.
+  - **NOTE:** In order for a JSON check to be applied, a comp config must be provided for the filename, even if the config is empty (see e.g. `all_string_filename` in the template above). Otherwise a direct file comparison will be done.
+  - **NOTE:** String diffs are automatically processed (i.e. unless they are in `ignore_keys`, then a string diff will be a failing diff) (see e.g. `all_string_filename` in the template above). Only numerical differences need to be handled in `settings`.
 
-For the regression tests under the *RegressionTests* directory and the Example tests under *Examples* and *ExamplesPlus*, each test may have its own specific comparison configuration file following this format. If a test specific comparison configuration file is present, it is merged with this default comparison configuration file to give the final comparison configuration file used for the test. The merge function is in the file *merge_comparison_configs.py* and it uses the following logic:
-- The test specific file is used as the starting point for the final merged configuration `OrderedDict`.
-- Any file names in the default comparison configuration file that are not in the test specific comparison configuration file, are added *at the end* of the merged configuration `OrderedDict`. They will therefore only be used during comparison if there is not a match in the test specific file.