# SmartConnector Diff Checking

{% hint style="success" %}
**Audience**: Admins, Developers, Solution Architects

**Purpose**: Explains what diff checking is, how it works, how to enable and disable it, and when its built-in behavior is not appropriate for a given use case.
{% endhint %}

## Overview

Diff checking is an optional optimization that prevents SmartConnectors from re-processing rows that have not changed since the previous run. In recurring imports, diff checking often speeds up processing by 10x or more, but understanding exactly what it compares, and what it does not, is essential to using it correctly.

### What Is Diff Checking?

Diff checking is a mechanism that compares each incoming row against the previous run's output before deciding whether to send that row to the load step. If the contents of a row are the same as the last run, it is skipped. Only rows whose contents have changed are sent through to the load step.

This is most valuable for recurring imports that ingest complete dataset files on a schedule. When most rows are identical from run to run, diff checking eliminates the overhead of re-processing thousands of unchanged records.

One important detail: diff checking occurs *after* variable resolution. The hash is computed on the mapped execution variable values. If you want a value to be considered by the diff, map it as a variable. This works even if the variable is not used by a later load step: any mapped variable contributes to the diff.

***

## How To Enable Diff Checking

Diff checking is configured at the SmartConnector level and applies to the entire SmartConnector when enabled.
It is not set per output table.

Diff checking can also be toggled directly in the Run GUI at the time of starting a run. Turning it off before a run forces a full re-ingestion regardless of what was processed in previous runs. Every row is sent to the load step as if it were new.

{% hint style="info" %}
**Troubleshooting Tip**: If a SmartConnector ran successfully but data is not updating as expected, check two things. First, load step conflict resolution rules (for example, "only update if blank" on a field that is not blank). Second, diff check skipping the row; the run report shows which rows were skipped, or you can toggle diff check off and re-run to rule it out.
{% endhint %}

***

## Behavior and Limitations

Diff check compares the current run's output against the previous successful run's output, not against the current state of entities in Kizen. If the previous run failed, it is not used for comparison, and the next run will diff against the most recent successful run before it. This distinction matters in any environment where entities can be modified between runs.

This behavior is by design for straightforward ingestion workflows, where the source file is the authoritative data source and Kizen entities are not expected to be modified independently. In those cases, skipping unchanged rows is safe and efficient.

The built-in diff check is not the right tool for every use case. If your pipeline needs to detect and correct changes that have been made directly to Kizen entities between runs, diff checking will miss those changes entirely.

#### Custom diff checking

For cases where the built-in diff check is not sufficient, it is also possible to use SQL processing to compare incoming data against reference data from Kizen.
This allows for more complex comparisons, such as checking against the current state of entities in Kizen rather than the previous run's output.

The most common reason to reach for a custom diff is to handle entities that are missing from the source data. For example, if one run's file contains entities A, B, and C, and the next run's file contains B, C, and D, a custom diff can detect that A is no longer present and take action on it (such as expiring or archiving the entity). The built-in diff check cannot do this, because it only evaluates rows that are in the incoming file.

This is a power-user feature. It requires significant SQL ability to implement correctly, and most SmartConnectors will not need it.

{% hint style="warning" %}
**Caution**: Diff checking has no visibility into changes made directly to Kizen entities between runs. If a user, automation, or other process updates an entity after the last run, the SmartConnector will not detect that change. The row will hash to the same value as before and will be skipped on the next run.
{% endhint %}

***

## What's Next

With diff checking configured, you have everything you need to run your SmartConnector reliably. Continue to [Running a SmartConnector](/docs/concepts/smartconnectors/running-a-smartconnector.md) to learn how to activate your SmartConnector, execute a dry run, interpret the XLS output report, and understand what each execution status means.
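
As a rough illustration of the behavior described under Behavior and Limitations, the Python sketch below contrasts a hash-based skip with the "missing from the source" check from the A, B, C then B, C, D example. It is a conceptual model only, not the platform's implementation: the function names (`row_hash`, `builtin_style_diff`, `missing_keys`), the choice of SHA-256 over sorted JSON, the `id` key column, and the sample rows (including the changed value for C) are all assumptions made for illustration.

```python
import hashlib
import json


def row_hash(mapped_vars: dict) -> str:
    """Hash a row's mapped variable values (assumed scheme: SHA-256 over sorted JSON)."""
    canonical = json.dumps(mapped_vars, sort_keys=True, default=str)
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()


def builtin_style_diff(previous_hashes: dict, incoming_rows: list, key: str = "id") -> list:
    """Return only incoming rows that are new or whose hash differs from the previous run."""
    changed = []
    for row in incoming_rows:
        if previous_hashes.get(row[key]) != row_hash(row):
            changed.append(row)
    return changed


def missing_keys(previous_hashes: dict, incoming_rows: list, key: str = "id") -> set:
    """Custom-diff idea: keys seen in the previous run that are absent from the incoming file."""
    return set(previous_hashes) - {row[key] for row in incoming_rows}


# Hypothetical previous successful run: entities A, B, and C.
run_1 = [
    {"id": "A", "email": "a@example.com"},
    {"id": "B", "email": "b@example.com"},
    {"id": "C", "email": "c@example.com"},
]
previous = {row["id"]: row_hash(row) for row in run_1}

# Hypothetical next file: B is unchanged, C has changed, D is new, A is gone.
run_2 = [
    {"id": "B", "email": "b@example.com"},
    {"id": "C", "email": "c-new@example.com"},
    {"id": "D", "email": "d@example.com"},
]

to_load = builtin_style_diff(previous, run_2)
print([row["id"] for row in to_load])  # ['C', 'D']
print(missing_keys(previous, run_2))   # {'A'}
```

Neither function looks at the current state of entities in Kizen. In this model, a change made directly in Kizen between runs leaves `previous` untouched, so an unchanged source row would still be skipped, which matches the caution above.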

Related Topics

* [SmartConnector SQL Processing](/docs/concepts/smartconnectors/smartconnector-sql-processing.md)
* [SmartConnector External Data Sources](/docs/concepts/smartconnectors/smartconnector-external-data-sources.md)
* [SmartConnector Execution Variables](/docs/concepts/smartconnectors/smartconnector-execution-variables.md)
* [SmartConnector Load Steps](/docs/concepts/smartconnectors/smartconnector-load-steps.md)
