Access and Feeds

Synthetic Data and Digital Twins: A Synergistic Cycle of Continuous Improvement

By Dick Weisinger

The current generation of machine learning and AI algorithms requires training with data. Lots of it. The algorithms scan massive amounts of data to identify recurring patterns. Once trained, when an AI algorithm encounters similar data, it can recognize known patterns, often better than a human can, and respond appropriately.

ML and AI work well when there is lots of data, but what about situations where only sparse data is available? Without sufficient data, an AI model can’t be trained to the point where it will respond correctly.

When there isn’t sufficient data available, researchers can generate simulated data using a ‘digital twin’. A digital twin is a software simulation of a process, system, or object. Digital twin software might be built using computer-aided design, finite element analysis, physics engines, statistical and probabilistic techniques, and other software modeling tools. Many simulations are then run under different assumptions and environmental conditions, and the output data from the digital twin is collected. In this way, simulation can provide a large pool of training data for an AI/ML model.
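As a concrete illustration, here is a minimal sketch of that workflow in Python: a toy physics model (Newton’s law of cooling) stands in for the digital twin, and it is run many times under randomized conditions to build a synthetic dataset. Every function name, parameter, and value range below is a hypothetical example, not drawn from any real system.

```python
# A minimal sketch of simulation-based synthetic data generation.
# The "digital twin" here is a toy physics model (Newton's law of
# cooling); every name, parameter, and range is illustrative only.
import random

def digital_twin(initial_temp, ambient_temp, k, steps=60, dt=1.0):
    """Simulate cooling with the update T += -k * (T - ambient) * dt."""
    temp = initial_temp
    for _ in range(steps):
        temp += -k * (temp - ambient_temp) * dt
    return temp

# Run many simulations under different assumptions and environmental
# conditions, then collect the outputs as a pool of training data.
synthetic_data = []
for _ in range(1000):
    initial = random.uniform(60.0, 100.0)  # varied starting condition
    ambient = random.uniform(15.0, 30.0)   # varied environment
    k = random.uniform(0.01, 0.10)         # varied system property
    final_temp = digital_twin(initial, ambient, k)
    # Each record pairs the simulation inputs with the observed output.
    synthetic_data.append(((initial, ambient, k), final_temp))

print(f"{len(synthetic_data)} synthetic training examples generated")
```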

George Brunner, VP of analytics and CTO at Acument Analytics, said that “a digital twin uses data from the physical entity to create the algorithm which then ‘models’ the physical entity. Once the digital twin AI algorithm is created it can then be used to generate synthetic data. Therefore, they can work in unison in an AI workflow cycle. Synthetic data can ‘prime the pump’ to create the initial digital twin. Data capture from the physical twin allows the digital twin to improve over time. Then the digital twin can be used to enhance the quality of Synthetic data in a cycle of continuous improvement.”
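The sketch below renders that cycle schematically: synthetic data ‘primes the pump’ for an initial twin, captured physical measurements refine it, and the refined twin generates better synthetic data on the next pass. The fit, generate, and capture functions are hypothetical placeholders for real modeling and sensor-ingestion steps, not any actual API.

```python
# A schematic sketch of the continuous-improvement cycle described above.
# fit_twin, generate_synthetic, and capture_physical_data are
# hypothetical placeholders, not a real library API.
import random

def fit_twin(data):
    """Placeholder model fit: estimate the mean response from samples."""
    return sum(data) / len(data)

def generate_synthetic(twin, n):
    """Placeholder generator: sample synthetic observations from the twin."""
    return [random.gauss(twin, 1.0) for _ in range(n)]

def capture_physical_data(n=10):
    """Placeholder sensor read from the physical twin."""
    return [random.gauss(42.0, 2.0) for _ in range(n)]

# 1. 'Prime the pump': build the initial twin from synthetic data alone,
#    seeded here with an assumed first guess of 40.0.
twin = fit_twin(generate_synthetic(twin=40.0, n=100))

# 2. Iterate: captured physical data improves the twin, and the improved
#    twin generates higher-quality synthetic data for the next round.
for cycle in range(5):
    measurements = capture_physical_data()
    twin = fit_twin(measurements + generate_synthetic(twin, n=50))
    print(f"cycle {cycle}: twin estimate = {twin:.2f}")
```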

A report by Capgemini found that “digital twins provide the ideal playground to test hypotheses, train and evaluate algorithms, test transparency, and generate synthetic data and events – exploring levels of ‘smart’ that initially might even seem inapplicable to the real world.”

Vaibhav Nivargi, co-founder and chief technology officer of Moveworks, told the Wall Street Journal that “synthetic data becomes very important because we operate in a domain with limited data.” Gartner predicts that “by 2024, 60% of the data used for the development of AI and analytics projects will be synthetically generated.”
