Train Machine Learning Models with Synthetic Data

The Lightning App built with NVIDIA Omniverse Replicator (try it for free today!) lets you generate synthetic data to enhance the training and performance of your AI models.

Why did we build the NVIDIA Omniverse Replicator Lightning App?

We built this Lightning App because we’re confident that synthetic data will transform how researchers and engineers train models and implement AI solutions.

For all the talk about how technology powered by artificial intelligence will automate away many aspects of the way we work, there remains a large obstacle in the way of achieving that automation: data. Data is the beginning – and, depending on who you ask, the key – of any machine learning project. That’s because machine learning models, the things that power artificial intelligence, need to learn from something. In order for a computer to, say, spot a crack in the windshield of a car or determine whether a human is standing on an oil rig, it first has to learn what those things look like. That means giving a computer thousands (if not more) of pictures of cracked windshields to learn from.


No data? No project.

But what happens if you don’t have those datasets at your disposal? That’s where synthetic data comes in. You can always go and take a few thousand photos of cracked windshields yourself, but that’s the kind of project that would be an artist’s life’s work – not the first step in the agile implementation of a machine learning tool that can automate a process in a factory or garage. In the past, the lack of high-quality, labeled datasets could simply stop a project dead in its tracks.

So, what alternatives are there? You can hire a dedicated team to gather, label, and prepare this data to train your model, but that’s a nonstarter if your organization doesn’t have the resources to commit to a project of that magnitude. Whether you’re a solo researcher or a machine learning engineer, acquiring this ground truth data is an expensive, lengthy process that is often subject to human error. Not exactly an automated utopia.


More high-quality data, faster.

That’s why we teamed up with NVIDIA to bring NVIDIA Omniverse Replicator, a software development kit for synthetic data generation, to a Lightning App. With it, you can generate physically accurate 3D datasets, complete with ground-truth annotations that can be used to train models – all from your browser. This gives you access to synthetic data faster and more easily than collecting and labeling it by hand. You can also run this App on the Lightning Cloud (backed by AWS), or on your own private cluster if you’re working with stricter compliance guidelines.


Lightning + NVIDIA Omniverse Replicator = less time and money spent on data.

If you’re a company looking to streamline a process by implementing artificial intelligence, you no longer need to spend countless hours and a mountain of resources collecting and preparing data. In industries like retail, for instance – where artificial intelligence has already transformed the way we shop for everything from books to baby food – synthetically generated datasets can be used to train models for use in warehouse logistics and inventory management. You can train a machine learning model to spot defective windshields in a factory, all without ever having to crack a single pane of glass yourself.

In short, you can use the Lightning App built on NVIDIA Omniverse Replicator to generate a dataset for any task where a computer needs to see something, such as training AI perception models.

In the past, these kinds of tools were typically available only to organizations with deep pockets and plenty of resources to spare. As part of our ongoing efforts to expand access to the tools that are defining the next generation of industry, we’ve made this Lightning App available for use by anyone, anywhere. You don’t need to spend countless hours and resources building your own solution – we’ve done that for you.


Try the Lightning App for free today! We’re always on the lookout for the amazing things our community creates with Lightning. Get in touch by joining our Slack, whether you want to give us feedback or show off what you’ve built.