AI Legal Q&A

Do I have to get permission before training a small AI model on copyrighted photos I bought online?

PA - Pennsylvania 6 min read
X LinkedIn Reddit Bluesky

Short Answer

In general, buying a copyrighted photo online does not automatically give you the right to use that photo for every purpose. It usually gives you only the rights that were included in the purchase or license, and those rights may be limited. If you want to train a small AI model on the photos, permission may be required depending on the license terms, the way the photos were obtained, and how the training use is characterized under copyright law.

A key issue is that “owning” a copy of a photo is not the same as owning the copyright. The photographer or rights holder may still control reproduction, distribution, and some other uses. Training an AI model often involves making copies of images, at least temporarily, and that can raise copyright questions. Whether that use is allowed may depend on the license, any terms of service, and broader copyright defenses that may or may not apply.

Another important factor is the source of the images. Photos bought from a stock site, marketplace, or app may come with a license that allows some personal, editorial, or commercial uses but restricts machine learning, scraping, redistribution, or derivative uses. If the license is unclear, narrow, or silent about AI training, that does not necessarily mean the use is allowed. It may mean the issue is unresolved and needs careful review.

There is also a distinction between training on a very small internal dataset and using images to build a commercial product. Even a small model can raise copyright concerns if it uses copyrighted works in a way not covered by the license. On the other hand, some uses may be more defensible than others depending on the facts, including the purpose of the use, whether the copies are kept, and whether the model is exposed to the public.

Because copyright and contract issues can overlap, the safest general approach is to read the license terms closely before training on bought photos. In Pennsylvania, as in other states, federal copyright law is usually the main legal framework, but contract claims and state-law issues may also matter. If the license is unclear or the project has commercial value, it is often wise to get legal review before proceeding.

What This Question Usually Means

People usually ask this when they have legally purchased or downloaded photos online and want to use them as training data for an AI or machine-learning model. The real question is often whether the purchase included a license broad enough to cover training, whether the act of training creates copies that implicate copyright law, and whether any contract restrictions on the website or marketplace apply. The question may also include concern about whether a small, private, or noncommercial model is treated differently from a public or commercial model.

Key Factors

What rights were actually purchased

The most important factor is whether the transaction gave you only a copy of the photo or also a license to use it in training. A purchase often does not equal full ownership of the copyright.

License language and site terms

Stock-photo licenses, marketplace terms, and website terms of service often control permitted uses. If they restrict machine learning, data mining, copying, or derivative works, permission may be needed.

How the training process uses the photos

Training may require making copies, resizing files, caching images, or storing them on servers. Those acts can matter under copyright law even if the final model does not display the original photos.

Commercial versus noncommercial use

Some licenses treat commercial and personal use differently. A private, experimental model may be viewed differently from a product or service offered to others, though that does not automatically make the use lawful.

Whether the photos are copyrighted and who owns them

Most original photographs are protected by copyright. If the photos are in the public domain or licensed under terms allowing training, the analysis may be different.

Whether the use might fall within a legal defense

Some AI training arguments may rely on copyright defenses such as fair use, but that is a fact-specific analysis and not something to assume. A defense may or may not apply depending on the circumstances.

Possible contract claims

Even if copyright law is uncertain, a website or seller may claim breach of contract if you use the images outside the license terms. That is why the purchase agreement matters.

When to Talk to a Lawyer

You may want to talk to a lawyer if the photos are part of a commercial product, if the license is unclear or silent about AI training, if the dataset includes many third-party images, if the site terms restrict automated use, or if you received a takedown notice or other complaint. A lawyer can also help if your project involves Pennsylvania contract issues, employee-created data, privacy concerns, or potential exposure in multiple states. Because AI and copyright issues are evolving, legal review is especially useful before launch rather than after a dispute arises.

Find Pennsylvania Lawyers

Browse lawyer profiles in Pennsylvania before deciding who to contact about your situation.

Find Pennsylvania Lawyers

Questions to Ask an Attorney

  • Does this license allow training or other machine-learning uses?
  • Could the training process be considered copying under copyright law?
  • Do the website terms create contract restrictions beyond copyright?
  • Would the analysis differ if the model is only for internal use and not public release?
  • What records should I keep to show where the images came from and what rights I obtained?
  • Are there Pennsylvania contract issues that could matter even though copyright is federal law?
  • Would using a different dataset or a different license reduce the risk?
  • Are there other issues, such as privacy, publicity rights, or consumer-protection concerns, that I should consider?

Documents and Evidence

Purchase receipt or order confirmation

It can help show what was bought, when the transaction occurred, and which seller or platform was involved.

License agreement or download terms

This is often the most important document because it may define what uses are allowed or prohibited.

Website terms of service in effect at the time of purchase

Online terms may include restrictions on automated processing, copying, redistribution, or AI-related uses.

Screenshots or saved copies of the product page

These may help preserve the exact wording shown to you before or during the transaction.

Dataset inventory

A list of every image source can help show which files are covered by which license and which may need separate review.

Internal project notes about the model’s purpose and use

These notes may matter when evaluating whether the use was personal, research-related, or commercial.

Legal Disclaimer

This page is for general legal information only and is not legal advice. It does not create an attorney-client relationship. Laws and procedures may change and may vary by jurisdiction. You should talk to a qualified attorney licensed in your jurisdiction about your specific situation.

Community Replies

Users and attorneys can reply here with general information, experience, or attorney commentary.

0 replies

Members can post a User Comment. Verified attorneys can also post an Attorney Commentary.

No replies yet.
Top