Intellectual Property

USPTO Patent Analysis Assistant - USA

Using data from the United States Patent and Trademark Office (USPTO), this AI embedding identifies patterns, trends, and key insights in patent data, enhancing patent analysis and strategy formulation.

Avatar of Author

EmbedElite

Rating: 5/5

Data Amount

2 TB (full-text)

Data Freshness

Updated July 2023

Use Cases

Legal Intellectual Property

Category

Intellectual Property

Sign up to get access
Example output

Can you locate any US patents filed within the last 5 years, which discuss applying convolutional neural networks for image recognition in medical devices, even if they do not explicitly use these terms but the implied meaning aligns with this search?

Yes, I have found 3 patents filed recently related to your query...

1. Patent US-20230162428-A1: "APPARATUS AND METHOD FOR ACCELERATION DATA STRUCTURE REFIT" (provided on 2023-05-25 by APODACA; Michael et al.)
View here

2. Patent US-20230157618-A1: "Method and System to Assess Pulmonary Hypertension Using Phase Space Tomography and Machine Learning" (provided on 2023-05-25 by Grouchy; Paul et al.)
View here

3. Patent US-20230162043-A1: "SYSTEMS AND METHODS FOR MAGNETIC RESONANCE IMAGING STANDARDIZATION USING DEEP LEARNING" (provided on 2023-05-25 by Zhang; Tao et al.)
View here

Patent Analysis Assistant - USA

How is the patent data indexed and embeddings created?

How do we keep the data updated?

Our system is integrated with USPTO Bulk Data Storage System (BDSS). Every week, a process runs to check for new patents or updates to existing patents. New and updated data is preprocessed, indexed, and embeddings are generated before it’s stored in our vector database. This ensures our AI embeddings always have the freshest patent data.

How to use

Python Code

You can access the embedding using EmbedElite python package or curl request.

# this returns a list of embeddings
from embedelite import load_embedding

embeddings = load_embedding("uspto-patents")
print(len(embeddings))
print(embeddings[1])

# this returns an object which can be directly inserted into Qdrant
result = load_embedding("uspto-patents", embed_for="qdrant")
# result is {"embeddings": [], "documents": [] "ids": []}
print(result["embeddings"])
print(result["documents"])
print(result["ids"])

Curl

Request

curl -X POST -H "Content-Type: application/json" -d '{
"doc_id": "uspto-patents"
}' https://api.embedelite.com/v1/embeddings/download/

Response

{
    "mappings": {
        "properties": {
        "doc_source": {"type": "keyword"},
        "sentence": { "type": "text" },
        "embeddings": { "type": "dense_vector", "dims": 1536, "index": False },
        "doc_source": {"type": "keyword"}
        }
    }

Data credit to USPTO Bulk Data Storage System (BDSS) without which this embedding wouldn’t be possible.