documents

Creates, updates, deletes, gets or lists a documents resource.

Overview

Name: documents
Type: Resource
Id: google.contentwarehouse.documents

Fields

| Name | Datatype | Description |
|---|---|---|
| name | string | The resource name of the document. Format: projects/{project_number}/locations/{location}/documents/{document_id}. The name is ignored when creating a document. |
| cloudAiDocument | object | Document represents the canonical document resource in Document AI. It is an interchange format that provides insights into documents and allows for collaboration between users and Document AI to iterate and optimize for quality. |
| contentCategory | string | Indicates the category (image, audio, video, etc.) of the original content. |
| createTime | string | Output only. The time when the document was created. |
| creator | string | The user who created the document. |
| displayName | string | Required. Display name of the document given by the user. This name will be displayed in the UI. Customers can populate this field with the name of the document. This differs from the 'title' field as 'title' is optional and stores the top heading in the document. |
| displayUri | string | URI to display the document, for example, in the UI. |
| dispositionTime | string | Output only. If linked to a Collection with RetentionPolicy, the date when the document becomes mutable. |
| documentSchemaName | string | The Document schema name. Format: projects/{project_number}/locations/{location}/documentSchemas/{document_schema_id}. |
| inlineRawDocument | string | Raw document content. |
| legalHold | boolean | Output only. Indicates if the document has a legal hold on it. |
| plainText | string | Other document format, such as PPTX, XLSX. |
| properties | array | List of values that are user-supplied metadata. |
| rawDocumentFileType | string | This is used when DocAI was not used to load the document and parsing/extracting is needed for the inline_raw_document. For example, if inline_raw_document is the byte representation of a PDF file, then this should be set to: RAW_DOCUMENT_FILE_TYPE_PDF. |
| rawDocumentPath | string | Raw document file in Cloud Storage path. |
| referenceId | string | The reference ID set by customers. Must be unique per project and location. |
| textExtractionDisabled | boolean | If true, text extraction will not be performed. |
| textExtractionEnabled | boolean | If true, text extraction will be performed. |
| title | string | Title that describes the document. This can be the top heading or text that describes the document. |
| updateTime | string | Output only. The time when the document was last updated. |
| updater | string | The user who last updated the document. |
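
The `name` format given in the table is a plain slash-delimited path, so it can be assembled and split with ordinary string handling. The following is a minimal Python sketch under that assumption; the helper names `build_document_name` and `parse_document_name` are hypothetical and do not come from any Google or StackQL library.

```python
import re

# Format taken from the `name` field description in the Fields table.
_NAME_PATTERN = re.compile(
    r"^projects/(?P<project_number>[^/]+)"
    r"/locations/(?P<location>[^/]+)"
    r"/documents/(?P<document_id>[^/]+)$"
)


def build_document_name(project_number: str, location: str, document_id: str) -> str:
    """Assemble a document resource name from its three components."""
    for part in (project_number, location, document_id):
        # Each component must be non-empty and must not contain a slash.
        if not part or "/" in part:
            raise ValueError(f"invalid path component: {part!r}")
    return f"projects/{project_number}/locations/{location}/documents/{document_id}"


def parse_document_name(name: str) -> dict:
    """Split a document resource name into its named components."""
    match = _NAME_PATTERN.match(name)
    if match is None:
        raise ValueError(f"not a document resource name: {name!r}")
    return match.groupdict()
```

The last path segment returned by `parse_document_name` is presumably the value to pass as `documentsId` in the queries on this page, although the page itself does not state that mapping.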

Methods

| Name | Accessible by | Required Params | Description |
|---|---|---|---|
| get | SELECT | documentsId, locationsId, projectsId | Gets a document. Returns NOT_FOUND if the document does not exist. |
| create | INSERT | locationsId, projectsId | Creates a document. |
| delete | DELETE | documentsId, locationsId, projectsId | Deletes a document. Returns NOT_FOUND if the document does not exist. |
| patch | UPDATE | documentsId, locationsId, projectsId | Updates a document. Returns INVALID_ARGUMENT if the name of the document is non-empty and does not equal the existing name. |
| linked_sources | EXEC | documentsId, locationsId, projectsId | Returns all source document-links from the document. |
| linked_targets | EXEC | documentsId, locationsId, projectsId | Returns all target document-links from the document. |
| lock | EXEC | documentsId, locationsId, projectsId | Locks the document so that it cannot be updated by other users. |
| search | EXEC | locationsId, projectsId | Searches for documents using the provided SearchDocumentsRequest. This call only returns documents that the caller has permission to search against. |
| set_acl | EXEC | documentsId, locationsId, projectsId | Sets the access control policy for a resource. Replaces any existing policy. |
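
Because each method lists its required parameters, a caller can check a parameter set before issuing a query. The Python sketch below copies the required parameters from the Methods table; the `REQUIRED_PARAMS` mapping and the `missing_params` helper are hypothetical names used only for illustration.

```python
# Required parameters per method, copied from the Methods table.
_DOCUMENT_SCOPED = frozenset({"documentsId", "locationsId", "projectsId"})
_LOCATION_SCOPED = frozenset({"locationsId", "projectsId"})

REQUIRED_PARAMS = {
    "get": _DOCUMENT_SCOPED,
    "create": _LOCATION_SCOPED,
    "delete": _DOCUMENT_SCOPED,
    "patch": _DOCUMENT_SCOPED,
    "linked_sources": _DOCUMENT_SCOPED,
    "linked_targets": _DOCUMENT_SCOPED,
    "lock": _DOCUMENT_SCOPED,
    "search": _LOCATION_SCOPED,
    "set_acl": _DOCUMENT_SCOPED,
}


def missing_params(method: str, supplied) -> list:
    """Return the required parameters absent from `supplied`, sorted by name."""
    if method not in REQUIRED_PARAMS:
        raise ValueError(f"unknown method: {method!r}")
    # `supplied` may be a dict (its keys are used) or any iterable of names.
    return sorted(REQUIRED_PARAMS[method] - set(supplied))
```

For example, `missing_params("get", {"projectsId": "123456"})` reports that `documentsId` and `locationsId` still need values.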

SELECT examples

Gets a document. Returns NOT_FOUND if the document does not exist.

SELECT
name,
cloudAiDocument,
contentCategory,
createTime,
creator,
displayName,
displayUri,
dispositionTime,
documentSchemaName,
inlineRawDocument,
legalHold,
plainText,
properties,
rawDocumentFileType,
rawDocumentPath,
referenceId,
textExtractionDisabled,
textExtractionEnabled,
title,
updateTime,
updater
FROM google.contentwarehouse.documents
WHERE documentsId = '{{ documentsId }}'
AND locationsId = '{{ locationsId }}'
AND projectsId = '{{ projectsId }}';
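
The `{{ ... }}` placeholders in the query above stand for caller-supplied values. As a rough illustration of how such a template could be filled in before the query is submitted, here is a small Python sketch. It is plain string substitution, not StackQL's own variable mechanism; the `render_query` helper and the sample values are hypothetical, and doubling single quotes is assumed to be the appropriate escaping for string literals.

```python
import re

# Matches placeholders of the form {{ name }}, with optional inner whitespace.
_PLACEHOLDER = re.compile(r"\{\{\s*(\w+)\s*\}\}")


def render_query(template: str, values: dict) -> str:
    """Replace each {{ name }} placeholder with its value from `values`."""

    def substitute(match: re.Match) -> str:
        key = match.group(1)
        if key not in values:
            raise KeyError(f"no value supplied for placeholder {key!r}")
        # Double single quotes so the value stays inside its SQL string literal.
        return str(values[key]).replace("'", "''")

    return _PLACEHOLDER.sub(substitute, template)


# A shortened version of the SELECT above, selecting three of the listed fields.
template = """SELECT name, displayName, updateTime
FROM google.contentwarehouse.documents
WHERE documentsId = '{{ documentsId }}'
AND locationsId = '{{ locationsId }}'
AND projectsId = '{{ projectsId }}';"""

# Hypothetical identifiers, for illustration only.
query = render_query(
    template,
    {"documentsId": "example-doc", "locationsId": "us", "projectsId": "123456"},
)
print(query)
```

Raising on a missing key, rather than leaving the placeholder in place, keeps a half-rendered query from being sent by accident.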

INSERT example

Use the following StackQL query to create a new documents resource.

/*+ create */
INSERT INTO google.contentwarehouse.documents (
locationsId,
projectsId,
cloudAiDocumentOption,
requestMetadata,
policy,
createMask,
document
)
SELECT
'{{ locationsId }}',
'{{ projectsId }}',
'{{ cloudAiDocumentOption }}',
'{{ requestMetadata }}',
'{{ policy }}',
'{{ createMask }}',
'{{ document }}'
;
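
The `document` parameter carries the document object described in the Fields table. The Python sketch below shows one way such a value might be prepared, assuming the parameter accepts a JSON-serialized object whose keys match the field names above; that assumption is not confirmed by this page. The `build_document_payload` helper and all sample values are hypothetical.

```python
import json

# Fields marked "Output only" in the Fields table; assumed to be set by the server.
OUTPUT_ONLY_FIELDS = {"createTime", "updateTime", "dispositionTime", "legalHold"}


def build_document_payload(display_name: str, **fields) -> str:
    """Serialize a document object as JSON for the `document` parameter."""
    # displayName is marked "Required" in the Fields table.
    if not display_name:
        raise ValueError("displayName is required")
    rejected = OUTPUT_ONLY_FIELDS.intersection(fields)
    if rejected:
        raise ValueError(f"output-only fields cannot be set: {sorted(rejected)}")
    document = {"displayName": display_name, **fields}
    return json.dumps(document, sort_keys=True)


# Hypothetical values throughout. `name` is omitted because the Fields table
# says it is ignored when creating a document.
payload = build_document_payload(
    "Quarterly report",
    title="Q3 results",
    plainText="Revenue grew in the third quarter.",
    referenceId="report-q3",
)
print(payload)
```

The resulting string would take the place of `{{ document }}` in the INSERT statement above.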

UPDATE example

Updates a documents resource.

/*+ update */
UPDATE google.contentwarehouse.documents
SET
cloudAiDocumentOption = '{{ cloudAiDocumentOption }}',
requestMetadata = '{{ requestMetadata }}',
document = '{{ document }}',
updateOptions = '{{ updateOptions }}'
WHERE
documentsId = '{{ documentsId }}'
AND locationsId = '{{ locationsId }}'
AND projectsId = '{{ projectsId }}';
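
According to the Methods table, patch returns INVALID_ARGUMENT if the document name in the request is non-empty and differs from the existing name. A caller could apply the same rule locally before sending the update. This is a minimal Python sketch of that check; `check_patch_name` is a hypothetical helper, and the server remains the authority on what it accepts.

```python
def check_patch_name(existing_name: str, payload_name: str = "") -> None:
    """Raise ValueError if `payload_name` would violate the stated patch rule.

    An empty name is allowed; a non-empty name must equal the existing one.
    """
    if payload_name and payload_name != existing_name:
        raise ValueError(
            f"document name {payload_name!r} does not match existing name {existing_name!r}"
        )


# Hypothetical resource name, for illustration only.
existing = "projects/123456/locations/us/documents/example-doc"
check_patch_name(existing)            # empty name: accepted
check_patch_name(existing, existing)  # identical name: accepted
```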

DELETE example

Deletes the specified documents resource.

/*+ delete */
DELETE FROM google.contentwarehouse.documents
WHERE documentsId = '{{ documentsId }}'
AND locationsId = '{{ locationsId }}'
AND projectsId = '{{ projectsId }}';