18.1. Clara Deploy SDK Platform API Overview

The Clara Deploy SDK Platform API is provided to facilitate pipeline, pipeline job, and payload related operations. The API is based on Google’s GRPC standard which is both platform and language agnostic. This means anyone can develop clients for Clara Deploy SDK using the language of their choice, on the platform of their choice, for any platform possible.

18.1.1. Concepts

GRPC based solutions are provided as a set of remote procedure calls [RPC] grouped together in a service. Each RPC defines a request and a response message. GRPC messages are structured data definitions based on Google’s wire-type encoding solution Protocol Buffers.

When calling a service’s RPC, the requester will provide a populated request message to the RPC. In turn, the RPC will handle the communication with the remote service provider, then return a response message.

All Clara Platform API response messages will include a standard response header, which can be used to determine if the call was successful or not. Additional information will depend on the RPC used and the response message returned. See the service listing below for details regarding each RPC.

18.1.1.1. Services

The Clara Platform API provides five separate, but related, services: Clara Service, Jobs Service, Model Service, Payload Service, and Pipeline Service. Those familiar with object oriented software development will recognize the idea of separating concerns into separate interfaces.

18.1.2. Clara Service

Provides functionality related to management of the Clara Platform Server.

Remove

Requests the termination of Clara Platform Server and associated resource cleanup.
Utilization

Requests utilization data for all Clara Platform managed GPUs.
Version

Requests version information from Clara Platform Server.

18.1.3. Jobs Service

Provides functionality related to pipeline jobs such as creation, status, and inspection of pipeline jobs.

Cancel

Requests cancellation of a pending or running job by its identifier.
Create

Requests creation of a new job based on a known pipeline.
List

Requests a filtered list of all known jobs, or a list of all running jobs when no filter is provided.
ReadLogs

Requests the download of logs for an operator of a job.
Start

Requests starting a job.
Status

Requests the status of a known job by its identifier.
ClaraStop

Requests all pipeline services deployed during pipeline job initialization(s) to be halted, and associated resources released.

18.1.4. Models Service

Provide functionality related to inference model repository management.

DeleteModel

Requests the deletion of an inference model from the model repository.
DownloadModel

Request the download of an inference model from the model repository.
ListModels

Requests a list of all inference models currently available from the model repository.
UploadModel

Requests the upload of an inference model to the model repository.

18.1.5. Payloads Service

Provides functionality related to pipeline payloads such as uploading and downloading of data, and enumerating the contents of a payload.

Create

Requests the creation of a new payload.
Delete

Requests the deletion of a known payload by its identifier.
Details

Requests the details (file listing) of a known payload by its identifier.
Download

Requests the download of a blob (file) from a known payload by its identifier and path.
Remove

Requests the removal, or deletion, of a blob from a known payload.
Upload

Requests the upload of a blob (file) from a known payload. When payload type is PAYLOAD_TYPE_PIPELINE, uploads are written to the ~/input/ folder of the payload.

18.1.6. Pipelines Service

Provides functionality related to pipeline definitions such as creation, enumeration, and removal of pipeline definitions.

Create

Requests the creation of a new pipeline, based on a definition provided to the service.
Details

Requests details of a pipeline.
List

Requests a listing of all pipelines known by the service.
Remove

Requests the removal of a pipeline definition from the service.
Update

Requests an update to a pipeline definition known by the service.

18.1.6.1. Examples

All examples below are written using a kind of “pseudo-code”, meaning that none of them are expected to work “as-is”. Each example is expected to be interpreted and re-written in your language of choice using the freely downloadable GRPC tools (see Useful Links below).

18.1.7. Create a New Pipeline

Copy
Copied!

            
            // Create a pipelines service GRPC client.
var client = PipelinesServiceClient(connection_string)

// Create a request stream object.
var request_stream = client.create()

// Add each file that composes the pipeline definition
// to the request stream.
for (var filename in pipeline_filename_list)
{
  // Create a 64 kilobyte (chunk size limit) array.
  var buffer = new char[64 * 1024 / sizeof(char)];

  // Open the file for reading because we need the
  // content of the file.
  var reader = File.openRead(filename)
  int read = 0;

  // Read up to 64 KiB of data from the file, the GRPC message
  // definition assumes the native encoding of your language
  // of choice, and will handle re-encoding the content as part
  // of the send to the server.
  while ((read = reader.read(buffer, 0, buffer.Length)) > 0)
  {
    // Streams are composed of a series of messages, or chunks.
    // Create a `PipelineCreateRequest` message for each
    // chunk of the stream.
    var request = PipelineCreateRequest();

    // Create a `PipelineDefinitionFile` message and populate
    // it with the data read this time around the loop. It is
    // important to use the same `path` value for content from
    // the same file, when a file exceeds the chunk size limit.
    request.definition = PipelineDefinitionFile()
    request.header = RequestHeader()
    request.header.user_agent = "custom-client-example"
    request.definition.path = filename
    request.definition.content = String(buffer, 0, read)

    request_stream.append(request);
  }
}

var response = request_stream.close()

18.1.8. Get a List of Existing Pipelines

Copy
Copied!

            
            // Create a pipelines service GRPC client.
var client = PipelineServiceClient(connection_string)

// Create a pipelines list request object.
var request = PipelinesListRequest()
request.header = RequestHeader()
request.header.user_agent = "custom-client-example"

// Request data from Clara Deploy SDK.
var response_stream = client.list();

// Loop over the response messages.
// Each pipeline registered with Clara Deploy SDK will
// be described in a separate message.
while (var response = response_stream.read_next())
{
  print("Pipeline {response.details.name} ({response.details.pipeline_id})\n")
}

18.1.9. Create and Start a Job

Copy
Copied!

            
            // Create a jobs service GRPC client.
var jobs_client = JobsServiceClient(connection_string)

// Create a jobs create request object.
var create_request = JobsCreateRequest()
create_request.header = RequestHeader()
create_request.header.user_agent = "custom-client-example"
// Assign the local `job_pipeline_id` value to the request's `pipeline_id`.
create_request.pipeline_id = job_pipeline_id
// Assign the local `job_name` value to the request's `name`.
create_request.name = job_name;

// Use the client to create the job, capturing the response from the client.
var create_response = jobs_client.Create(create_request)

// Clara Deploy SDK will send a `ResponseHeader` along with its response.
// When the header's response code value is less than zero, an error has occurred.
if (create_response.header.code < 0)
  throw error("Failed to create job. Clara Deploy SDK responded with an error ({create_response.header.code}).")

// Capture the job and payload identifiers
var job_id = create_response.job_id
var payload_id = create_response.payload_id

// Since the pipeline has data it needs as input
// we need to upload that data to payload prior to
// starting the job.
var payloads_client = PayloadsServiceClient(connection_string)

// Use the client to create an upload request stream.
var request_stream = payloads_client.upload()

// Allocate a buffer for copying data to each stream chunk.
// We'll use 64 KiB because that's vert close to the request
// message size limit.
var buffer = new byte[64 * 1024]

// Loop over all of the files we need to push to
// Clara Deploy SDK prior to starting the job.
for (var local_file_path in job_input_files)
{
  // Open the file and get its size,
  var reader = file.open_read(local_file_path)
  var file_size = file.get_size(local_file_path)
  var read = 0;

  // Read the file in 64 KiB sized chunks, and send each one
  // to Clara Deploy SDK to store in the pipeline's input folder.
  while ((read = reader.read(buffer, 0, buffer.length)) > 0)
  {
    // Clara Deploy SDK only want the name of the file. Any
    // included path information could cause issues for the
    // pipeline operators.
    var file_name = file.get_name(local_file_path)

    // Create the request message. Notice that at least one
    // message per input file will be created. Multiple messages
    // will be created for larger files.
    // Clara Deploy SDK will reassemble larger files base on
    // the file name in the request's `details` message.
    var request = PayloadUploadRequest()
    request.header = RequestHeader()
    request.header.user_agent = "custom-client-example"
    request.payload_id = payload_id
    request.details = PayloadFileDetails()
    request.details.size = file_size
    request.details.name = file_name
    request.data = buffer.range(0, read)

    request_stream.append(request)
  }
}

// Close the request to inform Clara Deploy SDK that no more
// input data will be coming.
request_stream.close()

// Now it is time to actually start the job.
// Clara Deploy SDK will not start the job immediately, but
// will queue the job if there are currently insufficient
// resources available.
var start_request = JobsStartRequest()
start_request.header = RequestHeader()
start_request.header.user_agent = "custom-client-example"
start_request.job_id = job_id

var start_response = jobs_client.Start(start_request)

if (start_response.header.code < 0)
  throw error("Failed to start job ({job_id}). Clara Deploy SDK responded with an error ({start_response.header.code}).")

18.1.10. Get Status of a Job

Copy
Copied!

            
            // Create a jobs service GRPC client.
var client = JobsServiceClient(connection_string)

// Create a jobs service status request object.
var request = JobsStatusRequest()
request.header = RequestHeader()
request.header.user_agent = "custom-client-example"
request.job_id = my_job_id

// Send to the request to Clara Deploy SDK and
// receive a response message back.
var response = client.Status(request)

// Check the response header for an error code.
if (response.header.code < 0)
  throw error("Failed to get job status ({job_id}). Clara Deploy SDK responded with error ({response.header.code}).")

// Print out the details of the job status response.
print("job status\n")
print("  name: {response.name}\n")
print("  id: {response.job_id}\n")
print("  payload: {response.payload_id}\n")
print("  pipeline: {response.pipeline_id}\n")
print("  state: {response.state}\n")
print("  status: {response.status}\n")

if (response.messages.count > 0)
{
  print("  messages:\n")

  for (var message in response.messages)
  {
    print("    {message}\n")
  }
}

18.1.11. Get a File Listing from a Payload

Copy
Copied!

            
            // Create a payloads service GRPC client.
var client = PayloadsServiceClient(connection_string)

// Create a payloads details request object.
var request = PayloadsDetailsRequest()
request.header = RequestHeader()
request.header.user_agent = "custom-client-examples"
request.payload_id = my_payload_id

// Request the data from Clara Deploy SDK
var response_stream = client.Details(request)

// Loop over each response message sent.
// Each file in the payload will have a corresponding
// message in the response.
while (var response = response_stream.read_next())
{
  print("  {response.file.name}    {response.file.size}\n")
}

18.1.12. Download a File from a Payload

Copy
Copied!

            
            // Create a payloads service GRPC client.
var client = PayloadsServiceClient(connection_string)

// Create a payloads download request object.
// The payload's identifier and the name of the file
// contained by the payload are required to download
// a payload file's contents.
// See the example above to see how to get the names
// of files contained in a payload.
var request = PayloadsDownloadRequest()
request.header = RequestHeader()
request.header.user_agent = "custom-client-examples"
request.payload_id = my_payload_id
request.name = payload_file_name

var response_stream = client.Download(request)

if (response.header.code < 0)
  throw error("Failed to download file. Clara Deploy SDK returned error ({response.header.code})")

// Create a local file for the download data.
var writer = file.create(local_file_name)

// Loop over the stream response messages. Each
// message will be a chunk of the file. Appending
// each in order will restore the file locally.
while (var response = response_stream.read_next())
{
  writer.write(response.data)
}

writer.close()

18.1.12.1. Useful Links

18.1.13. Introduction to GRPC and Protobuf

GRPC Guides
GRPC Tutorials in different languages

( C++, C#, Go, Java, Python )
GRPC Reference Documentation

( C++, C#, Go, Java, Python )
Protocol Buffers Tutorials in different languages

( C++, C#, Go, Java, Python )

18.1.14. GRPC Examples

Official Github Repository