src/jarvis_proto/jarvis_cv.proto¶
-
service
JarvisVision
¶ The Jarvis Vision service provides methods for obtaining inference results for various vision models.
-
rpc GazeResponse GetGaze(GazeRequest)
Given a GazeRequest for Gaze inference, outputs a GazeResponse.
-
rpc FaceDetectResponse GetFaceDetect(FaceDetectRequest)
Given a FaceDetectRequest for FaceDetect inference, outputs a FaceDetectResponse.
-
rpc FacialLandmarksResponse GetFacialLandmarks(FacialLandmarksRequest)
Given a FacialLandmarksRequest for FacialLandmarks inference, outputs a FacialLandmarksResponse.
-
rpc BodyPoseResponse GetBodyPose(BodyPoseRequest)
Given a BodyPoseRequest for BodyPose inference, outputs a BodyPoseResponse.
-
rpc EmotionResponse GetEmotion(EmotionRequest)
Given a EmotionRequest for Emotion inference, outputs a EmotionResponse.
-
rpc HeadPoseResponse GetHeadPose(HeadPoseRequest)
Given a HeadPoseRequest for HeadPose inference, outputs a HeadPoseResponse.
-
rpc UserResponse GetUserAttributes(UserRequest)
Given a UserRequest for getting user data, outputs a UserResponse.
-
-
message
BodyPose
¶ BodyPose datastructure to be returned when there is BodyposeRequest.
-
BodyPose.Joint joints (repeated)
-
-
message BodyPose.Joint
Joint object containing location descriptor and x,y coordinate.
-
BodyPose.JointDescriptor descriptor
-
int32
x
¶
-
int32
y
¶
-
-
message
BodyPoseRequest
¶ Request for BodyPose inference needs image. Optionally, provide imageID which will be returned in the response.
Image is expected in BGR format in HWC.
-
uint64
imageID
¶ Optionally provide imageID which will be mirrored in response
-
uint64
-
message
BodyPoseResponse
¶ Response for BodyPose inference outputs bounding boxes of faces.
-
uint64
imageID
¶ ID from request
-
uint64
-
message
BoundingBox
¶ Bounding box datastructure expressed as (x,y) coordinate for top left and (w,h) for width and height with (x+w, y+h) as bottom right coordinate.
-
int32
x
¶ Top left x-coordinate
-
int32
y
¶ Top left y-coordinate
-
int32
w
¶ Width such that bottom right x-coordinate = x + w
-
int32
h
¶ Height such that bottom right y-coordinate = y + h
-
int32
-
message
Data
¶ Generic data block that can hold images or tensors.
-
bytes
buffer
¶ Buffer of bytes for data.
-
int32
shape
(repeated)¶ Shape of data used for deserialization.
-
DataType
dtype
¶ Datatype of buffer for deserialization.
-
bytes
-
message
Emotion
¶ Emotion datastructure to be returned when there is EmotionRequest.
-
BoundingBox
bbox
¶
-
Emotion.EmotionDescriptor emotion
-
BoundingBox
-
message
EmotionRequest
¶ Request for Emotion inference needs image. Optionally, provide imageID which will be returned in the response.
Image is expected in BGR format in HWC.
-
uint64
imageID
¶ Optionally provide imageID which will be mirrored in response
-
uint64
-
message
EmotionResponse
¶ Response for Emotion inference outputs list of emotions for every face detected.
-
uint64
imageID
¶ ID from request
-
uint64
-
message
FaceDetectRequest
¶ Request for FaceDetect inference needs image. Optionally, provide imageID which will be returned in the response.
Image is expected in BGR format in HWC.
-
uint64
imageID
¶ Optionally provide imageID which will be mirrored in response.
-
uint64
-
message
FaceDetectResponse
¶ Response for FaceDetect inference outputs bounding boxes of faces.
-
BoundingBox
bbox
(repeated)¶ A list of output face bounding boxes
-
uint64
imageID
¶ ID from request.
-
BoundingBox
-
message
FacialLandmarksRequest
¶ Request for FacialLandmarks inference needs image. Optionally, provide imageID which will be returned in the response. Optionally, user can provide face bounding boxes to run inference for FacialLandmarks in specific regions.
Image is expected in BGR format in HWC.
-
uint64
imageID
¶ Optionally provide imageID which will be mirrored in response.
-
BoundingBox
face_bbox
(repeated)¶ Optional input
-
uint64
-
message
FacialLandmarksResponse
¶ Response for FacialLandmarks inference outputs landmarks of (x,y) coorindates for each face.
-
uint64
imageID
¶ ID from request.
-
uint64
-
message
Gaze
¶ Gaze datastructure to be returned when there is GazeRequest.
-
double
x
¶ x-coordinate of the gaze point in camera space (millimeter)
-
double
y
¶ y-coordinate of the gaze point in camera space (millimeter)
-
double
z
¶ z-coordinate of the gaze point in camera space (millimeter)
-
double
theta
¶ Horizontal angle of the gaze point in camera space (radians)
-
double
phi
¶ Vertical angle of the gaze point in camera space (radians)
-
double
-
message
GazeRequest
¶ Request for Gaze inference needs image. Optionally, provide imageID which will be returned in the response. Optionally, user can provide face bounding boxes to run inference for Gaze in specific regions. Optionally, user can provide landmarks of (x,y) coordinates for each face to run inference for Gaze in specific regions.
Image is expected in BGR format in HWC.
-
uint64
imageID
¶ Optionally provide imageID which will be mirrored in response.
-
BoundingBox
face_bbox
(repeated)¶ Optional input
-
uint64
-
message
GazeResponse
¶ Response for Gaze inference outputs Gazes for each person.
-
uint64
imageID
¶ ID from request.
-
uint64
-
message
Head
¶ Head datastructure to be returned when there is Headpose.
-
double
x
¶ x-coordinate of the head center point in camera space (millimeter)
-
double
y
¶ y-coordinate of the head center point in camera space (millimeter)
-
double
z
¶ z-coordinate of the head center point in camera space (millimeter)
-
double
pitch
¶ Pitch angle of the head center point in camera space (degrees)
-
double
yaw
¶ Yaw angle of the head center point in camera space (degrees)
-
double
roll
¶ Roll angle of the head center point in camera space (degrees)
-
double
-
message
HeadPoseRequest
¶ Request for HeadPose inference needs image and camera parameters. Optionally, provide imageID which will be returned in the response.
Image is expected in BGR format in HWC.
-
uint64
imageID
¶ Optionally provide imageID which will be mirrored in response
-
uint64
-
message
HeadPoseResponse
¶ Response for HeadPose inference outputs points in 3D space.
-
uint64
imageID
¶ ID from request
-
uint64
-
message
UserRequest
¶ Request for User inference needs image and point cloud. Optionally, provide imageID which will be returned in the response.
Image is expected in BGR format in HWC.
-
uint64
imageID
¶ Optionally provide imageID which will be mirrored in response
-
uint64
-
message
Users
¶ User datastructure
-
BoundingBox
face
¶ Users Face Detect Result
-
BoundingBox
-
enum BodyPose.JointDescriptor
Descriptors for Joints. Default is None.
-
enumerator
NONE
= 0¶
-
enumerator
NOSE
= 1¶
-
enumerator
NECK
= 2¶
-
enumerator
RIGHT_SHOULDER
= 3¶
-
enumerator
RIGHT_ELBOW
= 4¶
-
enumerator
RIGHT_WRIST
= 5¶
-
enumerator
LEFT_SHOULDER
= 6¶
-
enumerator
LEFT_ELBOW
= 7¶
-
enumerator
LEFT_WRIST
= 8¶
-
enumerator
RIGHT_HIP
= 9¶
-
enumerator
RIGHT_KNEE
= 10¶
-
enumerator
RIGHT_ANKLE
= 11¶
-
enumerator
LEFT_HIP
= 12¶
-
enumerator
LEFT_KNEE
= 13¶
-
enumerator
LEFT_ANKLE
= 14¶
-
enumerator
RIGHT_EYE
= 15¶
-
enumerator
LEFT_EYE
= 16¶
-
enumerator
RIGHT_EAR
= 17¶
-
enumerator
LEFT_EAR
= 18¶
-
enumerator