A key step in developing an Edge-AI camera on an open camera is deploying a deep learning neural network model at the edge. The model may come from Caffe*, TensorFlow*, MXNet*, Kaldi*, PaddlePaddle*, or ONNX*.
OpenNCC SDK installation
Please install the SDK development environment according to the documentation, for example:
'YOUR OPENNCC SDK INSTALL PATH'/Platform/Linux/readme.md
This demo uses Linux; other platforms are also well supported by OpenNCC.
If you do not have the SDK yet, you can clone it from the OpenNCC repository on GitHub.
To use the Model Optimizer and the BLOB converter, make sure the OpenVINO Toolkit is already installed. You can download OpenVINO from its download page. You MUST choose the 2020.3 LTS version, which has been comprehensively tested with OpenNCC.
If you want to download a model from the Open Model Zoo, you need the Model Downloader. Alternatively, you can download the model from the website.
Steps to deploy a deep learning model at the edge
The following figure shows a complete AI model development process:
If we start with an existing model, the steps are simplified as follows:
Step1: Prepare a trained model
Use the Open Model Zoo to find open-source, pretrained, and preoptimized models ready for inference, or use your own deep-learning model.
Download a model from the Open Model Zoo, for example:
$ ./downloader.py --name person-detection-retail-0002
Note: OpenNCC supports the FP16 format, so please use an FP16 model.
Step2: Model Optimizer
Run the trained model through the Model Optimizer to convert the model to an Intermediate Representation (IR), which is represented in a pair of files (.xml and .bin). These files describe the network topology and contain the weights and biases binary data of the model.
If you downloaded the model from the Open Model Zoo, it is already in IR format, so you do not need to run the Model Optimizer.
If you use your own model, you need to run the Model Optimizer. Please follow the Model Optimizer Developer Guide.
Step3: Convert to BLOB format
Once model optimization is complete, that is, once you have the two IR files (.xml and .bin), the model must be converted to the BLOB format before it can be deployed on OpenNCC.
Run the myriad_compile tool to package the IR files into a BLOB file. OpenNCC uses the BLOB file to run inference on the model.
Example:
$ ./myriad_compile -m input_xxx-fp16.xml -o output_xxx.blob -VPU_PLATFORM VPU_2480 -VPU_NUMBER_OF_SHAVES 8 -VPU_NUMBER_OF_CMX_SLICES 8
Note:
myriad_compile is a tool in the OpenVINO Toolkit, so you need to install OpenVINO on your development host. The tool is located at:
/opt/intel/openvino/deployment_tools/inference_engine/lib/intel64/myriad_compile
Step4: Inference on OpenNCC and extract the results
Use the OpenNCC SDK to download the BLOB file to the camera, run inference, and output the results from the OpenNCC camera.
The OpenNCC SDK outputs two types of streams:
* Normal video stream, which supports YUV420, YUV422, MJPG, H.264, and H.265.
* AI-Meta data stream, which carries the binary inference results as frame-based data. The specific output structure depends on the model running on OpenNCC. Take the person-detection-retail-0002 model as an example:
Outputs:
The net outputs "detection_output" blob with shape: [1x1xNx7], where N is the number of detected pedestrians. For each detection, the description has the format: [image_id, label, conf, x_min, y_min, x_max, y_max], where:
image_id - ID of image in batch
label - ID of predicted class
conf - Confidence for the predicted class
(x_min, y_min) - Coordinates of the top left bounding box corner
(x_max, y_max) - Coordinates of the bottom right bounding box corner.
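The seven-value layout described above can be sketched as a small C++ structure. This is only an illustration: `Detection`, `PixelBox`, `parse_detections`, and `to_pixels` are hypothetical names that are not part of the OpenNCC SDK, the input is assumed to be already converted to float, and the coordinates are assumed to be normalized to the range [0, 1].

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <vector>

// One row of the [1x1xNx7] "detection_output" blob (values already float).
struct Detection {
    int   image_id;
    int   label;
    float conf;
    float x_min, y_min, x_max, y_max;
};

// Pixel-space rectangle (hypothetical helper type, not an SDK type).
struct PixelBox {
    int left, top, right, bottom;
};

// Interpret `count` consecutive 7-float rows as detections.
std::vector<Detection> parse_detections(const float* data, std::size_t count) {
    assert(data != nullptr || count == 0);
    std::vector<Detection> out;
    out.reserve(count);
    for (std::size_t i = 0; i < count; ++i) {
        const float* row = data + i * 7;
        out.push_back({static_cast<int>(row[0]), static_cast<int>(row[1]),
                       row[2], row[3], row[4], row[5], row[6]});
    }
    return out;
}

// Scale one coordinate to pixels and clamp it to [0, size - 1].
// ASSUMPTION: `v` is normalized to [0, 1].
int scale_coord(float v, int size) {
    const int p = static_cast<int>(v * static_cast<float>(size));
    return std::max(0, std::min(p, size - 1));
}

// Map a detection to a rectangle inside a width x height frame.
PixelBox to_pixels(const Detection& d, int width, int height) {
    return {scale_coord(d.x_min, width), scale_coord(d.y_min, height),
            scale_coord(d.x_max, width), scale_coord(d.y_max, height)};
}
```

For a 1920x1080 frame, a detection with corners (0.25, 0.5) and (0.75, 1.0) would map to the rectangle (480, 540) to (1440, 1079), because the bottom edge is clamped to the last row.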
So an AI-Meta frame you get from the SDK would look like this:
Demo and Running
1. Enter 'YOUR OPENNCC SDK INSTALL PATH'/Platform/Linux/Example/How_to/Load_a_model
2. Copy all the related files from the SDK to this demo.
$ ./copy.sh
3. Download the BLOB model file.
In main.cpp:
........
// 5.2 Image preprocessing parameter initialization
cam_info.inputDimWidth = 300;
cam_info.inputDimHeight = 300;
cam_info.inputFormat = IMG_FORMAT_BGR_PLANAR;
cam_info.meanValue[0] = 0;
cam_info.meanValue[1] = 0;
cam_info.meanValue[2] = 0;
cam_info.stdValue = 1;
// 5.3 Blob file path
const char *blob = "./blob/2020.3/object_classification/object_classification.blob";
// 6. SDK initialization; this also downloads the model to the OpenNCC
ret = sdk_init(NULL, NULL, (char*) blob, &cam_info, sizeof(cam_info));
printf("sdk_init %d\n", ret);
if (ret < 0)
    return -1;
If your OpenNCC version does not have Flash or eMMC on board, we need to load the BLOB model file onto the OpenNCC.
const char *blob = "./blob/2020.3/object_classification/object_classification.blob";
This passes the path of the BLOB model file that you want to run on OpenNCC to the SDK. You can change the BLOB file to your own.
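To make the preprocessing parameters in main.cpp more concrete, here is a host-side sketch of what a planar BGR input with a mean and a standard value could mean. It is an illustration only: the camera performs preprocessing on the device, `bgr_to_planar` is a hypothetical helper that is not part of the SDK, and the formula (pixel - mean) / std is an assumption about how meanValue and stdValue are applied.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Convert interleaved BGR (B0 G0 R0 B1 G1 R1 ...) to planar BGR
// (all B values, then all G values, then all R values) and normalize.
// ASSUMPTION: normalization is (pixel - mean[channel]) / std_value.
std::vector<float> bgr_to_planar(const std::vector<std::uint8_t>& interleaved,
                                 std::size_t width, std::size_t height,
                                 const float mean[3], float std_value) {
    assert(interleaved.size() == width * height * 3);
    assert(std_value != 0.0f);
    const std::size_t plane = width * height;  // elements per channel plane
    std::vector<float> planar(plane * 3);
    for (std::size_t px = 0; px < plane; ++px) {
        for (std::size_t c = 0; c < 3; ++c) {
            const float value = static_cast<float>(interleaved[px * 3 + c]);
            planar[c * plane + px] = (value - mean[c]) / std_value;
        }
    }
    return planar;
}
```

With the values from main.cpp (mean 0 and std 1), this sketch leaves pixel values unchanged and only reorders them into three planes.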
4. Get the AI-meta data frame
// Non-blocking read of the metadata
max_read_size = 512*1024;
memset(recv_metedata, 0, max_read_size);
if (read_meta_data(recv_metedata, &max_read_size, false) == 0)
{
    memcpy(&hdr, recv_metedata, sizeof(frameSpecOut));
    printf("metehdr:type:%2d,seqNo:%-6d,size %d, NCC_T:(%dMS)\n", hdr.type, hdr.seqNo, hdr.size, hdr.res[0]);
}
5. Post-process to extract the results
5.1 Skip the OpenNCC header
memedata = (char*) recv_metedata + sizeof(frameSpecOut) + 16*sizeof(int);
obj_show_img_func(yuv420p, cameraCfg.camWidth, cameraCfg.camHeight, scale, src, 1, &cam_info, memedata, min_score);
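The header skip above is plain pointer arithmetic, so it can help to check the received size before using the result. The following is a minimal sketch under stated assumptions: `skip_meta_header` and `complete_rows` are hypothetical helpers, not SDK functions, and `header_bytes` stands for the demo's `sizeof(frameSpecOut) + 16*sizeof(int)`. It also assumes that you know how many bytes were actually received, for example from the size field of the header.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>

// Return a pointer to the model output inside a received meta frame,
// or nullptr if the frame is too short to hold anything after the header.
const char* skip_meta_header(const char* frame, std::size_t frame_bytes,
                             std::size_t header_bytes) {
    assert(frame != nullptr);
    if (frame_bytes <= header_bytes) {
        return nullptr;
    }
    return frame + header_bytes;
}

// Number of complete detection rows (7 FP16 values each) after the header.
std::size_t complete_rows(std::size_t frame_bytes, std::size_t header_bytes) {
    if (frame_bytes <= header_bytes) {
        return 0;
    }
    return (frame_bytes - header_bytes) / (7 * sizeof(std::uint16_t));
}
```

The row count could then be used as an upper bound for the extraction loop, in addition to the fixed item limit used by the demo.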
5.2 Extract the results in obj_demo_show.cpp
....
#define MAX_INTEMS_RESULTS 200
....
uint16_t* cls_ret = (uint16_t*)nnret; // nnret is the pointer to the AI-meta data
....
for (i = 0; i < MAX_INTEMS_RESULTS; i++)
{
    // The VPU outputs FP16, so each value must be converted to FP32.
    int image_id = (int)(f16Tof32(cls_ret[i*7+0]));
    int label = (int)(f16Tof32(cls_ret[i*7+1]));
    score = (float)f16Tof32(cls_ret[i*7+2]);
    // A negative image_id marks the end of the valid results.
    if (image_id < 0) {
        break;
    }
    x0 = f16Tof32(cls_ret[i*7+3]);
    y0 = f16Tof32(cls_ret[i*7+4]);
    x1 = f16Tof32(cls_ret[i*7+5]);
    y1 = f16Tof32(cls_ret[i*7+6]);
}
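The extraction loop depends on an FP16 to FP32 conversion. The demo's `f16Tof32` is not shown here, so the sketch below uses a stand-in, `half_to_float`, that decodes IEEE 754 half-precision bits. `Box` and `decode_boxes` are likewise hypothetical names, and the score filter is an assumption about how `min_score` might be applied.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstring>
#include <vector>

// Convert one IEEE 754 half-precision value (raw 16 bits) to float.
// Stand-in for the demo's f16Tof32, whose implementation is not shown.
float half_to_float(std::uint16_t h) {
    const std::uint32_t sign = static_cast<std::uint32_t>(h & 0x8000u) << 16;
    const std::uint32_t exp = (h >> 10) & 0x1Fu;
    std::uint32_t mant = h & 0x3FFu;
    std::uint32_t bits;
    if (exp == 0x1Fu) {                 // infinity or NaN
        bits = sign | 0x7F800000u | (mant << 13);
    } else if (exp != 0) {              // normal number: rebias 15 -> 127
        bits = sign | ((exp + 112u) << 23) | (mant << 13);
    } else if (mant == 0) {             // signed zero
        bits = sign;
    } else {                            // subnormal: shift until normalized
        std::uint32_t e = 113u;
        while ((mant & 0x400u) == 0) {
            mant <<= 1;
            --e;
        }
        bits = sign | (e << 23) | ((mant & 0x3FFu) << 13);
    }
    float f;
    std::memcpy(&f, &bits, sizeof f);
    return f;
}

struct Box {
    int   label;
    float score;
    float x0, y0, x1, y1;
};

// Decode up to max_items rows of 7 FP16 values each.
std::vector<Box> decode_boxes(const std::uint16_t* raw, std::size_t max_items,
                              float min_score) {
    assert(raw != nullptr);
    std::vector<Box> boxes;
    for (std::size_t i = 0; i < max_items; ++i) {
        const std::uint16_t* row = raw + i * 7;
        const float image_id = half_to_float(row[0]);
        if (!(image_id >= 0.0f)) break;   // negative (or NaN) id ends the list
        const float score = half_to_float(row[2]);
        if (score < min_score) continue;  // drop low-confidence detections
        boxes.push_back({static_cast<int>(half_to_float(row[1])), score,
                         half_to_float(row[3]), half_to_float(row[4]),
                         half_to_float(row[5]), half_to_float(row[6])});
    }
    return boxes;
}
```

In this sketch, `max_items` must not exceed the number of rows actually present in the buffer, since the loop reads seven values per row before checking for the end marker.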