Thanks for the kind words🙂 Regarding the object detection, you could also use the DepthAI FCC module, to which you can attach cameras. It was designed specifically for such use cases. Note here that you will have to calibrate the mono cameras for the depth (some info here), so a bit more work. And you should also check the
Vision System for visually impaired from the OpenCV competition for some ideas. We do have an OCR demo on the
depthai-experiments repo, and as you said, it could be modified for your use-case. I'm sure there are some pre-trained models for Chinese OCR.
For the epilepsy detection, OAK-1 should do the job, since you don't need the spatial information of the patient. Note here that there is no "mechanical zoom " on the camera, so to get a high enough resolution of the face I would first do a facial detection on preview image (for example 300x300 and feed the preview to the pretrained
face-detection-retail-0004 model, similar demo here) and then based on the face detection result take the 4k video and crop that to get a high-resolution image of the detected face - to feed it into your epilepsy detection model. And as per the OAK-1 docs, the FoV is
81° DFoV - 68.8° HFoV.
Power consumption mostly depends on the workload of the Myriad X (the VPU), but from our testing OAK-1 uses about
~0.6A @ 5V.