FindFaces uses Haar cascades through an external library, OpenCV. I suggest you explore the OpenCV docs to accomplish this.
If you really want to use Mathematica you could try something where you train a classifier based on image segments, but I wouldn't expect very good performance.