Wouldn't direct use of discrete convolution instead of a ParallelTable of NIntegrate commands be much faster, and also be more useful for hardware and (fast) software implementations?
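To make the suggestion concrete, here is a minimal sketch of the idea, not the original post's code. The signals `f` and `g`, the step `h`, and the sampling range are all hypothetical choices for illustration. It samples both functions on a uniform grid, approximates the convolution integral as a Riemann sum with a single `ListConvolve` call, and checks one point against `NIntegrate`.

```wolfram
(* Hypothetical causal test signals, defined for t >= 0; not taken from the original post *)
f[t_] := Exp[-t];
g[t_] := t Exp[-t];

h = 0.01;                 (* assumed sampling step *)
ts = Range[0, 5, h];      (* uniform grid on [0, 5] *)
fs = f /@ ts;
gs = g /@ ts;

(* Full discrete convolution with zero padding; the factor h turns the sum into
   a Riemann-sum approximation of Integrate[f[tau] g[t - tau], {tau, 0, t}] *)
conv = h ListConvolve[fs, gs, {1, -1}, 0];

(* Compare one sample (t = 2) with direct numerical integration.
   For these signals the exact value is Exp[-t] t^2/2. *)
t0 = 2;
k = Round[t0/h] + 1;      (* index of t0 in ts *)
{conv[[k]], NIntegrate[f[tau] g[t0 - tau], {tau, 0, t0}], Exp[-t0] t0^2/2}
```

Only the first `Length[ts]` entries of `conv` correspond to the sampled interval, and the Riemann sum is only first-order accurate in `h`. Whether that accuracy is enough depends on the application.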
You have earned a Featured Contributor Badge. Your exceptional post has been selected for our editorial columns Staff Picks (http://wolfr.am/StaffPicks) and Publication Materials (https://wolfr.am/PubMat). Your profile is now distinguished by a Featured Contributor Badge and is displayed on the Featured Contributor Board. Thank you!