Hi folks, thanks for the help!
Here's the result of running the requested commands. I had already reset my default configs using the KB article mentioned:
In[1]:= Needs["CUDALink`"]
During evaluation of In[1]:= General::cdir: Cannot set current directory to authserver. >>
(* What's up with this error? Some Googling revealed this is a common error on the OS X version of Mathematica. *)
In[2]:= CUDAResourcesInformation[]
Out[2]= {{"Name" -> "CUDAResources", "Version" -> "8.0.4.1",
"BuildNumber" -> -1, "QualifiedName" -> "CUDAResources-OSX-8.0.4.1",
"Description" -> "{ToolkitVersion -> 4.0, MinimumDriver -> 270.0}",
"MathematicaVersion" -> "8.0.1+", "SystemIDs" -> {"MacOSX-x86-64"},
"MainLink" -> "", "Root" -> ".", "Contexts" -> {},
"Extensions" -> {"Resource"}, "ID" -> "", "Creator" -> "",
"CreatorURL" -> "", "Publisher" -> "", "SupportContact" -> "",
"Thumbnail" -> "",
"Location" ->
"/Users/sean/Library/Mathematica/Paclets/Repository/CUDAResources-\
OSX-8.0.4.1", "Installed" -> True, "Enabled" -> True,
"Loading" -> None, "Internal" -> False,
"Hash" -> "e7af8c48be63c5caca6ddda9dc04f92d"}}
In[3]:= CUDAInformation[]
Out[3]= {1 -> {"Name" -> "GeForce GT 650M", "Clock Rate" -> 900000,
"Compute Capabilities" -> 3., "GPU Overlap" -> 1,
"Maximum Block Dimensions" -> {1024, 1024, 64},
"Maximum Grid Dimensions" -> {2147483647, 65535, 65535},
"Maximum Threads Per Block" -> 1024,
"Maximum Shared Memory Per Block" -> 49152,
"Total Constant Memory" -> 65536, "Warp Size" -> 32,
"Maximum Pitch" -> 2147483647,
"Maximum Registers Per Block" -> 65536, "Texture Alignment" -> 512,
"Multiprocessor Count" -> 2, "Core Count" -> 64,
"Execution Timeout" -> 1, "Integrated" -> False,
"Can Map Host Memory" -> True, "Compute Mode" -> "Default",
"Texture1D Width" -> 65536, "Texture2D Width" -> 65536,
"Texture2D Height" -> 65536, "Texture3D Width" -> 4096,
"Texture3D Height" -> 4096, "Texture3D Depth" -> 4096,
"Texture2D Array Width" -> 16384,
"Texture2D Array Height" -> 16384,
"Texture2D Array Slices" -> 2048, "Surface Alignment" -> 512,
"Concurrent Kernels" -> True, "ECC Enabled" -> False,
"TCC Enabled" -> False, "Total Memory" -> 1073414144}}
Here's the thing - everything looks good as far as versions and such, and CUDA commands do work, but it took like 10+ seconds for the call toCUDAResourcesInformation[] to return, and over a minute for the call to CUDAInformation[] to come back. Are those delays normal?