Millions of images of passports, credit cards, birth certificates, and other documents containing personally identifiable information are likely included in one of the biggest open-source AI training sets, new research has found. Thousands of images—including identifiable faces—were found in a small subset of DataComp CommonPool, a major AI training set for image generation scraped from…
Sarah Smith, founder and managing partner of the Sarah Smith Fund, announced Thursday the final closing of a $16 million Fund I. Smith, a solo GP, launched the eponymous fund in 2022. She said she’s “stunned” by what AI can unlock for solo and next-generation firms like hers. “I can’t imagine doing […]