Generative AI, guided by inventive heuristics, can systematically and rapidly generate hundreds of ideas for engineering inventive design problems. This paper examines the reliability and effectiveness of AI-powered “idea funnelling,” a process that generates, evaluates, filters, and synthesizes raw ideas into feasible solution concepts. Key challenges include the consistency and objectivity of AI-driven evaluations, the robustness of concept generation, and the collaboration of multiple AI chatbots such as ChatGPT and Gemini. The study explores the integration of human expertise in hybrid problem-solving teams to improve feasibility, contextual relevance, and innovation quality. Through comparative experiments, it provides insights to improve the reliability of AI-driven concept creation and the performance of hybrid AI-human teams in solving complex engineering design problems.