Inference at the Edge: Running a Large Language Model Chatbot on Consumer Hardware Updated

Generative artificial intelligence and conversational chatbots like ChatGPT have made headlines in recent months. These virtual assistants sound nearly human because they are based on extremely large datasets that contain real human conversations or other sources of data that contain the desired output for a given input.  Hobbyists and open source enthusiasts are experimenting with ways to run size optimized versions of large language models that have been tuned for instruction based interaction with humans and run locally on consumer-grade hardware. Continue reading Inference at the Edge: Running a Large Language Model Chatbot on Consumer Hardware Updated