Artificial Intelligence (AI) | CoffeeWithShiva

All Posts in "Artificial Intelligence (AI)"

The Powerhouses of Modern Computing: CPUs, GPUs, NPUs, and TPUs

The rapid advancement of technology has necessitated the development of specialized processors to handle increasingly complex computational tasks. This article delves into the core components of these processing units – CPUs, GPUs, NPUs, and TPUs – and their primary use cases.

Central Processing Unit (CPU)

The CPU, often referred to as the “brain” of a computer, is a versatile processor capable of handling a wide range of tasks. It excels in sequential operations, making it suitable for general-purpose computing.

Key features: Sequential processing, efficient handling of complex instructions.
Primary use cases: Operating systems, office applications, web browsing, and general-purpose computing.

Graphics Processing Unit (GPU)

Originally designed for rendering graphics, GPUs have evolved into powerful parallel processors capable of handling numerous calculations simultaneously.

Key features: Parallel processing, massive number of cores, high computational power.
Primary use cases: Machine learning, deep learning, scientific simulations, image and video processing, cryptocurrency mining, and gaming.
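
To make the sequential-versus-parallel distinction concrete, here is a minimal Python sketch. It does not run on a GPU: a standard-library thread pool is used only as a stand-in to show which kind of work can be split across many workers. The function names (brighten, running_total) and the pixel values are illustrative assumptions, not part of any real graphics API.

```python
from concurrent.futures import ThreadPoolExecutor


def brighten(pixel: int) -> int:
    """Element-wise work: each pixel is independent of every other pixel."""
    return min(pixel + 40, 255)


def running_total(values: list[int]) -> list[int]:
    """Work written as a dependency chain: each step uses the previous result."""
    totals, acc = [], 0
    for v in values:
        acc += v
        totals.append(acc)
    return totals


pixels = [10, 200, 230, 90]  # hypothetical 8-bit pixel values

# Independent per-element work can be handed to many workers at once.
# This is the pattern a GPU exploits with its large number of cores.
with ThreadPoolExecutor(max_workers=4) as pool:
    brightened = list(pool.map(brighten, pixels))

# A step-by-step loop like this one is processed in order, which is the
# kind of work a CPU core handles well.
totals = running_total(pixels)

print(brightened)
print(totals)
```

The point of the sketch is the shape of the work, not the speed: the thread pool here will not be faster than a plain loop for four numbers.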

Neural Processing Unit (NPU)

Designed specifically for artificial intelligence workloads, NPUs are optimized for tasks like image recognition, natural language processing, and machine learning.

Key features: Low power consumption, high efficiency for AI computations, specialized hardware accelerators.
Primary use cases: Mobile and edge AI applications, computer vision, natural language processing, and other AI-intensive tasks.

Tensor Processing Unit (TPU)

Developed by Google, TPUs are custom-designed ASICs (Application-Specific Integrated Circuits) optimized for machine learning workloads, particularly those involving tensor operations.

Key features: High performance, low power consumption, specialized for machine learning workloads.
Primary use cases: Deep learning, machine learning research, and large-scale AI applications.
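
As a rough illustration of what a "tensor operation" looks like, the sketch below multiplies two small matrices in plain Python. Matrix multiplication is the core operation inside neural network layers, and it is the kind of arithmetic this class of hardware is built to accelerate. The matmul helper and the sample numbers are assumptions for illustration only; real workloads use frameworks that dispatch this work to the accelerator.

```python
def matmul(a: list[list[float]], b: list[list[float]]) -> list[list[float]]:
    """Multiply matrix a (rows x inner) by matrix b (inner x cols)."""
    inner, cols = len(b), len(b[0])
    if any(len(row) != inner for row in a):
        raise ValueError("inner dimensions must match")
    return [
        [sum(row[k] * b[k][j] for k in range(inner)) for j in range(cols)]
        for row in a
    ]


# Hypothetical values: two input examples with two features each,
# and a 2x2 weight matrix such as a tiny neural network layer might hold.
inputs = [[1.0, 2.0], [3.0, 4.0]]
weights = [[0.5, -1.0], [0.25, 2.0]]

outputs = matmul(inputs, weights)
print(outputs)
```

Every output cell is an independent sum of products, which is why this operation maps so well onto hardware with many multiply-accumulate units.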

Other Specialized Processors

Beyond these core processors, several other specialized processors have emerged for specific tasks:

Field-Programmable Gate Array (FPGA): Highly customizable hardware that can be reconfigured to perform various tasks, for example signal processing.
Data Processing Unit (DPU): A specialized processor designed to offload data-intensive tasks from the CPU. It's particularly useful in data centers, where it handles networking, storage, and security operations. By taking over these functions, the DPU frees up the CPU to focus on more complex computational tasks. Primary use cases include data center infrastructure and security and encryption tasks.
Vision Processing Unit (VPU): A processor specifically designed to accelerate computer vision tasks. It's optimized for image and video processing, object detection, and other AI-related visual computations. VPUs are often found in devices like smartphones, AR/VR headsets, surveillance cameras, and autonomous vehicles.

The Interplay of Processors

In many modern systems, these processors work together. For instance, a laptop might use a CPU for general tasks, a GPU for graphics and some machine learning workloads, and an NPU for specific AI functions. Matching each workload to the processor best suited to it improves both performance and power efficiency.

The choice of processor depends on the specific application and workload. For computationally intensive tasks like machine learning and deep learning, GPUs and TPUs often provide significant performance advantages over CPUs. However, CPUs remain essential for general-purpose computing and managing system resources.

As technology continues to advance, we can expect even more specialized processors to emerge, tailored to specific computational challenges. This evolution will drive innovation and open up new possibilities in various fields.

In Summary:

CPU is a general-purpose processor for a wide range of tasks.
GPU is specialized for parallel computations, often used in graphics and machine learning.
TPU is Google's custom ASIC, optimized for tensor-heavy machine learning workloads.
NPU is optimized for low-power neural network operations, often on mobile and edge devices.
DPU is designed for data-intensive tasks in data centers.
VPU is specialized for computer vision tasks.

by Shiva — July 23, 2024 in Artificial Intelligence (AI) 0

OpenAI’s Path to Artificial General Intelligence (AGI)

OpenAI, a leading artificial intelligence research laboratory, has outlined a five-level framework to measure progress towards achieving Artificial General Intelligence (AGI). This framework provides a structured approach to understanding the complexities and potential implications of AI development.

Level 1: Conversational AI – chatbots with conversational language

Focus: Developing AI systems capable of engaging in natural and informative conversations.
Examples: ChatGPT, Google Bard (since renamed Gemini)
Benefits: Revolutionize customer service, education, and mental health support. Improve accessibility to information and facilitate human-computer interaction.

Level 2: Reasoners – human-level problem solving

Focus: Creating AI systems that can solve complex problems, requiring reasoning, planning, and learning.
Example: AI systems capable of drafting legal documents, conducting scientific research, or developing complex software.
Benefits: Accelerate scientific discovery, increase efficiency in various fields like medicine and engineering.

Level 3: Autonomous Agents – systems that can take actions independently

Focus: Building AI systems capable of operating independently in complex environments, making decisions, and taking actions.
Example: Self-driving cars, robots capable of performing household tasks, or AI systems managing complex infrastructure.
Benefits: Transform transportation, improve quality of life, and enhance efficiency in industries like manufacturing and logistics.

Level 4: Innovators – AI that can aid in invention

Focus: Developing AI systems capable of generating new ideas and solutions, demonstrating creativity and adaptability.
Example: AI systems designing new drugs, creating innovative products, or composing music.
Benefits: Drive economic growth, foster innovation, and potentially lead to breakthroughs in fields like art, science, and technology.

Level 5: Organizational Equivalents – AI that can do the work of an organization

Focus: Creating AI systems capable of operating as entire organizations, making strategic decisions, and adapting to changing environments.
Example: AI systems managing complex businesses, governments, or non-profit organizations.
Benefits: Revolutionize governance, economic systems, and societal structures. However, this level also raises significant ethical and societal challenges.

According to Bloomberg, OpenAI believes its technology is approaching the second level of five on the path to artificial general intelligence. It’s important to note that this framework is a conceptual roadmap and the exact boundaries between levels may be fluid. Additionally, achieving each level represents a significant technological leap and will likely require substantial advancements in hardware, algorithms, and data.

While the potential benefits of AGI are immense, it’s crucial to address the associated challenges and risks, such as job displacement, bias, and the potential for misuse. OpenAI and other leading AI research organizations are actively working on developing safety protocols and ethical guidelines to ensure that AGI benefits humanity as a whole.

References:

https://www.bloomberg.com/news/articles/2024-07-11/openai-sets-levels-to-track-progress-toward-superintelligent-ai?embedded-checkout=true&sref=HrWXCALa

https://www.forbes.com/sites/jodiecook/2024/07/16/openais-5-levels-of-super-ai-agi-to-outperform-human-capability

by Shiva — July 22, 2024 in GenAI 0

Figure Unveiled a Humanoid Robot in Partnership with OpenAI

Yet another milestone in the history of A.I. and Robotics!

And no, I'm not exaggerating! What you are about to read is a glimpse of a futuristic world where humanoid robots could serve humanity in many ways (keeping the negatives out of the picture for the time being).

When I first heard this news, movies such as I, Robot and Enthiran, the Robot flashed through my mind! Putting my filmy fantasies aside, the robotics company Figure, in partnership with Microsoft and OpenAI, has released the first general-purpose humanoid robot, Figure 01, designed for commercial use.

Here’s the quick video released by the creators –

Figure's robotics expertise is augmented by OpenAI's multimodal models, which can understand and respond to inputs such as images, audio, and video. The future looks promising: humanoids like this could be deployed in manufacturing and commercial settings where a labor shortage limits how far production can scale.

In the video, the robot demonstrates the ability to recognize objects such as an apple and take appropriate actions. The Figure 01 humanoid robot reportedly stands 5 feet 6 inches tall and weighs 132 pounds. It can carry up to 44 pounds and move at a speed of 1.2 meters per second.

Figure is backed by investors such as Microsoft, the OpenAI Startup Fund, NVIDIA, Jeff Bezos (through Bezos Expeditions), and more.

A lot of fascinating innovations are happening around us thanks to Gen AI / LLMs: Copilot, Devin, Sora, and now a glimpse into the reality of humanoid robotics. Isn't it a great time to be around?!

by Shiva — March 13, 2024 in Artificial Intelligence (AI) 0

Meet Devin, the first AI-based Software Engineer

Gen AI enables writing highly sophisticated code for a given problem statement. Developers can already take advantage of that!

What if there were a full-fledged tool that could write code, fix bugs, leverage online resources, collaborate with humans, and take on gigs on popular freelancing sites such as Upwork?!

Is this fiction? Well, not anymore.

Meet Devin, the first-of-its-kind AI-based software engineer, created by Cognition Labs, an applied AI lab focused on reasoning.

The tech world is already amazed by the capabilities of Copilot, which assists in writing code snippets. Devin goes a step further: according to its creators, it can handle software development end to end.

According to the creators, Devin has the following key capabilities as of writing –

Learn how to use unfamiliar technologies.
Build and deploy apps end to end.
Autonomously find and fix bugs in codebases.
Train and fine-tune its own AI models.
Address bugs and feature requests in open source repositories.
Contribute to mature production repositories.
Solve real jobs on Upwork!

Scott Wu, the founder and CEO of Cognition, explained that Devin can access common developer tools, including its own shell, code editor, and browser, within a sandboxed compute environment to plan and execute complex engineering tasks requiring thousands of decisions.

According to results published by its creators, Devin resolved 13.86% of issues without human assistance on SWE-bench, a benchmark that asks agents to resolve challenging real-world GitHub issues from open-source projects such as scikit-learn and Django.

There's a lively conversation around the globe about whether AI could kill basic human coding skills, and NVIDIA's founder recently remarked that everyone is now a programmer thanks to AI. Of course, I think human oversight is still required to refine the output and meet users' requirements.

Thanks to Devin, humans can now focus more on complex or interesting problems that require our creativity and make the best use of our time. As of now, access to Devin is limited to select individuals; public access is still pending. For more info, visit cognition-labs.com/blog

by Shiva — March 13, 2024 in GenAI 0

Meta’s Large Language Model – LLaMa 2 released for enterprises

Meta, the parent company of Facebook, unveiled LLaMa 2, the latest version of its large language model, for research and commercial purposes. It's released as open source, unlike OpenAI's GPT and Google's Bard, which are proprietary.

What is LLaMa?

LLaMa (Large Language Model Meta AI) is an open-source language model built by Meta's GenAI team for research. LLaMa 2 is the newly released version, available for both research and commercial use.

Difference between LLaMa and LLaMa 2

LLaMa 2 model was trained on 40% more data than its predecessor. Al-Dahle (vice president at Meta who is leading the company’s generative AI work) says there were two sources of training data: data that was scraped online, and a data set fine-tuned and tweaked according to feedback from human annotators to behave in a more desirable way. The company says it did not use Meta user data in LLaMA 2, and excluded data from sites it knew had lots of personal information.

Newly released LLaMa 2 models will not only further accelerate the LLM research work but also enable enterprises to build their own generative AI applications. LLaMa 2 includes 7B, 13B and 70B models, trained on more tokens than LLaMA, as well as the fine-tuned variants for instruction-following and chat.

According to Meta, its LLaMa 2 “pretrained” models are trained on 2 trillion tokens and have a context window of 4,096 tokens (fragments of words). The context window determines the length of the content the model can process at once. Meta also says that the LLaMa 2 fine-tuned models, developed for chat applications similar to ChatGPT, have been trained on “over 1 million human annotations.”
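
To illustrate what a 4,096-token context window means in practice, here is a minimal Python sketch that trims an over-long input so that only the most recent tokens are kept. The whitespace tokenizer is a deliberate simplification and an assumption of this sketch: real models use subword tokenizers, so actual token counts differ. The function names are hypothetical and not part of any Meta API, and keeping the most recent tokens is just one possible trimming strategy.

```python
CONTEXT_WINDOW = 4096  # context length quoted above for LLaMa 2


def tokenize(text: str) -> list[str]:
    """Assumption: split on whitespace. Real tokenizers split into subwords."""
    return text.split()


def fit_to_context(tokens: list[str], window: int = CONTEXT_WINDOW) -> list[str]:
    """Keep only the most recent `window` tokens; drop the oldest ones."""
    start = max(len(tokens) - window, 0)
    return tokens[start:]


# A hypothetical prompt that is longer than the model can process at once.
prompt_tokens = tokenize("word " * 5000)
kept = fit_to_context(prompt_tokens)

print(len(prompt_tokens), len(kept))
```

Anything trimmed this way is invisible to the model, which is why context length matters for long documents and long chat histories.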

Databricks highlights the salient features of such open-source LLMs:

No vendor lock-in or forced deprecation schedule
Ability to fine-tune with enterprise data, while retaining full access to the trained model
Model behavior does not change over time
Ability to serve a private model instance inside of trusted infrastructure
Tight control over correctness, bias, and performance of generative AI applications

Microsoft says that LLaMa 2 is the latest addition to their growing Azure AI model catalog. The model catalog, currently in public preview, serves as a hub of foundation models and empowers developers and machine learning (ML) professionals to easily discover, evaluate, customize and deploy pre-built large AI models at scale.

OpenAI GPT vs LLaMa

A powerful open-source model like LLaMA 2 poses a considerable threat to OpenAI, says Percy Liang, director of Stanford’s Center for Research on Foundation Models. Liang was part of the team of researchers who developed Alpaca, an open-source competitor to GPT-3, an earlier version of OpenAI’s language model.

“LLaMA 2 isn’t GPT-4,” says Liang. Compared to closed-source models such as GPT-4 and PaLM-2, Meta itself speaks of “a large gap in performance”. However, LLaMA 2 should reach the level of GPT-3.5, the model behind ChatGPT, in most cases. And, Liang says, for many use cases, you don’t need GPT-4.

A more customizable and transparent model, such as LLaMA 2, might help companies create products and services faster than a big, sophisticated proprietary model, he says.

“To have LLaMA 2 become the leading open-source alternative to OpenAI would be a huge win for Meta,” says Steve Weber, a professor at the University of California, Berkeley.

LLaMA 2 also has the same problems that plague all large language models: a propensity to produce falsehoods and offensive language. The fact that LLaMA 2 is an open-source model will also allow external researchers and developers to probe it for security flaws, which will make it safer than proprietary models, Al-Dahle says.

With that said, Meta is set to make its presence felt in the open-source AI space, having announced the release of the commercial version of its AI model LLaMa. The model will be available for fine-tuning on AWS, Azure, and Hugging Face's AI model hosting platform in pretrained form. And it'll be easier to run, Meta says: it is optimized for Windows thanks to an expanded partnership with Microsoft, as well as for smartphones and PCs packing Qualcomm's Snapdragon system-on-chip. The key advantages of on-device AI are cost reduction (no per-query cloud costs) and data security (as data remains solely on the device).

LLaMa could turn out to be a great alternative to pricey proprietary models such as OpenAI's ChatGPT and Google's Bard.

References:

https://ai.meta.com/llama/?utm_pageloadtype=inline_link

https://www.technologyreview.com/2023/07/18/1076479/metas-latest-ai-model-is-free-for-all/

https://blogs.microsoft.com/blog/2023/07/18/microsoft-and-meta-expand-their-ai-partnership-with-llama-2-on-azure-and-windows/

https://www.qualcomm.com/news/releases/2023/07/qualcomm-works-with-meta-to-enable-on-device-ai-applications-usi

https://techcrunch.com/2023/07/18/meta-releases-llama-2-a-more-helpful-set-of-text-generating-models/

https://www.databricks.com/blog/building-your-generative-ai-apps-metas-llama-2-and-databricks

by Shiva — July 19, 2023 in Artificial Intelligence (AI) 0

Difference between traditional AI and Generative AI

Generative AI has been the new buzzword since late 2022. The likes of ChatGPT and Bard are taking AI to all-new levels, with a wide variety of use cases for consumers and enterprises.

I wanted to briefly understand the difference between traditional AI and generative AI. According to a recent report published by Deloitte, GenAI's output is of a higher complexity when compared with traditional AI.

Traditional AI models typically generate output in the form of a value (e.g., predicting sales for the next quarter) or a label (e.g., classifying a transaction as legitimate or fraudulent). GenAI models tend to generate a full page of composed text or another digital artifact. Applications like Midjourney and DALL-E, for instance, produce images.

In the case of GenAI, there is no single correct answer. The Deloitte study reports that this results in a large degree of freedom and variability, which can be interpreted as creativity.
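
The contrast can be sketched in a few lines of Python. This is a toy under stated assumptions, not how production systems work: the fraud rule is a hypothetical fixed threshold standing in for a trained classifier, and the "generator" simply samples from a tiny hand-written next-word table standing in for a large language model.

```python
import random


def classify_transaction(amount: float, threshold: float = 1000.0) -> str:
    """Traditional-style output: one label, the same every time for a given input."""
    return "fraud" if amount > threshold else "legitimate"


# Hypothetical next-word table; a real generative model learns these
# probabilities from data instead of having them written by hand.
NEXT_WORDS = {
    "<start>": ["sales", "revenue"],
    "sales": ["grew", "fell"],
    "revenue": ["grew", "fell"],
    "grew": ["sharply", "slightly"],
    "fell": ["sharply", "slightly"],
}


def generate_sentence(rng: random.Random) -> str:
    """Generative-style output: sampled word by word, so results can vary."""
    word, words = "<start>", []
    while word in NEXT_WORDS:
        word = rng.choice(NEXT_WORDS[word])
        words.append(word)
    return " ".join(words)


print(classify_transaction(2500.0))  # always the same label for this input

rng = random.Random()
print(generate_sentence(rng))  # may differ from run to run
print(generate_sentence(rng))
```

The classifier has exactly one right answer for each input, while the generator has several acceptable outputs and picks among them by sampling.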

The underlying GenAI models are usually large in terms of resource consumption, requiring terabytes of high-quality data processed on large-scale, GPU-enabled, high-performance computing clusters. With OpenAI's innovation being plugged into Microsoft Azure services and the Office suite, it will be interesting to see the dramatic changes in consumers' productivity!

by Shiva — May 28, 2023 in Artificial Intelligence (AI) 0

AI & ML Funny Memes

This is a fun post compiling the AI Memes that I have come across.

P.S: These are random memes sourced from the internet.

by Shiva — March 26, 2023 in Artificial Intelligence (AI) 0

Top Use Cases of AI in Business

It feels as if the movie Terminator was released quite recently, and many of us have talked about whether machines could help with our daily chores and support business operations.

Fast forward! We're already seeing a few changes around us, where AI-enabled systems help us in many ways, and their potential looks bright a few years down the road.

I was researching a few top use cases of AI in business a couple of weeks ago. I thought I'd share them here, and I'm sure you'll be excited to read and share them.

Top Use Cases of AI in Business

1. Computer Vision – Smart Cars (Autonomous Cars): An IBM survey found that 74% of respondents expected to see smart cars on the road by 2025. Such a car might adjust its internal settings (temperature, audio, seat position, etc.) automatically based on the driver, report and even fix problems itself, drive itself, and offer real-time advice about traffic and road conditions.

2. Robotics: In 2010, Japan’s SoftBank telecom operations partnered with French robotics manufacturer Aldebaran to develop Pepper, a humanoid robot that can interact with customers and “perceive human emotions.” Pepper is already popular in Japan, where it’s used as a customer service greeter and representative in 140 SoftBank mobile stores.

3. Amazon Drones: In July 2016, Amazon announced its partnership with the UK government in making small parcel delivery via drones a reality. The company is working with aviation agencies around the world to figure out how to implement its technology within the regulations set forth by said agencies. Amazon’s “Prime Air” is described as a future delivery system for safely transporting and delivering up to 5-pound packages in less than 30 minutes.

4. Augmented Reality: Ex: Google Glass: It can show the location of items you are shopping for, with information such as cost, nutrition, or whether another store has the item for less money. Because it is AI-powered, it can learn that you’re likely to ask for the weather at a certain time, or want reminders about meetings, so it will simply “pop up” unobtrusively.

5. Marketing Personalization: Companies can personalize which emails a customer receives, which direct mailings or coupons, which offers they see, which products show up as “recommended” and so on, all designed to lead the consumer more reliably towards a sale. You’re probably familiar with this use if you use services like Amazon or Netflix. Intelligent machine learning algorithms analyze your activity and compare it to the millions of other users to determine what you might like to buy or binge watch next.

6. Chatbots: Customers want the convenience of not having to wait for a human agent to handle their call, or wait a few hours for a reply to an email or Twitter query. Chatbots are instant, available 24×7, and backed by robust AI that offers contextually relevant, personalized conversation.

7. Fraud Detection: Machine learning is getting better and better at spotting potential cases of fraud across many different fields. PayPal, for example, is using machine learning to fight money laundering.

8. Personal Security: Airports – AI can spot things human screeners might miss in security screenings at airports, stadiums, concerts, and other venues. That can speed up the process significantly and ensure safer events.

9. Healthcare: Machine learning algorithms can process more information and spot more patterns than their human counterparts. One study used computer-assisted diagnosis (CAD) to review the early mammography scans of women who later developed breast cancer, and the computer spotted 52% of the cancers as much as a year before the women were officially diagnosed.
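
The marketing personalization use case above (item 5) describes comparing your activity with that of other users to decide what to recommend. Here is a minimal Python sketch of that idea, assuming a simple overlap-based similarity (Jaccard) between viewing histories. The user names, titles, and the recommend function are hypothetical, and services like Amazon or Netflix use far more elaborate methods than this.

```python
def jaccard(a: set[str], b: set[str]) -> float:
    """Similarity of two sets: size of the overlap divided by size of the union."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def recommend(target: str, history: dict[str, set[str]], top_n: int = 2) -> list[str]:
    """Suggest unseen items, weighted by how similar their viewers are to the target."""
    seen = history[target]
    scores: dict[str, float] = {}
    for user, items in history.items():
        if user == target:
            continue
        similarity = jaccard(seen, items)
        for item in items - seen:
            scores[item] = scores.get(item, 0.0) + similarity
    # Highest score first; ties are broken alphabetically for stable output.
    ranked = sorted(scores, key=lambda item: (-scores[item], item))
    return ranked[:top_n]


# Hypothetical viewing histories.
history = {
    "ana": {"drama_a", "thriller_b", "comedy_c"},
    "ben": {"drama_a", "thriller_b", "sci_fi_d"},
    "cho": {"comedy_c", "cooking_e"},
    "dev": {"sports_f"},
}

print(recommend("ana", history))
```

In this toy data, ben shares the most titles with ana, so the title only ben has watched is ranked first for her.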

by Shiva — August 28, 2018 in Artificial Intelligence (AI) 0