LibGuides: Artificial Intelligence at Ringling: Artificial Intelligence Homepage

At Ringling College of Art and Design, we acknowledge the disruptive impact of artificial intelligence (AI) on the creative industries. We understand that AI has caused uncertainty, frustration, and even anger among many artists and designers who fear for their livelihoods and the future of their crafts. As we confront the challenges posed by AI, we remain committed to providing our students with the skills, knowledge, and ethical foundation necessary to navigate this rapidly changing landscape.

See more about Ringling's stance on AI.

Text Generation AI

ChatGPT

About: ChatGPT is a chatbot and virtual assistant developed by OpenAI and launched on November 30, 2022. Based on large language models (LLMs), it enables users to refine and steer a conversation towards a desired length, format, style, level of detail, and language.

ChatGPT is built on OpenAI's proprietary series of generative pre-trained transformer (GPT) models and is fine-tuned for conversational applications using a combination of supervised learning and reinforcement learning from human feedback.

ChatGPT is credited with starting the AI boom, which has led to ongoing rapid investment in and public attention to the field of artificial intelligence (AI). By January 2023, it had become what was then the fastest-growing consumer software application in history, gaining over 100 million users and contributing to the growth of OpenAI's current valuation of $86 billion.

Who made it? OpenAI was co-founded by Ilya Sutskever, Greg Brockman, John Schulman, and Wojciech Zaremba, with Sam Altman later joining as the CEO. [GeeksForGeeks]

Cost: ChatGPT was released as a freely available research preview, but OpenAI now operates the service on a "freemium" model due to its popularity. Users on its free tier can access GPT-4o and GPT-3.5. The ChatGPT subscriptions "Plus", "Team" and "Enterprise" provide additional features such as DALL-E 3 image generation and increased GPT-4o usage limit.

[Wikipedia]

Claude

About: Claude is a family of large language models developed by Anthropic.The first model was released in March 2023. Claude 3, released in March 2024, can also analyze images. The Claude 3 family includes three state-of-the-art models in ascending order of capability: Haiku, Sonnet, and Opus.

Who made it? Anthropic, an AI startup founded by former OpenAI members Daniela and Dario Amodei created Claude AI.

"Alignment is top of mind for the brother-and-sister duo. In industry lingo, the term means ensuring AI systems are “aligned” with human values. Dario, 40, and Daniela, 36—CEO and president of Anthropic, respectively—believe they are taking a safer and more responsible approach to AI alignment than other companies building cutting-edge AI systems." [Time]

Cost: Limited-use access using Claude 3.5 Sonnet is free of charge, but requires both an e-mail address and a cellphone number. A paid plan is also offered for more usage and access to all Claude 3 models. On May 1, 2024, Anthropic announced the Claude Team plan, its first enterprise offering for Claude, and a Claude iOS app.

[Wikipedia]

Gemini

About: Gemini, formerly known as Bard, is a generative artificial intelligence chatbot developed by Google. Based on the large language model (LLM) of the same name and developed as a direct response to the meteoric rise of OpenAI's ChatGPT, it was launched in a limited capacity in March 2023 before expanding to other countries in May.

Cost: Gemini has a limited free option, or users can opt for Gemini Advanced, which includes integration with Google Workspace and cloud storage with Google One.

As of August 2024, there is campus access to Google Gemini.

[Wikipedia]

MetaAI

About: Meta AI is an American company owned by Meta (formerly Facebook) that develops artificial intelligence and augmented and artificial reality technologies. Meta AI deems itself an academic research laboratory, focused on generating knowledge for the AI community, and should not be confused with Meta's Applied Machine Learning (AML) team, which focuses on the practical applications of its products.

Cost: As of July 2024, MetaAI is free to use without any paid subscription.

Concerns: Since May 2024, the Meta AI chatbot has summarized news from various outlets without linking directly to original articles, including in Canada, where news links are banned on its platforms. This use of news content without compensation has raised ethical and legal concerns, especially as Meta continues to reduce news visibility on its platforms

Copilot

About: Microsoft Copilot is a generative artificial intelligence chatbot developed by Microsoft. Based on a large language model, it was launched in February 2023 as Microsoft's primary replacement for the discontinued Cortana. The service was originally introduced under the name Bing Chat. Copilot utilizes the Microsoft Prometheus model, built upon OpenAI's GPT-4 foundational large language model, which in turn has been fine-tuned using both supervised and reinforcement learning techniques.

in December 2023, Copilot was added without payment to many Windows 11 installations. Later that month, a standalone Microsoft Copilot app was released.

Microsoft Copilot in Windows supports the use of voice commands. By default, it is accessible via the Windows taskbar. Copilot in Windows can provide information on a website currently being browsed by a user in Microsoft Edge. Copilot can also be used to rewrite and generate text based on user prompts in Microsoft 365 services including Microsoft Word, Excel, PowerPoint, Outlook, and Teams.

Cost: Microsoft operates Copilot on a freemium model. Users on its free tier can access most features, while priority access to newer features, including custom chatbot creation, is provided to paid subscribers under the "Microsoft Copilot Pro" paid subscription service. Several default chatbots are available in the free version of Microsoft Copilot, including the standard Copilot chatbot as well as Microsoft Designer, which is oriented towards using its Image Creator to generate images based on text prompts.

[Wikipedia]

Mistral

About: Mistral AI is a French company specializing in artificial intelligence (AI) products. The company focuses on producing open-source large language models, emphasizing the foundational importance of free and open-source software, and positioning itself as an alternative to proprietary models.

Who made it? Founded in April 2023 by Arthur Mensch, Guillaume Lample and Timothée Lacroix, the company has quickly risen to prominence in the AI sector. Before co-founding Mistral AI, Arthur Mensch worked at Google DeepMind which is Google's artificial intelligence laboratory, while Guillaume Lample and Timothée Lacroix worked at Meta Platforms.

Cost: Mistral has a variety of "pay-as-you-go" pricing options.

[Wikipedia]

Image Generation AI

Firefly

About: Adobe Firefly is a generative machine learning model included as part of Adobe Creative Cloud. It is currently available to all Adobe Creative Cloud subscribers.

Adobe Firefly is developed using Adobe's Sensei platform. Firefly is trained with images from Creative Commons, Wikimedia and Flickr Commons as well as 300 million images and videos in Adobe Stock and the public domain. It uses image data sets to generate various designs.It learns from user feedback by adjusting its designs.

Cost: Generative credits that allow the use of generative AI features powered by Firefly are given to Adobe Creative Cloud users. The consumption of generative credits depends on the generated output's computational cost and the value of the generative AI feature used. Learn about when the credits renew.

[Wikipedia]

4o Image Generation in ChatGPT

About: See this page for information about 4o Image Generation, launched 3/25/25. ChatGPT is a chatbot and virtual assistant developed by OpenAI and launched on November 30, 2022. Based on large language models (LLMs), it enables users to refine and steer a conversation towards a desired length, format, style, level of detail, and language.

Who made it? OpenAI was co-founded by Ilya Sutskever, Greg Brockman, John Schulman, and Wojciech Zaremba, with Sam Altman later joining as the CEO. [GeeksForGeeks]

[Wikipedia]

Midjourney

About: Midjourney is a generative artificial intelligence program and service created and hosted by the San Francisco–based independent research lab Midjourney, Inc. Midjourney generates images from natural language descriptions, called prompts, similar to OpenAI's DALL-E and Stability AI's Stable Diffusion. It is one of the technologies of the AI boom.

Who made it? The Midjourney team is led by David Holz, who co-founded Leap Motion.

Cost: Midjourney has four subscription tiers. Pay month-to-month or for the entire year for a 20% discount. Each subscription plan includes access to the Midjourney member gallery, the official Discord, general commercial usage terms, and more.

[Wikipedia]

Stable Diffusion

About: Stable Diffusion is a deep learning, text-to-image model released in 2022 based on diffusion techniques. The generative artificial intelligence technology is the premier product of Stability AI and is considered part of the ongoing artificial intelligence boom.

It is primarily used to generate detailed images conditioned on text descriptions. However, it can also be applied to other tasks such as inpainting, outpainting, and generating image-to-image translations guided by a text prompt.

Stable Diffusion is a latent diffusion model, a type of deep generative artificial neural network. Its code and model weights have been released publicly, and it can run on most consumer hardware equipped with a modest GPU with at least 4 GB VRAM. This marked a departure from previous proprietary text-to-image models such as DALL-E and Midjourney, which were accessible only via cloud services.

Who made it? Its development involved researchers from the CompVis Group at Ludwig Maximilian University of Munich and Runway with a computational donation from Stability and training data from non-profit organizations.

Cost: Monthly plans start at $27/month, access to 100+ AI APIs and models.

[Wikipedia]

Leonardo.AI

About: Leonardo.Ai is an AI-powered platform specializing in image generation and manipulation. They combine generative AI technology with unparalleled creator control, reinforcing human creativity instead of replacing it.

The company platform emphasizes granular control in every step of the content generation process. Leveraging a slew of cutting-edge backend features, Leonardo excels in model fine-tuning, prompt adherence, training speed, inference pace, and multi-image prompting capabilities. They address common obstacles such as image degradation and have introduced custom upscaling, all with commitment to continuous improvement and expansion.

Who made it? Sydney, Australia-based CEO and co-founder J.J. Fiasson became interested in generative AI when Google Deep Dream launched, and continued to explore generative AI at his last startup, gaming studio Raini Studios. As of July 2024, Canva has acquired Leonardo.ai, as the company looks to broaden the scope of its AI tech stack. The financial terms of the deal weren’t disclosed, but Canva co-founder and chief product officer Cameron Adams said it’s a mix of cash and stock. All of Leonardo.ai’s 120 employees will be joining Canva, including the executive team.

Cost: Leonardo offers a free tier that includes a daily quota of tokens. These can be used for your creative projects within our platform. This free offering doesn’t come with an expiry date, so you can use it for as long as you like. Our paid subscriptions come with extra benefits: an increased token allowance, faster image generation, and access to premium features.

[Leonardo FAQ]

Google Imagen in Gemini

About: Imagen, a text-to-image diffusion model with an unprecedented degree of photorealism and a deep level of language understanding, Imagen is an AI system that creates photorealistic images from input text. Imagen builds on the power of large transformer language models in understanding text and hinges on the strength of diffusion models in high-fidelity image generation.

With Imagen, you can do the following:

Generate novel images using only a text prompt (text-to-image AI generation).
Edit an entire uploaded or generated image with a text prompt.
Edit only parts of an uploaded or generated image using a mask area you define.
Upscale existing, generated, or edited images.
Fine-tune a model with a specific subject (for example, a specific handbag or shoe) for image generation.
Get text descriptions of images with visual captioning.
Get answers to a question about an image with Visual Question Answering (VQA).

Cost: A free trial is available, and cost is based on image input, character input, or custom training. You can estimate costs via Google Cloud's pricing calculator.

[Google]

MetaAI

About: You can generate custom images using Meta AI. When you begin typing a prompt, you'll see an example image. When you submit the prompt, Meta AI will generate four images based on it including the example image. You can download the generated images, regenerate images based on the same prompt, or submit a new prompt. As of July 2024, the feature is currently in beta.

Note: To generate images on www.meta.ai you must log in with your Facebook or Instagram account. If you don't log in, you can still chat with Meta AI.

To generate images with Meta AI:

Go to www.meta.ai in your browser.
Click Ask Meta AI anything… at the bottom of the page.
Type in what you want to generate, starting with “Imagine.” Be detailed in your prompt. For example, you can type “Imagine a tiger wearing a vest drinking tea at a cafe.” You’ll see an example image generated while you type your prompt. If you aren’t logged in with your Facebook or Instagram account, you’ll be asked to log in.

Cost: free with Meta account (Facebook or Instagram).

[Meta]

Vizcom

About: "Vizcom builds a tool for industrial designers to create photorealistic renderings from sketches and collaborate securely with their teams. For those who aren’t familiar with industrial design, take a look at all of the physical objects around you – chairs, cars, shoes, and so on. Each one was crafted by an industrial designer.

Vizcom is built to complement the existing 2D sketch workflows that concept artists already have. Users connect their tablet to Vizcom’s infinite canvas to start a sketch and then can iterate by using AI to create renders, touch up via their pencils, and then render again. This elevates the artists and lets them productively co-work with AI." -- Index Ventures

Who made it? Jordan Taylor (ex-industrial designer at Honda and Nvidia) and Kaelan Richards.

Cost: Free trial, free Pro Account for students and educators (sign up with your school email) for 1 year.

Krea

About: "3D modeling has always been a complex process, often requiring advanced software and technical expertise. But what if you could turn any 2D image into a 3D object with just a few clicks? That’s exactly what Krea AI offers — an AI-powered tool that simplifies 3D object creation in real time." Medium

Who made it? Co-founded by Victor Pérez and Diego Rodriguez.

Cost: Free daily generations, with options for paid subscription Basic, Pro, and Max tiers.

Draw Things

About: "Draw Things is a AI-assisted image generation tool to help you create images you have in mind in minutes rather than days. Master the cast spells and your Mac will draw what you want with a few simple words. Draw Things runs locally on your phone to protect your privacy" (Apple).

Who made it? San Francisco-based developer named Liu Liu (Ars Technica).

Cost: Free

Integrates well with Source.Plus.

Invoke

About: Create images with the precision and control that professionals require. Invoke began as an open-source project, and they continue to give our Community Edition software for free to creatives (or anyone aspiring to be one!) who want to self-host and install with their own compatible hardware.

Who made it? Kent Keirsey is a creative technologist who has served as a Product and Business leader in startups across B2B, B2C, and Enterprise SaaS. He is the founder and CEO of Invoke, an open-source Enterprise platform built to empower creatives to co-create with custom/fine-tuned AI products.

Cost: Free for the Community Edition, with three professional paid tiered options.

Video Generation AI

Genmo

About: Genmo's most recent product is Replay, a new, high-quality AI video generator available to the public for free. Replay is available at https://genmo.ai with an easy to use and fast interface.

Who made it? Genmo was started by two ex-Googlers and academics, including one of the co-authors of the DDPM paper.

Key Features of Genmo.AI:

Generation of high-quality images, videos, and 3D models.
Customizable prompts that allow for creative flexibility.
Tools such as Genmo Replay and AnimateLegacy (alpha) for animation creation.
Community engagement through features like Genmo Chat and Discord integration.
Downloadable content for sharing and personal use.

Cost: Genmo is 100% free to use under the CC BY-NC 4.0 terms. Your generations will come with a watermark, which can be removed by upgrading to Turbo. Genmo Turbo costs $10 per month.

[Genmo]

Pika

About: Pika 1.0 allows users to generate and edit videos in diverse styles such as 3D animation, anime or cinematic – from simple text prompts.

Who made it? In April 2023, Demi Guo and Chenlin Meng dropped out of Stanford to launch Pika to build an easier-to-use AI video generator.

Cost: Pika AI offers four main pricing tiers: Free Basic, Standard, Unlimited, and Pro, each designed to cater to different user needs and preferences. The Free Basic plan includes 250 initial credits with daily refills to 30 credits, the ability to download videos, and Lip Sync audio generations at a cost of 2 credits each. The Standard Plan is priced at $8 per month, and offers 700 monthly credits, free Lip Sync audio generations, video downloads, upscale resolution, extended video lengths, and the option to purchase more credits at a discount, without watermarks. The Unlimited Plan costs $28 per month. The Pro Plan, priced at $58 per month (yearly billed as $696), provides unlimited Lightning generations, infinite credits, early feature access, commercial terms, and more, tailored for professional use.

[PikaArt AI]

Kaiber

About: Kaiber is an AI creative lab made up of Kaiber Studio and the Kaiber App.

The Kaiber App is a creative platform that uses artificial intelligence (AI) to generate videos and images based on user inputs. You can provide text inputs, which are known as "Prompts," to generate unique assets from scratch. Or you can upload your own images, music, videos, and other content to incorporate into your creations. As part of Kaiber's video generation process, they present "Preview Frames" to give you a sense of what your video will look like.

Who made it? Eric Gao, aka oksami, is Kaiber's co-founder and CTO. His journey from music producer and computer engineer to tech entrepreneur has shaped Kaiber into what it is today.

Cost: Kaiber's basic tier starts at $5/month. They have a 7 day free trial period, and a Pro ($10/month) and Artist($25/month) tier for access to additional features and credits.

[Kaiber]

Invideo

About: Invideo's intuitive, AI-powered platform lets you transform any text prompt into a complete, publish-ready video. It effortlessly handles script generation, visual matching, subtitles, voiceovers, and background music, allowing you to create professional-quality videos with zero prior video creation skills.

Beyond simple translation, you can prompt and create content directly in your preferred language or translate your projects into over 50 languages using simple text commands. Additionally, personalize your videos further with AI voice cloning to make your videos sound exactly like you, saving hours of recording time.

The San Francisco-headquartered company bills itself as the easiest way for anyone to create professional-quality videos, using a drag-and-drop interface along with a library of templates and stock photos and videos. The resulting videos can then be optimized for Facebook, Instagram, YouTube and other platforms. [TechCrunch]

Who made it? Sanket Shah is the founder and CEO of InVideo.io. In 2022, Anshul Khandelwal joined InVideo as co-founder and CTO.

Cost: Free trial available (10 mins/wk of AI generation), $20/month for Plu(50 mins/mo of AI generation with 80/mo iStock, 100 GB storage, Unlimited exports, and 2 voice clones), and $48 for Max (200 mins/mo of AI generation, 320/mo iStock, 400 GB storage, Unlimited exports, and 5 voice clones).

[ElevenLabs]

Runway

About: Runway's Gen-3 Alpha is the first of the next generation of foundation models trained by Runway on a new infrastructure built for large-scale multimodal training. It is a major improvement in fidelity, consistency, and motion over Gen-2, and a step towards building General World Models. Despite facing controversy — as many AI model providers have lately — over its data scraping and training practices, the New York City-based startup Runway is plowing ahead with new marquee features for its realistic generative AI video platform.

Trained jointly on videos and images, Gen-3 Alpha will power Runway's Text to Video, Image to Video, and Text to Image tools, existing control modes such as Motion Brush, Advanced Camera Controls, Director Mode as well as upcoming tools for more fine-grained control over structure, style, and motion.

Who made it? Runway was founded in 2018 by Cris Valenzuela, Alejandro Matamala and Anastasis Germanidis. Valenzuela met Matamala and Germanidis while in art school at NYU, where the trio came to realize that they shared a curiosity in AI’s creative potential. From there, Valenzuela, Matamala and Germanidis started building a suite of AI-powered tools geared toward moviemakers, cinematographers and photographers. [TechCrunch]

Safety measures in place: The model automatically attempts to detect and block the creation of video from explicit still imagery or well-known figures such as politicians.

Runway is among several AI companies currently being sued in class action lawsuits by creators who allege that the general practice of AI companies scraping and training on publicly posted material — including copyrighted material — without express permission, authorization, compensation, or consent is a violation of copyright law.

Cost: There is a free "basic" option that comes with 125 one-time credits, and other tiers include Standard ($12/month), Pro ($28/month), and Unlimited ($76/month). A 10-second generated video requires 40 credits through Runway’s pay-to-play and subscription tiers, while a 5-second generation costs 20 credits.

[VentureBeat]

Kling

About: Like OpenAI’s Sora model, Kling is able to generate videos “up to two minutes long with a frame rate of 30fps and video resolution up to 1080p,” the company says on its website. But unlike Sora, which still remains inaccessible to the public four months after OpenAI trialed it, Kling soon started letting people try the model themselves (MIT Technology Review). It can interpret prompts to generate videos that mimic the physical world and create imaginative scenes from text instructions. The Kling AI model “employs a unique 3D Variational Autoencoder (VAE) for face and body reconstruction, enabling detailed expression and limb movement from a single full-body image. This technology is further enhanced by a 3D spatiotemporal joint attention mechanism, which allows the model to handle complex scenes and movements, ensuring that generated content adheres to the laws of physics” (VentureBeat).

Who made it? The South China Morning Post (SCMP) newspaper and website reported that the new AI video model was developed by Kuaishou Technology, the maker of Kuaishou, the number two most popular short video creation and viewing app in China (branded Kwai outside of the country), with 400 million daily active users (DAUs). (That puts Kuaishou/Kwai just behind Douyin, the Chinese version of TikTok from ByteDance, which counts 600 million DAUs) (VentureBeat).

Cost: Basic plan free, with paid tiered subscription levels for Standard, Pro, and Premier.

Sora

About: Sora is an AI model that creates videos from text prompts.

Who made it? It was developed by OpenAI and released in December 2024.

Cost: Sora is available to ChatGPT Plus and ChatGPT Pro subscribers.

Luma Dream Machine

About: The launch of Dream Machine represents a major milestone in the democratization of AI-powered video generation. While rival systems like OpenAI’s Sora and Kuaishou’s Kling have showcased impressive capabilities, they remain accessible only to a select group of partners. In contrast, Luma AI has made Dream Machine available for anyone to experiment with for free on its website, with plans to release APIs and plugins for popular creative software.

This open approach could give Luma AI a head start in building a vibrant community of creators and developers around its platform. By lowering the barriers to entry, Dream Machine has the potential to spark a wave of innovation and creativity as users explore the possibilities of AI-generated video (VentureBeat).

Who made it? Dream Machine was created by San Francisco artificial intelligence startup Luma AI.

Cost: Free options (with watermarks) and paid tiered subscription options.

Audio Generation AI

ElevenLabs

About: ElevenLabs is an AI audio research and deployment company with the mission is to make content universally accessible in any language and any voice. Their research team develops AI audio models that generate realistic, versatile and contextually-aware speech, voices, and sound effects across 32 languages.

The technology is used to voice audiobooks and news articles, animate video game characters, help in film pre-production, localize media in entertainment, create dynamic audio content for social media and advertising, and train medical professionals. According to the company, ElevenLabs has also given back voices to those who have lost them and helped individuals with accessibility needs in their daily lives. "We develop our tools mindful of their impact. AI voices offer a preview into the future of digital interaction and making them safe is our priority. Our goal is to ensure that our products are developed, deployed, and used safely while continuing to drive positive and creative applications."

Cost: You can use ElevenLabs for free and have access to 10k Characters/mo (~10 mins audio). With the free version, you can generate speech in 32 languages using thousands of unique voices, translate content with automatic dubbing, create custom, synthetic voices, generate sound effects, and have API access. You also have the option to subscribe at various levels with increasing services provided at each level.

[ElevenLabs]

Udio

About: Udio is a generative artificial intelligence model that produces music based on simple text prompts. It can generate vocals and instrumentation. Its free beta version was released publicly on April 10, 2024. Users can pay to subscribe monthly or annually to unlock more capabilities such as audio inpainting.

"Udio works like an AI art generator. You can either specify a prompt and let Udio do all the heavy lifting, from music to lyrics, or get as detailed as you want. Did I mention that there are lyrics? And vocals? Yes, Udio will even sing the lyrics for you, in voices that sound incredibly realistic and even emotional. Each “song” is generated as a 30-second clip, but you have the option of extending it with extra time, and even intros and outros. The maximum length of the song is short, though: about 90 seconds, Udio says. [...]

A song can be “remixed,” in that you can try an entirely new approach or keep the same lyrics and let Udio’s AI tweak it for you. But there’s an odd quirk, too: at a certain point a song may be too long to “remix” but not too long to extend. It just means that you’ll end up with multiple extended songs as you try and figure out what works.

Unlike some AI art generators, Udio seems at least vaguely aware of copyright; you can’t ask it to sing Frank Sinatra’s “My Way” (or any song, actually) in the vocals of Taylor Swift. But there doesn’t seem to be any prohibition against taking the “My Way” lyrics and writing a drum-and-bass song around them, if you want." [PC World]

Who made it? Founded in December 2023 by a team of former researchers for Google DeepMind headed by Udio's CEO, David Ding, the program received financial backing from the venture capital firm Andreessen Horowitz and musicians will.i.am and Common, among others. Critics praised its ability to create realistic-sounding vocals while others raised concerns over the possibility that its training data contained copyrighted music.

Cost: Free, Standard, or Pro versions available for subscription.

[Wikipedia]

Suno

About: Suno AI, or simply Suno, is a generative artificial intelligence music creation program designed to generate realistic songs that combine vocals and instrumentation, or are purely instrumental. Suno has been widely available since December 20, 2023, after the launch of a web application and a partnership with Microsoft, which included Suno as a plugin in Microsoft Copilot. The program operates by producing songs based on text prompts provided by users. Suno does not disclose the dataset used to train its artificial intelligence but claims it has been safeguarded against plagiarism and copyright concerns.

Who made it? Suno was created by a team of musicians and artificial intelligence experts based in Cambridge, MA (Michael Shulman, Georg Kucsko, Martin Camacho, and Keenan Freyberg). They are proud alumni of pioneering tech companies like Meta, TikTok and Kensho, where their founding team worked together before starting Suno. [Suno]

As of July 2024, Suno released an iOS version of its app. In doing so, Suno arguably made it easier than ever for regular folks to whip up music on the fly.

In June 2024, a lawsuit, led by the Recording Industry Association of America, was filed against Suno and Udio alleging widespread infringement of copyrighted sound recordings. They argue that the company’s tool can only generate tunes because it chewed on untold numbers of their copyrighted songs to learn how. (Suno, for its part, has said its technology is “transformative.”) The lawsuit sought to bar the companies from training on copyrighted music, as well as damages of up to $150,000 per work from infringements that have already taken place.

[Washington Post, Wikipedia]

Research Assistance AI

Perplexity

About: Perplexity AI is an AI chatbot-powered research and conversational search engine that answers queries using natural language predictive text. Launched in 2022, Perplexity generates answers using sources from the web and cites links within the text response.

Who made it? Perplexity was founded in 2022 by Aravind Srinivas, Denis Yarats, Johnny Ho and Andy Konwinski, engineers with backgrounds in back-end systems, AI and machine learning. Yarats, the CTO, was an AI research scientist at Meta, while Srinivas, the CEO, worked at OpenAI as an AI researcher. Ho, the Chief Strategy Officer, worked as an engineer at Quora, then as a quantitative trader on Wall Street, and Konwinski was among the founding team at Databricks.

Cost: Perplexity works on a "freemium" model; the free product uses the company's standalone large language model (LLM) that incorporates natural language processing (NLP) capabilities, while the paid version Perplexity Pro has access to GPT-4, Claude 3.5, Mistral Large, Llama 3 and an Experimental Perplexity Model.

Concerns: In June 2024, Forbes publicly criticized Perplexity for their use of Forbes' content. According to Forbes, Perplexity published a story which was largely copied from a proprietary Forbes article, without mentioning or prominently citing Forbes. In response, Srinivas said that the feature had some "rough edges" and accepted feedback, but maintained that Perplexity only "aggregates" rather than plagiarizes information.

Separate investigations by the magazine Wired and web developer Robb Knight found that Perplexity does not respect the robots.txt standard, which allows websites to stop web crawlers from scraping content, reportedly despite Perplexity claiming the opposite. Perplexity also lists the IP address ranges and user agent strings of their web crawlers publicly, but according to Wired and Robb Knight, they use undisclosed IP addresses and spoofed user agent strings when ignoring robots.txt.

[Wikipedia]

Elicit

About: Elicit uses language models to extract data from and summarize research papers. It can "automate time-consuming research tasks like summarizing papers, extracting data, and synthesizing your findings."

"Andreas Stuhlmüller (founder) claims Elicit has taken steps to ensure its AI is more reliable than many of the purpose-built platforms.[...] Elicit breaks down into “human-understandable” pieces the complex tasks that its models perform. This enables Elicit to know, for instance, how often different models are making things up when they generate summaries, and subsequently help users identify what answers to check — and when. Elicit also attempts to compute a scientific paper’s overall “trustworthiness,” taking into account factors like whether the trials conducted in the research were controlled or randomized, the source of the funding and potential conflicts and the size of the trials (TechCrunch).

Who made it? Elicit is a for-profit venture spun out from Ought, a nonprofit research foundation launched in 2017 by Andreas Stuhlmüller, a former researcher at Stanford’s computation and cognition lab. Elicit’s other co-founder, Jungwon Byun, joined the startup in 2019 after leading growth at online lending firm Upstart (TechCrunch). Read more about their team on the Elicit website.

Cost: Free "casual exploration" model (unlimited search across more than 125 million papers, unlimited summaries of 4 papers at once, unlimited chat with 4 papers at once, extract data from 10 papers per month, view sources for answers, import from Zotero) - Plus (for deeper research) is $10/month and Pro (for systematic reviews) is $42/month.

ResearchRabbit

About: ResearchRabbit is a free online “citation-based literature mapping tool." It is a visual literature review software mapping tool that is similar to Spotify. The tool connects your research interests to related articles and authors.

The tool allows users to create collections based on their research focus. You can start by looking up a title, DOI, PMID, or keywords. When you find a paper in the results, you can add it to a collection. You can expand the details of the paper by displaying the paper's abstract and comments you've written. From there, selecting a paper will allow you to explore similar papers, references in your original paper, and citations in your original paper. You can also explore additional works by the paper's author and suggested authors, as well as linked content (UND Libraries).

Who made it? It was developed in 2021 by a team of three in Seattle.

Cost: Free.

Notebook LM

About: Upload PDFs, websites, YouTube videos, audio files, Google Docs, or Google Slides, and NotebookLM will summarize them and make interesting connections between topics, all powered by Google's Gemini 2.0’s multimodal understanding capabilities. With all of your sources in place, NotebookLM gets to work and becomes a personalized AI expert in the information that matters most to you. NotebookLM provides clear citations for its work, showing you the exact quotes from your sources. The Audio Overview feature can turn your sources into engaging “Deep Dive” discussions with one click (Notebook LM).

Who made it? Popular science author Steven Johnson and product manager Raiza Martin for Google Labs.

Cost: In December 2024, Google launched the paid version of NotebookLM, called NotebookLM Plus, to enterprises and paid Gemini subscribers.