Editor’s Note: This blogpost is a summarised repost of the original content published on 25 April 2025, by Molly Mackinlay from FilOz. Founded in 2024, FilOz is a team of 14 protocol researchers, engineers, TPMs, and community engineers focused on securing, upgrading, and expanding the Filecoin network.


Introduction

Nearly five years after its launch, Filecoin – now the world’s largest decentralized storage network – serves a diverse range of customer segments where adoption and traction are already taking hold. A strong signal of product-market fit is paying demand: users who see enough value to pay for the service.

This blogpost offers a comprehensive look at Filecoin’s Ideal Customer Profiles (ICPs), analyzes current adoption trends, identifies high-opportunity areas based on payment signals and data volume, and outlines key strategies to drive future client success.


Identifying and Targeting Ideal Customer Profiles (ICPs)

At the heart of this demand push is a clear focus on Ideal Customer Profiles (ICPs) – specific categories of users the network is targeting for adoption. These ICPs represent use cases where Filecoin’s decentralized storage offers immediate value. Paying demand is one of the strongest indicators of product-market fit, and Filecoin is seeing traction in several key segments:

Four Core Vertical Markets:

  1. Large-Scale Data Clients (Primarily Web2):
    Traditionally focused on archival storage, now evolving toward faster reads/writes due to AI workloads. Think multi-exabyte archives with 24-hour retrieval tolerances.
  2. Web2 Object Storage (e.g., AWS S3 alternative):
    Demands fast access, pricing competitiveness, Snowflake integrations, and strict controls over data locality—especially amid rising geopolitical tensions.
  3. Web3 Object Storage:
    Adoption from decentralized websites, NFTs, social apps, and AI agents. Nascent potential includes data DAOs and decentralized AI use cases.
  4. Web3 Chain Storage:
    Early traction from chains like Solana and Cardano, with data from Ethereum L2s scaling up to terabytes and petabytes. Strong potential, but more robust on-ramps are needed.
Source: State of Client Adoption and ICPs

It’s also worth noting that beyond these four verticals, DePINs – particularly those collecting large volumes of consumer data – are a key focus for 2025. In addition, edge computing and AI are emerging as high-potential sectors, driven by their rapidly growing, data-intensive workloads.


Mapping Filecoin’s Strengths to Its Most Promising Use Cases

To effectively serve the right customer segments, it’s essential to ask: what core strengths does the Filecoin network offer – and how can those be applied to the ideal customer profiles that stand to benefit most?

The following slide summarizes Filecoin’s strengths and the lessons learned along the way:

Source: State of Client Adoption and ICPs

Building on Filecoin’s core strengths, the next slide highlights the key Ideal Customer Profiles (ICPs) driving real-world adoption today. It identifies which client segments are actively paying, which have rapidly growing data volumes, and where early adoption is gaining momentum:

Source: State of Client Adoption and ICPs

This alignment between strengths and needs is essential for driving meaningful adoption and long-term growth.


Scaling Client Success & Ideal Customer Profiles (ICPs)

While Filecoin has attracted paying users across various sectors, scaling from a few customers to a broad, thriving user base remains a key challenge. A major bottleneck is the lack of strong on-ramps – specialized Layer 2 solutions – that serve high-volume, fast-growing ICPs like Web3 chain storage. This gap presents a clear opportunity for builders to develop targeted solutions.

Scaling client success ties directly to Filecoin’s 2025 core KPIs:

  • Revenue from on-chain paid storage deals: The main indicator of product-market fit. While payments are happening off-chain, on-chain revenue is near zero. Bridging this gap with Proof of Data Possession (PDP) and Filecoin Web Services (FWS) is critical.
  • Growing a satisfied client base: Paying clients exist, but better transparency and dashboards are needed to track success.
  • Increasing service activity and value accrual: Tools like the USDFC stablecoin and the FIP-100 fee mechanism are in place, but accelerating on-chain payments and fee flows is essential.

To achieve this growth, several strategic steps are required, as outlined below:

Source: State of Client Adoption and ICPs

Conclusion

Filecoin is actively transitioning from a capacity-focused infrastructure to a user-driven ecosystem. With a sharpened focus on ICPs, a maturing network of on-ramps, and powerful protocol innovations, it’s well-positioned to scale demand and adoption. Key challenges – especially around on-chain revenue and retrievability – remain, but the foundation for sustained growth is rapidly taking shape.

To learn more about the State of Client Adoption and ICPs for Filecoin, explore the talk from FDS-6:

Disclaimer: This information is for informational purposes only and is not intended to constitute investment, financial, legal, or other advice. This information is not an endorsement, offer, or recommendation to use any particular service, product, or application.

Editor’s Note: This blogpost is a summarised repost of the original content published on 25 April 2025, by HQ Han from Ansa Research. Ansa Research is a research firm focused on distributed infrastructure. The firm covers digital networks aiming to rebuild how internet infrastructure operates.


Executive Summary

Filecoin has now entered the next phase of development, focusing on users, demand generation and adoption. 

In this regard, Filecoin is not alone – while DePIN networks have proven very successful at bootstrapping supply, the sector’s focus has likewise turned towards demand.

There has been some exciting progress on Filecoin’s demand story: 

  1. Growth in the underlying demand for Filecoin’s services 
    • We are seeing data clients in both Web2 and Web3 paying for storage on Filecoin. In some cases, Filecoin has been chosen alongside or over Web2 providers.
    • The storage provider landscape reflects this – whilst capacity and the number of SPs have dropped, quality and data utilization on the network have increased, showing a shift in network resources towards demand.
  2. There is a pathway to scale
    • Filecoin is still expanding its core services to cater to demand with important protocol level upgrades – F3, PDP, and Filecoin Web Services (FWS) 
    • New Filecoin-powered storage solutions have come to market (Akave, Storacha, Recall, etc.) – all of which are already charging for deals within their target markets
    • DeFi continues to scale on FVM – notably, FIL-backed stablecoins coming online 
    • Storing more data on the network and creating multi-service APIs creates the building blocks to bring compute needs to the data 

Filecoin is at an inflection point as its services mature to meet the demands of AI, enterprise and nation-state focus on data locality, and global cost-cutting driving organizations away from costly Web2 cloud providers.

This blog post examines Filecoin’s adoption, including milestones and use cases, its scaling path via infrastructure, tooling, and coordination, and cryptoeconomics, covering incentives, token dynamics, and network sustainability.


Adoption & Client Demand

Over the past year, Filecoin has seen a rise in paying customers, with projections suggesting over 1 EiB of paid storage deals could close in 2025. This would raise the network’s utilization from its current 29% to nearly 100% with fully paid usage. 

The number and quality of leads are growing, driven by efforts from teams like Ansa Research and the Filecoin Foundation, who are actively sourcing paid deals for on-ramps and Storage Providers. Business development has also shifted focus toward high-potential DePIN categories, particularly those collecting large volumes of consumer data as well as Web2 clients with archival and hot/cold storage needs.

Recent notable paid deals include:

  • Heurist / Akave (AI)
  • 375ai / Akave (DePIN)
  • Humanode / Storacha (Identity)
  • Gaianet / Storacha (AI)
  • FanTV / Lighthouse (Video)
  • Intuizi / Akave (Web2 SaaS)
  • The Defiant / Akave (Media)
  • Cornell University astrophysics simulation data / Ramo (Web2 R&D Data)
Source: State of Filecoin to Investors

Utilization and Go-to-Market Momentum

Filecoin’s network utilization has risen to around 29%, signaling increased demand. A growing number of large-scale clients (storing over 1,000 TiB), likely in the enterprise and/or long-term archival space, demonstrates that efforts by on-ramps and Storage Providers (SPs) are on the right path.

After earlier declines as the network shifted its focus from generating supply to generating demand, average daily new deals have recently increased by over 10% from Q3’24 to Q4’24.

Source: State of Filecoin to Investors

Storage Providers Evolving Beyond Capacity

The Storage Provider (SP) landscape is shifting from a focus on raw capacity to delivering useful, client-driven storage. In its early phase, the network prioritized onboarding as much storage as possible, often without regard for actual usage or retrievability. Now, as Filecoin emphasizes demand generation and adoption, SPs are adapting by pursuing paid deals and ensuring robust data retrieval. This marks a clear move from quantity to quality, with incentives increasingly aligned to real-world client needs – reflected in a 388% year-over-year surge in the number of SPs achieving successful retrievals.

Source: State of Filecoin to Investors

Path To Scale

The network’s path to scale is envisioned along three main lines: 

  • Expanding core protocol capabilities with the introduction of Proof of Data Possession (PDP), Fast Finality (F3), and Filecoin Web Services (FWS)
  • Launching new on-ramps and Layer 2 solutions to support vertical-specific adoption
  • Improving economic efficiency through decentralized finance (DeFi)

1. Expanding Core Protocol Services

Filecoin is rolling out key protocol updates, primarily:

  1. Proof of Data Possession (PDP) – shipped as of 8 May 2025. Storage Providers can now participate in PDP SPX, a short-term initiative to onboard select providers to test, validate, and demonstrate PDP.
  2. Fast Finality (F3) – arrived ahead of schedule, going live on Filecoin mainnet in April 2025 and bringing a roughly 100x improvement in transaction finality times
  3. Filecoin Web Services (FWS) – a composable service marketplace for offering multi-service deals
Source: State of Filecoin to Investors

Vision for FWS Architecture

Introduced last year as part of Filecoin’s broader vision, Filecoin Web Services (FWS) marks a major step toward expanding the network’s capabilities beyond storage. At its core, FWS is a composable service marketplace that enables users to bundle multiple services—such as cold and hot storage, retrieval, compute, and encryption—into a single deal.

FWS aims to offer a more flexible and integrated alternative to traditional Web2 cloud platforms. It also opens the door for other DePIN networks to offer and resell their services within the Filecoin ecosystem. With integrated payments and escrow, FWS supports the creation of customizable service combinations, enhancing utility for both consumers and enterprises.

Source: State of Filecoin to Investors

2. New On-Ramps and L2s

A key part of Filecoin’s scaling strategy is the emergence of new storage on-ramps – functioning like Layer 2s – that are launching mainnets and targeting specific verticals. These startups tailor Filecoin’s storage stack to meet the needs of niche markets, helping to establish beachhead use cases. Notable examples include:

  • Akave: Focused on Web2 enterprises, Akave offers a hot storage layer on top of Filecoin, supporting archival, hot/cold storage, and S3 integrations. They’ve also integrated with Snowflake.
  • Storacha: Targeting Web3 applications in social, AI, and identity, they specialize in storage solutions for decentralized platforms.
  • Recall: Aimed at the AI sector, specifically towards AI agent storage and data processing.
Source: State of Filecoin to Investors

3. Scaling DeFi

Filecoin’s DeFi ecosystem is growing, playing a key role in improving economic efficiency across the network. A major focus is on stablecoins, which help retain economic activity within the ecosystem. Secured Finance has introduced USDFC, a FIL-backed stablecoin that allows FIL holders and Storage Providers to use their tokens as collateral instead of selling them—similar to MakerDAO on Ethereum. 

USDFC is now live, as recently announced at FDS-6 in Toronto.

Source: State of Filecoin to Investors

Cryptoecon Update

The central forecast: FIL’s circulating supply growth is expected to slow and may turn negative – i.e., deflationary – by late 2026. This shift stems from a combination of reduced token issuance and increased demand sinks that lock or remove FIL from circulation.

Supply-Side Pressures Easing

Several key developments are reducing new FIL issuance:

  • Vesting Completion: Token vesting from early stakeholders, including Protocol Labs and the Filecoin Foundation, ends in October 2026 – removing a major source of new tokens.
  • Decreasing Block Rewards: FIL block rewards follow a declining emission schedule by design.
  • FIL-Backed Stablecoins: Stablecoins like USDFC by Secured Finance allow FIL holders to use tokens as collateral instead of selling, keeping more value within the network and reducing sell pressure.

Demand-Side Sinks Growing

At the same time, FIL demand is increasing through new utility and locking mechanisms:

  • Rising Collateral Requirements: Storage Provider collateral is set to increase in 2025, partly due to a fix under FIP-81 that enhances locking behavior.
  • Increased Protocol Revenue: FIP-100 is projected to boost FIL-denominated revenue, much of which is burned or otherwise removed from circulation.

Together, these trends suggest a pivotal moment in Filecoin’s economic evolution: a potential transition to a deflationary supply model, signaling a tighter and potentially more valuable FIL economy.
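To make the direction of these forces concrete, below is a deliberately simplified toy model of circulating supply – a minimal sketch, not the mechafil simulation referenced in the disclaimer that follows. Every number (issuance, burn, locking, and the year at which they cross) is an illustrative placeholder, not a forecast; the point is only that supply growth flips sign once burn plus net new locking outpaces issuance.

```typescript
// Toy circulating-supply model: liquid supply changes by issuance minus
// burn minus net new locking. All figures are illustrative placeholders.
interface PeriodFlows {
  year: number;
  issuance: number;     // new FIL entering circulation (rewards + vesting)
  burn: number;         // FIL destroyed (gas burns, penalties, fee burns)
  netNewLocked: number; // additional FIL locked as SP collateral
}

// Net change in circulating supply for one period.
const netChange = (f: PeriodFlows): number => f.issuance - f.burn - f.netNewLocked;

// Placeholder trajectory: issuance tapers (emission decay, vesting ending)
// while burn and locking grow; the sign flip is the "deflationary" turn.
const scenario: PeriodFlows[] = [
  { year: 2025, issuance: 24e6, burn: 8e6, netNewLocked: 6e6 },
  { year: 2026, issuance: 16e6, burn: 12e6, netNewLocked: 8e6 },
  { year: 2027, issuance: 10e6, burn: 16e6, netNewLocked: 10e6 },
];

for (const f of scenario) {
  const delta = netChange(f);
  console.log(f.year, delta >= 0 ? "inflationary" : "deflationary", delta);
  // 2025 inflationary +10M; 2026 deflationary -4M; 2027 deflationary -16M
}
```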

[Disclaimer: Circulating supply analysis is based on a 3rd party model: https://mechafil-jax-web-levers.streamlit.app/ 
These models are based on many assumptions, and should not be relied upon as the source of truth. There are many factors that can and will affect the actual numbers. Simulations should not be relied upon and are for illustrative purposes only. DYOR and adjust the model yourself, or build your own models in Dune.]

Source: State of Filecoin to Investors

Filecoin Data (where to find it and DYOR)

For those looking to dive deeper into the Filecoin ecosystem, Ansa Research has compiled a curated directory of key metrics and data sources. These resources provide essential insights into network health, development trends, and adoption signals – making them useful for both regular monitoring and deeper research.

Whether you’re tracking protocol upgrades, storage deals, or adoption patterns, this data directory is a valuable starting point for your own analysis.

👉 Access the Filecoin Data Directory here


In Conclusion

Filecoin is entering a new phase centered on demand, adoption, and long-term sustainability. With supply successfully bootstrapped, focus has shifted to real usage – evidenced by rising paid storage deals, potentially exceeding 1 EiB in 2025, and a shift toward higher-quality, retrievable data.

Key upgrades like Proof of Data Possession (PDP), faster finality (F3), and new Layer 2 solutions are unlocking capabilities across data and DeFi, including FIL-backed stablecoins that help retain value within the network.

At the same time, token issuance is set to slow, with vesting ending in late 2026 and block rewards declining – while demand sinks like collateral locking (FIP-81) and protocol revenue (FIP-100) increase. Together, these trends suggest a potential shift to a deflationary FIL supply and a more mature, sustainable network economy.

To listen to the entire talk by HQ Han (Ansa Research) at FDS-6, watch here on YouTube:

Disclaimer: This information is for informational purposes only and is not intended to constitute investment, financial, legal, or other advice. This information is not an endorsement, offer, or recommendation to use any particular service, product, or application.

Editor’s Note: This blogpost is a summarised repost of the original content published on 26 March 2025, by Luca from FilOz. Founded in 2024, FilOz is a team of 14 protocol researchers, engineers, TPMs, and community engineers focused on securing, upgrading, and expanding the Filecoin network.


Introduction

As Web3 applications mature, fast and reliable access to data becomes just as important as storing it. Whether it’s video streaming, serving assets for dApps, or powering AI agents with on-demand data, retrievability is a foundational piece of user experience.

Unlike traditional cloud providers, where data retrieval is instant and guaranteed by a centralized service (often aided by CDNs), retrieval of data in Web3 introduces new challenges – spanning network performance, data redundancy, storage provider reliability, and incentive alignment. Filecoin has built the world’s largest decentralized storage network, and how to tackle retrievals on that foundation deserves exploration. How should Filecoin evolve its retrieval capabilities in a decentralized internet? What is the best approach for its users?

This guide aims to unpack the state of retrievability on Filecoin, exploring where we are today and where improvements are needed. We aim to cover:

  • An overview of retrievability on Filecoin, including key strategies, challenges, and improvements
  • Retrieval strategies and protocols, their guarantees, and their limitations
  • Payment models and how SPs and clients can select each other
  • Potential protocol-level improvements to enhance retrievability

Retrievability on Filecoin

Filecoin enables decentralized storage, allowing clients to store data with Storage Providers (SPs) and retrieve it on demand. Unlike storage, which is provable through Proof of Replication and Proof of Spacetime, retrieval is a separate process that isn’t inherently provable; different protocols and strategies address this challenge in different ways.
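For intuition on why storage is provable while retrieval is harder, consider the challenge-response shape that storage proofs share. The sketch below is a minimal hash-based toy, assuming a client that precomputes a few challenge answers before uploading – it is not Filecoin’s actual PoRep/PoSt (or PDP) construction, which uses cryptographic proofs over sealed sectors – but it shows why possession can be spot-checked while serving the data back remains a separate question.

```typescript
import { createHash } from "node:crypto";

// Toy possession check: the client precomputes answers to random challenges;
// later, only an SP that still holds the data can answer them. Illustrative
// only - real Filecoin proofs are far more sophisticated.
const sha256 = (bytes: Buffer): string =>
  createHash("sha256").update(bytes).digest("hex");

// Binding a nonce forces the responder to re-hash the full data per challenge.
const makeChallenge = (data: Buffer, nonce: string): string =>
  sha256(Buffer.concat([Buffer.from(nonce), data]));

// Client side: remember (nonce, expected answer) pairs before uploading.
const data = Buffer.from("dataset contents...");
const challenges = ["n1", "n2", "n3"].map((nonce) => ({
  nonce,
  expected: makeChallenge(data, nonce),
}));

// SP side: can only answer correctly while it still holds the data.
const spRespond = (stored: Buffer, nonce: string): string =>
  makeChallenge(stored, nonce);

// Possession is spot-checkable; whether the SP will actually *serve* the
// data quickly (retrieval) is a separate, unproven question.
const { nonce, expected } = challenges[0];
console.log(spRespond(data, nonce) === expected); // true while the data is held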

Retrievability on Filecoin is influenced by factors such as:

  • Network and SP performance
  • Data availability
  • Retrieval optimization protocols

There is no one-size-fits-all solution for retrievability. The best strategy depends on a client’s needs, including retrieval speed, reliability, and cost constraints. Clients can choose simple retrieval from a single SP for non-critical data or more complex solutions like redundancy, SLAs, or off-chain backups for mission-critical files. Advanced protocols, like Spark or CDN integration, offer higher performance but come with added costs or complexity.
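One hypothetical way to encode that decision is a small helper that maps a client’s requirements to one of the strategies above. The names and thresholds below are illustrative assumptions, not a real Filecoin API:

```typescript
// Hypothetical decision helper mapping client requirements to a retrieval
// strategy from the options discussed above. Thresholds are illustrative.
interface RetrievalRequirements {
  maxLatencyMs: number;     // how fast reads must be
  missionCritical: boolean; // can the client tolerate a failed retrieval?
  budget: "low" | "medium" | "high";
}

type Strategy =
  | "single-sp"          // simple retrieval from one SP (non-critical data)
  | "multi-sp-redundant" // replicas across several SPs
  | "sla-backed"         // contractual guarantees with penalties
  | "cdn-cached";        // CDN-style hot layer in front of Filecoin

function chooseStrategy(req: RetrievalRequirements): Strategy {
  if (req.maxLatencyMs < 500 && req.budget !== "low") return "cdn-cached";
  if (req.missionCritical) {
    return req.budget === "high" ? "sla-backed" : "multi-sp-redundant";
  }
  return "single-sp";
}

console.log(chooseStrategy({ maxLatencyMs: 200, missionCritical: true, budget: "high" }));
// -> "cdn-cached": fast reads dominate here; redundancy can be layered on top
```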

Trust plays a crucial role in decentralized storage networks like Filecoin. Without mechanisms that establish trust, retrieval failures or delays can occur, especially under payment strategies like upfront payment. A CDN-like solution (e.g., retriev.org) could address these trust challenges by:

  • Providing monitoring services to ensure SPs meet obligations
  • Offering arbitration to resolve disputes and penalize non-performing SPs
  • Ensuring retrieval promises are backed by financial incentives and penalties

Filecoin offers various retrievability options, from redundancy models to retrieval networks and off-chain solutions.

For the full breakdown on Retrievability Options on Filecoin, read here.

Key Metrics

When evaluating retrievability, clients need to consider several performance metrics that measure different aspects of the data retrieval process. These metrics ensure that data is not only stored but also accessible and retrievable efficiently when required. Key metrics include:

  • Availability Metrics: Measure the likelihood of data being accessible and quickly recoverable when issues arise. High availability and redundancy ensure retrieval even if some systems or replicas fail.
  • Performance Metrics: Assess retrieval speed and responsiveness, impacting user experience. Factors like throughput and latency are influenced by network bandwidth and data availability.
  • Reliability Metrics: Reflect the consistency and stability of data retrieval, including success rate, error rate, and data integrity. High uptime ensures high availability, while a low error rate guarantees data accuracy and successful retrieval attempts.
  • Cost-related Metrics: Help balance performance with cost efficiency, particularly in managing retrieval speed, bandwidth usage, and associated costs.
  • Quality Metrics: These metrics measure the overall quality of the retrieval process, ensuring a satisfactory user experience.

By grouping the metrics into these categories, clients can evaluate retrievability from multiple dimensions, ensuring efficient, reliable, and cost-effective data access.
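One hypothetical way to operationalize these categories is a per-provider scorecard. The fields and weights below simply mirror the groupings above and are illustrative assumptions, not a standardized Filecoin schema:

```typescript
// Illustrative retrievability scorecard mirroring the metric categories
// above. Field names and weights are hypothetical, not a standard schema.
interface RetrievabilityScorecard {
  availability: { uptimePct: number; replicaCount: number };
  performance: { p50LatencyMs: number; throughputMBps: number };
  reliability: { successRatePct: number; integrityFailures: number };
  cost: { pricePerGiBRetrieved: number; egressFees: number };
  quality: { clientSatisfaction: number }; // e.g., a 0-5 survey score
}

// A naive weighted score for comparing providers; real weights would depend
// on the client's use case (archival vs. hot serving).
function score(s: RetrievabilityScorecard): number {
  return (
    0.3 * s.availability.uptimePct +
    0.25 * s.reliability.successRatePct +
    0.25 * (100 - Math.min(s.performance.p50LatencyMs / 10, 100)) +
    0.2 * (100 - Math.min(s.cost.pricePerGiBRetrieved * 100, 100))
  );
}

const exampleSP: RetrievabilityScorecard = {
  availability: { uptimePct: 99.9, replicaCount: 3 },
  performance: { p50LatencyMs: 250, throughputMBps: 80 },
  reliability: { successRatePct: 98, integrityFailures: 0 },
  cost: { pricePerGiBRetrieved: 0.01, egressFees: 0 },
  quality: { clientSatisfaction: 4.5 },
};
console.log(score(exampleSP).toFixed(1)); // ~93 on this rough 0-100 scale
```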

For the full breakdown on key metrics, read here.

Payment Models

Retrievability on Filecoin combines payment options (how value is transferred) with payment strategies (when and under what terms payments occur). This modular structure allows clients and Storage Providers (SPs) to tailor agreements based on performance, trust, and cost considerations. 

Payment Options – methods used to pay for data retrieval:

  • Off-Chain Payments: Peer-to-peer arrangements that are typically fast and gas-free but rely on trust between parties.
  • On-Chain Payments (FIL): The native token used for transparent, trustless retrieval settlements.
  • On-Chain Payments (Stablecoins): e.g., USDFC, offering price stability and compatibility with broader ecosystems.
  • On-Chain Payments (ERC-20 Tokens): Enables use of other tokens for interoperability with DeFi and cross-chain networks.

Payment Strategies – structures for when and how payments are made (a small illustrative model follows this list):

  • Upfront Payment: Clients pay the full cost of retrieval in advance.
  • Pay-to-Retrieve: Payments are made on a per-retrieval basis, similar to a metered model.
  • Periodical Payment: Recurring payments that grant ongoing or unlimited retrieval access.
  • Retrievability Tickets: Prepaid, redeemable claims for future data retrievals.
  • Hybrid Models: Flexible approaches that combine aspects of multiple strategies to optimize for specific needs.
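To show how the two layers compose – an option is the rail, a strategy is the schedule – here is a minimal sketch. The type and function names are hypothetical, not Filecoin protocol definitions:

```typescript
// An option is *how* value moves; a strategy is *when* it moves. A retrieval
// agreement pairs one of each. Names are illustrative, not protocol types.
type PaymentOption =
  | { kind: "off-chain" }                                  // trust-based, gas-free
  | { kind: "on-chain"; token: "FIL" | "USDFC" | string }; // or any ERC-20 symbol

type PaymentStrategy =
  | { kind: "upfront"; total: number }
  | { kind: "pay-to-retrieve"; pricePerRetrieval: number }
  | { kind: "periodical"; pricePerPeriod: number }
  | { kind: "tickets"; ticketPrice: number; ticketsBought: number };
// (Hybrid models would combine several of these.)

interface RetrievalAgreement {
  option: PaymentOption;
  strategy: PaymentStrategy;
}

// What the client owes after `retrievals` retrievals in one billing period.
function amountDue(a: RetrievalAgreement, retrievals: number): number {
  switch (a.strategy.kind) {
    case "upfront":
      return 0; // already settled in advance
    case "pay-to-retrieve":
      return retrievals * a.strategy.pricePerRetrieval;
    case "periodical":
      return a.strategy.pricePerPeriod;
    case "tickets": {
      const uncovered = Math.max(0, retrievals - a.strategy.ticketsBought);
      return uncovered * a.strategy.ticketPrice; // top up beyond prepaid tickets
    }
  }
}

const deal: RetrievalAgreement = {
  option: { kind: "on-chain", token: "USDFC" },
  strategy: { kind: "pay-to-retrieve", pricePerRetrieval: 0.002 },
};
console.log(amountDue(deal, 1_000)); // 2 (USDFC) for 1,000 metered retrievals
```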

Retrieval Services: SP Selection & Client Selection Strategies

Selecting Storage Providers (SPs) for retrievability in Filecoin requires balancing control, cost, reliability, and trust. This process involves two key aspects: the deal-making process and the selection mechanism. Just as clients must carefully choose SPs, SPs also evaluate which client retrieval requests to fulfill, using the same two components – deal-making and selection mechanisms.

Deal-Making Process

The deal-making process determines how clients and Storage Providers (SPs) establish retrieval agreements, balancing control, efficiency, and risk. It involves two key approaches:

  • Direct Negotiation: Clients and SPs engage directly to define retrieval terms, including cost, performance guarantees, and service conditions. This method offers full control but requires manual effort and carries risks such as extended negotiations, misunderstandings, and potential SP unreliability.
  • Automated or Delegated Deal-Making: Intermediaries or automated systems—such as content delivery networks (CDNs), smart contracts, or auction systems—facilitate the process. This reduces manual effort, optimizes terms based on real-time data, and enables market-driven pricing. However, it can introduce additional costs, reduced control, and reliance on third-party mechanisms for enforcement and dispute resolution.

SP Selection Mechanisms (POV of a Client)

Once the deal-making process is determined, the actual selection mechanism defines how the SPs are chosen. These can range from reputation-based systems to auction-based or automated selections. 


For the full breakdown on SP Selection Mechanism, read here.
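As one concrete example of the reputation-based end of that spectrum, the sketch below ranks candidate SPs by observed success rate, latency, and price. The utility weights are arbitrary assumptions for illustration – this is not how Spark or any particular system actually scores providers:

```typescript
// Illustrative reputation-based SP selection: rank candidates by observed
// retrieval success rate, latency, and price. Weights are arbitrary.
interface SPCandidate {
  id: string;
  successRate: number; // 0..1, e.g., from past retrieval spot-checks
  avgLatencyMs: number;
  pricePerGiB: number;
}

function rankSPs(candidates: SPCandidate[]): SPCandidate[] {
  const utility = (sp: SPCandidate) =>
    0.6 * sp.successRate -
    0.2 * (sp.avgLatencyMs / 1000) - // penalize slow responders
    0.2 * sp.pricePerGiB;            // penalize expensive ones
  return [...candidates].sort((a, b) => utility(b) - utility(a));
}

const best = rankSPs([
  { id: "f01234", successRate: 0.99, avgLatencyMs: 300, pricePerGiB: 0.02 },
  { id: "f05678", successRate: 0.80, avgLatencyMs: 120, pricePerGiB: 0.01 },
])[0];
console.log(best.id); // "f01234": reliability outweighs the price difference
```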

Client Selection Mechanisms (POV of a Storage Provider)

Once the deal-making process is determined, the client selection mechanism helps the SP narrow down which clients’ retrieval requests it wishes to fulfill.

For the full breakdown on Client Selection Mechanism, read here.

Next Steps

Retrievability guarantees for data stored on the Filecoin Network are essential for its long-term success and sustainability. However, we also recognize that each user may have distinct needs, preferences, and requirements when it comes to data accessibility and security guarantees.

To address this, we foresee a modular approach that allows users to select from a diverse range of services and combine them in a way that meets their specific retrievability and reliability goals. This flexibility will enable users to tailor their storage solutions to their unique use cases, ensuring both customization and scalability.

A promising path forward for enhancing retrievability guarantees on the Filecoin Network involves integrating advanced protocols and tools. By leveraging technologies and protocols like CDN Gateways, reputation systems, smart contract-powered storage solutions and incentives, we can create a more robust and reliable infrastructure.

These combined innovations will not only improve data accessibility and security but will also foster the overall growth and resilience of the Filecoin ecosystem.

For more pieces from FilOz, check out their Medium page here.

To stay updated on the latest in the Filecoin ecosystem, follow the @Filecointldr handle or join us on Discord.

Disclaimer: This information is for informational purposes only and is not intended to constitute investment, financial, legal, or other advice. This information is not an endorsement, offer, or recommendation to use any particular service, product, or application.

2024 has been a pivotal year for Filecoin, with significant progress in the Filecoin Virtual Machine (FVM), Storage, Retrievals and Compute. In this blogpost, we’ll recap the key milestones of 2024 and take a look at the major growth drivers shaping Filecoin’s path into 2025.


Source: Charting Success for Filecoin, 2024

2024 Retrospective

In our earlier blogpost ‘Charting Success for Filecoin 2024’, we mapped out three key priorities for the ecosystem in 2024:

  1. Accelerating Paid Deals: Boosting paid services (storage, retrieval, compute) on Filecoin to generate cashflow for service providers. This helps to support more sustainable hardware funding beyond token incentives.
  2. Growing On-Chain Activity: Increasing activity through programmable services, DeFi, and new use cases.
  3. Becoming Indispensable: Establishing Filecoin as an integral component of other projects and businesses. 

These priorities are not mutually exclusive – they layer onto each other and are all signs that the Filecoin ecosystem is growing increasingly valuable. 

So how did we fare across these priorities in 2024?

1. Accelerating Paid Deals

Paid Deals is an ecosystem-level metric that reflects the volume of paid services within the Filecoin network. FilecoinTLDR is currently tracking this metric here.

In 2024, Filecoin made significant strides in accelerating paid deals by reducing friction for businesses entering the ecosystem, with key advancements like the development of Proof of Data Possession (PDP) and the emergence of Layer 2 solutions.

  • Enabling Efficient Hot Storage with PDP
    • Projected for Q1 2025, Proof of Data Possession (PDP) introduces a new proof primitive to the Filecoin network, marking the first major proof development since Proof of Replication (PoRep) and Proof of Spacetime (PoSt). Unlike PoRep, which excels at cold storage through sealed sectors, PDP is designed for “hot data” – data that needs fast and frequent retrieval.
    • This new proof type enables cost-effective “cache” storage on Filecoin without sealing and unsealing, allowing rapid data onboarding and retrieval. PDP opens the door for a new class of storage providers focused on hot storage and fast retrievals, benefiting on-ramps like Basin, Akave, and Storacha.
  • Scaling Filecoin with L2s
    • In 2024, we saw a rise in Layer 2 solutions built on top of Filecoin (we also covered this in our earlier blogpost “State of L2s on Filecoin”). L2s like Basin, Akave and Storacha enable both horizontal and vertical scaling with secure, customizable subnets. These L2s enhance Filecoin by unlocking new use cases – including managing data-intensive workloads, supporting AI and unstructured data, and powering gaming and privacy-focused applications – all of which create more opportunities for paid deals.

2. Growing On-Chain Activity

Filecoin has made notable progress in accelerating on-chain activity through the FVM, which spurred growth in its DeFi economy. The proposed Filecoin Web Services (FWS) and launch of FIL-collateralized stablecoins are set to further boost this momentum.

Source: Defillama (as of December 16, 2024)
  • DeFi Milestones
    • As of December 16, 2024, more than 4,700 unique contracts have been deployed on FVM, enabling over 3 million transactions. DeFi activity on FVM saw average net deposits exceeding 30M FIL ($200M), driven by staking, liquid staking, and DEXs, with GLIF leading at 62%, followed by FilFi (10%) and SFT Protocol (9%). Net borrows averaged 26M FIL ($173M), highlighting strong growth in Filecoin’s DeFi ecosystem.
  • FIL-Collateralized Stablecoin for the Filecoin Ecosystem
    • USDFC is a FIL-backed stablecoin launched by Secured Finance in Q4 2024 to address key challenges in the Filecoin ecosystem. It introduces stability to a network previously lacking stablecoin options, reducing volatility and enhancing value storage, much like DAI did for Ethereum. 
    • By allowing FIL holders and SPs to collateralize their assets for USD, USDFC helps cover operational costs without selling FIL, preserving asset value and network support. It also boosts liquidity in lending markets by providing FIL-backed stablecoin liquidity, driving more efficient capital flows within the Filecoin ecosystem.

3. Becoming Indispensable

DePIN gained prominence, with Filecoin strengthening its position through key partnerships with AI and compute projects. Meanwhile, on-chain archival received significant recognition through major on-ramp partnerships.

“…thanks to Filecoin for building an awesome decentralized archive layer.” – Anatoly (Solana Co-Founder)

  • Notable On-Ramps of 2024
    • At Solana Breakpoint this year, Filecoin founder Juan Benet highlighted how Filecoin’s zero-knowledge (ZK) storage is securing the entire Solana ledger.
    • Similarly, Cardano apps now have the opportunity to boost data redundancy and decentralization through the Blockfrost integration with Filecoin.
    • SingularityNET’s integration with Filecoin (via Lighthouse) emphasizes the growing need for scalable and cost-effective storage in the AI-driven era, where managing vast amounts of data efficiently is critical.
    • These meaningful partnerships help signal Filecoin as a key player in both the Chain Archival and AI narratives.
Source: Filecoin (X)
  • Compute & AI Partnerships
    • This year, Filecoin has positioned itself as a key player in the growing field of Decentralized AI. The emergence of projects within the ecosystem like Ramo (network participation), Bagel (AI & cryptography research), Swan Chain (AI training and development), and Lilypad (distributed compute for AI) highlights Filecoin’s expanding role in powering AI innovation.

2024 Filecoin Challenges

Despite the immense progress, we noted some challenges that the community faced – though it’s worth bearing in mind that Web3 products are still very early, and forming a credible alternative to the centralized cloud is a huge problem statement.

Product Market Fit:

  • Roadblocks like limited retrievability and high costs (driven by data replication) challenge the efficiency of the Filecoin network.
  • There is a need to make payments easier by allowing transactions directly on the Filecoin network, using methods like stablecoins or flexible payment options.
  • Improving visibility into the onboarding process and using customer data can help refine strategies and boost performance in key areas.

Building a Sustainable Economic Model + Stronger Economic Loops:

Viewing Filecoin as an island economy highlights its focus on accruing value by exporting goods and services while also keeping as much value as possible within the network by minimizing outflows.

Source: Realizing Filecoin’s Vision (Part 2) – Juan Benet
  • A key challenge lies in reducing external outflows while finding ways to boost exports and capture more demand within the ecosystem.
  • Ensuring that transactions remain on-chain is equally crucial to strengthening this economic model and creating stronger economic loops.

Filecoin’s 2025 Outlook

Looking ahead to 2025, Filecoin’s evolution continues. Here are three key themes that could drive transformative growth for the network while addressing the 2024 challenges outlined above.

“Filecoin is at an inflection point.” – Blockworks Research

1. Accelerating Filecoin by 450x with Fast Finality (F3)

Fast Finality (F3) is one of the most impactful upgrades to Filecoin’s consensus layer since the launch of its mainnet. By drastically reducing transaction finality times, F3 overcomes a key limitation of the network’s original consensus mechanism. This enhancement is scheduled to go live on the mainnet in Q1 2025.

Old vs. New Finality:

  • Before F3, Filecoin’s consensus mechanism ensured secure block validation but required 900 epochs – at 30 seconds per epoch, roughly 7.5 hours – to finalize transactions, which was too slow for applications like smart contracts or cross-chain bridges.
  • With F3, transactions can now optimistically finalize in minutes. Since 7.5 hours is 450 minutes, finalizing in about one minute is a 450x improvement.

What this means for Filecoin:

  • Enhanced Speed & UX: Transactions finalize within minutes, enabling low-latency applications and eliminating the long waits previously experienced.
  • Expanded Use Cases & Accessibility: L2 subnets like InterPlanetary Consensus (IPC), efficient smart contracts and decentralized applications, and blockchain bridges for interoperability with other chains.

Ultimately, this allows Filecoin to improve its usability across a wider variety of applications.

2. Moving Beyond Storage with FWS

Filecoin Web Services (FWS) emerged this year as a pivotal concept. It represents a strategic shift for Filecoin, expanding its scope from primarily a decentralized storage network to a broader marketplace for blockchain-based cloud services. This diversification can attract a wider range of users and use cases, potentially creating more positive economic loops within the network. Here are some pointers on why FWS should be on your radar:

  1. Strengthening Filecoin’s Competitive Edge: FWS will introduce features like Programmatic SLAs (which automate and enforce service agreements through smart contracts, ensuring clear performance expectations and penalties) and Verifiable Proofs (which provide cryptographic evidence of service delivery, allowing clients to independently verify service execution).
  2. Expands Filecoin’s Capabilities: Goes beyond Proof of Replication (PoRep) by adding Proof of Data Possession (PDP), enabling robust hot storage use cases. PDP will help improve data retrievability, a crucial factor in achieving product-market fit that has been widely discussed within the Filecoin community this year.
  3. Positions Filecoin as a leading platform in the decentralized web: FWS will facilitate the integration of multiple networks and protocols, creating a cohesive marketplace for storage, compute, bandwidth, and other services. This could make Filecoin a key player in the growth of the decentralized web.

FWS is currently a concept in development, with a new storage service featuring PDP (v0) underway. Following this milestone, development of the FWS marketplace will begin, with an expected launch in Q1 2025.

3. Unlocking new value streams in Filecoin

As a Layer 1 blockchain, Filecoin primarily generates revenue through gas fee burns (which happen when chain resources are used or when faults arise). However, relying on gas fee burns as a main source of revenue is not scalable and, more importantly, increases operational expenses as well as service costs.

A sustainable approach involves value returning to the Filecoin economy through the use of services in the FWS marketplace, fostering a more scalable and balanced revenue model. Proposed value accrual mechanisms include (see the sketch after this list):

  • FWS Fees: Commission (%) charged based on the transaction volume in the marketplace.
  • Service Fees: Applied when a user accesses a service or a vendor provides one.
  • SLA Penalties: Imposed on service providers who fail to meet agreed-upon performance standards.
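As promised above, here is a back-of-the-envelope sketch of how these three streams might add up. The rates and volumes are entirely hypothetical placeholders, not proposed protocol parameters:

```typescript
// Hypothetical value-accrual arithmetic for an FWS-style marketplace.
// All rates and volumes are placeholders, not proposed protocol parameters.
interface PeriodActivity {
  marketplaceVolumeFIL: number; // gross deal volume settled via FWS
  serviceCallsFIL: number;      // value of direct service usage
  slaBreachesFIL: number;       // collateral slashed from failed SLAs
}

const FWS_COMMISSION = 0.01; // 1% of marketplace volume (placeholder)
const SERVICE_FEE = 0.005;   // 0.5% on service usage (placeholder)

function protocolRevenue(p: PeriodActivity): number {
  return (
    p.marketplaceVolumeFIL * FWS_COMMISSION + // FWS fees
    p.serviceCallsFIL * SERVICE_FEE +         // service fees
    p.slaBreachesFIL                          // SLA penalties
  );
}

console.log(
  protocolRevenue({
    marketplaceVolumeFIL: 1_000_000,
    serviceCallsFIL: 400_000,
    slaBreachesFIL: 5_000,
  })
); // 10,000 + 2,000 + 5,000 = 17,000 FIL accrued this period
```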

This shift promises a more robust and diversified revenue stream, ensuring Filecoin’s continued relevance and profitability in the evolving market.


Final Thoughts

As data grows in value, we expect advancements in privacy-preserving machine learning, data-driven business models, and the increasing role of AI agents in unlocking decentralized storage’s potential.

Looking towards 2025, with the upcoming Fast Finality (F3) launch on the mainnet and the continued development of Filecoin Web Services, Filecoin is set to play a central role in shaping the future of data and AI within decentralized ecosystems. We expect to see these advancements positioning Filecoin beyond storage and unlocking a sustainable economic model through new revenue streams generated by FWS.

To stay updated on the latest in the Filecoin ecosystem, follow the @Filecointldr handle or join us on Discord.

Many thanks to HQ Han and Jonathan Victor for reviewing and providing valuable insights to this piece.

Disclaimer: This information is for informational purposes only and is not intended to constitute investment, financial, legal, or other advice. This information is not an endorsement, offer, or recommendation to use any particular service, product, or application.

At first, the convergence of artificial intelligence (AI) and blockchain seemed like an awkward pairing of buzzwords—a notion often met with skepticism among early adopters. But in merely a year’s time, decentralized AI has evolved from being an obscure idea to one that is central to conversations around the Web3 environment. Such swift transformation owes its momentum to a few crucial elements:

  • Influence of AI: AI is set to significantly impact how we interact with the world. As AI agents grow more sophisticated, they will manage tasks like financial transactions and personal coaching. This evolution raises important questions about control and governance in AI development.
  • The Risks of Centralized Power: Centralized AI models controlled by a few tech giants pose serious risks, including bias, censorship, and data privacy concerns. This concentration of power stifles innovation and creates vulnerabilities, as highlighted by the recent security breach at Hugging Face.
  • The Demand for an Inclusive AI Ecosystem: Decentralized AI offers a pathway to a more equitable and accessible AI landscape by distributing computational processes across various systems. Key benefits include:
    • Reduced Costs: Lower barriers enable smaller developers and startups to innovate in AI.
    • Enhanced Data Integrity: Verifiable data provenance increases transparency and trust in AI models.
    • Combating Censorship: Aligning AI development with market needs fosters a more democratic technological environment.

These points highlight the value of an alternative approach to centralized AI.


The Pillars of Decentralized AI

Decentralized AI rests on three pillars: it leverages idle computing power from users, utilizes secure decentralized storage, and implements transparent data labeling.

  • Decentralized Storage: Utilizing decentralized storage networks like Filecoin ensures secure and verifiable storage for large datasets.
  • Decentralized Compute: By leveraging idle computing power from individual users and distributing tasks across a network, Decentralized AI makes AI development more accessible and cost-effective.
  • Decentralized Data Labeling and Verification: Transparent and verifiable data labeling processes help ensure data quality and reduce bias, fostering trust in AI systems.

A closer look: Decentralized AI Projects in the Filecoin Ecosystem 

To take a closer look at how the Web3 stack can benefit the AI space, we’ll explore the approaches four decentralized AI projects are taking. These projects utilize some or all of the pillars of decentralized AI outlined above.

Source: Unlocking Decentralized Storage for AI Workloads and Beyond – Vukasin Vukoje

Ramo – Simplifying Decentralized Network Participation (Funding Stage: Seed)

Ramo plays a crucial role in powering AI workloads by coordinating capital and hardware. By merging resources from various providers, Ramo facilitates the execution of complex tasks such as storage, SNARK generation, and computation, while allowing hardware resources to be jointly funded across multiple networks.

  • Multi-Network Jobs: Ramo supports jobs across multiple networks (e.g., read from Filecoin, process on Fluence, write back to Filecoin), which helps maximize hardware providers’ revenue and reduces coordination complexity.
Source: Decentralized Business Intelligence with Swan Chain’s AI Agent – Charles Cao

Swan Chain – Decentralized AI Training and Deployment (Funding Stage: Seed)

Swan Chain is a decentralized compute network, connecting users with idle computing resources for AI tasks like model training. Filecoin serves as its primary storage layer, ensuring secure, transparent, and accessible storage of AI data, aligning with the principles of decentralized AI.

  • Decentralized Compute Marketplace: Swan Chain aggregates global computing resources, offering a cost-effective alternative to centralized cloud services. Users can bid for computing jobs, and Swan Chain matches them with suitable providers based on requirements.
  • Filecoin Integration for Secure Data Storage: Swan Chain utilizes Filecoin and IPFS to securely store AI models and outputs, ensuring transparency and accountability in the AI development process.
  • Support for Diverse AI Workloads: Swan Chain supports various AI tasks, including model training, inference, and rendering, with examples like large language models and image/music generation.
Source: The role of open, verifiable systems in AI with Filecoin and Lilypad – Ally Haire

Lilypad – Distributed Compute for AI (Funding Stage: Seed)

Lilypad aims to build a trustless, distributed compute network that unleashes idle processing power and creates a new marketplace for AI, machine learning, and other large-scale computations. By integrating Filecoin and utilizing IPFS for hot storage, Lilypad ensures secure, transparent, and verifiable data handling throughout the AI workflow, supporting an open and accountable AI development landscape.

  • Job-Based Compute Matching: Lilypad’s job-based model matches user-defined compute needs (e.g., GPU type, resources) with providers, creating a marketplace for developers to share and monetize AI models within the decentralized AI ecosystem.
Source: bagel.net

Bagel – AI & Cryptography Research Lab (Funding Stage: Pre-Seed)

Bagel is an AI and cryptography research lab creating a decentralized machine learning ecosystem that enables AI developers to train and store models using the computing and storage power of decentralized networks like Filecoin. Its innovative GPU Restaking technology enhances Filecoin’s utility for AI applications by allowing storage providers (SPs) to contribute to both storage and compute networks simultaneously, thereby expanding support for AI developers and generating new revenue opportunities for SPs.

  • Increased Revenue for Filecoin SPs: Bagel helps storage providers monetize both storage and compute resources, boosting their income and incentivizing greater network participation.
  • Optimized Compute Utilization: With dynamic routing, Bagel directs GPUs to profitable networks, maximizing efficiency and returns for providers and users.

In Conclusion 

The intersection of Filecoin and AI marks a significant step forward in the evolution of technology. By combining verifiable storage with computing networks, we are not only addressing current challenges but also paving the way for future innovations. As these technologies continue to develop, their impact on AI and beyond will be profound, offering new possibilities for businesses and developers alike.

To understand more about Ramo, Swan Chain, Lilypad or Bagel dive into the respective keynotes and links here:

To stay updated on the latest in the Filecoin ecosystem, follow the @Filecointldr handle or join us on Discord.

Many thanks to HQ Han and Jonathan Victor for reviewing and providing valuable insights to this piece.

Disclaimer: This information is for informational purposes only and is not intended to constitute investment, financial, legal, or other advice. This information is not an endorsement, offer, or recommendation to use any particular service, product, or application.

Layer 2 solutions (L2s) are essential innovations in blockchain technology that enhance the scalability, efficiency, and functionality of their respective networks. For Filecoin, an L1 focused on decentralized storage, L2 solutions play a crucial role in bringing new capabilities to the base infrastructure of the network.

As Filecoin continues to grow, L2 solutions are helping bring Filecoin to market, and creating tailored offerings to builders focused on specific verticals. This post explores the current state of L2 solutions on Filecoin, highlighting the pioneering advancements and future directions.


Underlying Architecture

Before diving into the L2s, it’s useful to understand the shared framework that many of Filecoin’s L2s are built on: Interplanetary Consensus (IPC).

InterPlanetary Consensus (IPC) is a framework designed to solve the problem of scalability in decentralized applications (dApps). IPC achieves this by allowing the creation of subnets – independent blockchains that can be customized with specific consensus algorithms tailored to the needs of the application. These subnets can communicate seamlessly with each other, minimizing the need for cross-chain bridges, while remaining anchored to the root Filecoin network.

Builders are drawn to IPC for several reasons. First, IPC subnets inherit the security features of their parent network, ensuring a high level of security for the dApps they host. Second, IPC leverages the Filecoin Virtual Machine (FVM), a versatile execution environment that supports various programming languages, allowing for greater interoperability with other blockchains. Finally, IPC’s tight integration with Filecoin, a large decentralized storage network, offers dApps easy access to robust data storage and retrieval capabilities. This combination of scalability, security, interoperability, and storage integration makes IPC an attractive choice for developers building the next generation of dApps.
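A conceptual picture of this hierarchy: each subnet runs its own consensus but periodically checkpoints its state to its parent, terminating at the Filecoin root chain – which is what lets children inherit the parent’s security. The sketch below is a data-structure illustration of that anchoring relationship under those assumptions, with invented names; it is not the actual IPC implementation or API:

```typescript
// Conceptual model of IPC's subnet hierarchy: subnets pick their own
// consensus and periodically checkpoint state to their parent, terminating
// at the Filecoin root network. Illustrative only; not the real IPC API.
interface Subnet {
  id: string;
  consensus: "tendermint-style" | "proof-of-stake" | "custom";
  parent?: Subnet; // undefined => this is the Filecoin root network
  lastCheckpointHash?: string;
}

// Anchoring a subnet's state summary into its parent is what lets a child
// inherit the parent's (ultimately Filecoin's) security.
function checkpoint(subnet: Subnet, stateHash: string): void {
  subnet.lastCheckpointHash = stateHash;
  if (subnet.parent) {
    console.log(`anchoring ${subnet.id} state ${stateHash} into ${subnet.parent.id}`);
  }
}

const root: Subnet = { id: "filecoin-root", consensus: "custom" };
const appSubnet: Subnet = {
  id: "filecoin-root/data-app",
  consensus: "tendermint-style",
  parent: root,
};
checkpoint(appSubnet, "0xabc123"); // child state committed toward the root
```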


Advancements in Data Management with Filecoin’s L2 Solutions

As the demand for efficient data management increases, Filecoin’s Layer 2 solutions are rising to meet these needs. These advancements focus on optimizing data storage and retrieval, offering enhanced scalability and cost-effectiveness across various applications. Basin is one such startup leading the charge.

Source: An introduction to Basin: The first data L2 on Filecoin

Basin, the first data Layer 2 on Filecoin, represents an advancement in decentralized data infrastructure, bringing a swath of new services into the Filecoin ecosystem targeted at data-heavy applications:

  • Key Features and Innovations
    • Hot and Cold Data Layers: Basin’s dual-layer approach incorporates a hot cache layer for real-time data access and a cold storage component for long-term archiving. This setup ensures both immediate accessibility and cost-effective storage, catering to diverse data needs.
    • Scalable Infrastructure: Basin’s architecture combines Filecoin’s secure storage capabilities with a flexible, scalable design ideal for handling high-volume data from IoT and AI applications.
    • Familiar Interfaces: Basin supports compatibility with S3, allowing developers to use familiar tools for managing data, which facilitates a smoother transition to decentralized solutions.
Source: A New Era of Web3 Scaling: The World’s First Data L2 on Filecoin – Marla Natoli

Basin is being actively used in real-world applications, such as handling weather data for decentralized stations with WeatherXM, and generating synthetic data for smart contracts. These use cases highlight Basin’s ability to efficiently store, manage, and monetize diverse data types, advancing practices in AI and machine learning.


Simplifying Decentralized Storage: Innovations and Challenges

Efficiently managing decentralized storage involves overcoming challenges related to user accessibility, cost, and integration. Providing more intuitive and cost-effective tools for data management will help address these challenges – and this is where Akave comes in.

Source: Akave X (X)

Akave is the first L2 storage chain powering on-chain data lakes, offering a novel approach to managing large volumes of data within a decentralized network. Data lakes are used in traditional enterprises to manage all types of data – typically feeding into large scale compute flows (e.g. for big data analytics). By leveraging Filecoin’s infrastructure, Akave aims to become a leading solution in decentralized data management, with a focus on enhancing data handling capabilities and integrating advanced security measures.

  • Key Features and Innovations
    • On-Chain Data Management: Akave focuses on creating on-chain data lakes, which provide a highly scalable and secure solution for managing large volumes of data directly on Filecoin.
    • Advanced Data Handling: The platform supports customizable data handling options such as replication policies and erasure coding, enhancing data security and availability.
    • Integration with Filecoin: Akave leverages Filecoin’s blockchain for improved data management, security, and decentralization.
Source: Akave: AI Trust Unlocked

Akave’s Decentralized Data Lakes revolutionize data storage with faster local access by placing data close to compute stacks, cutting egress costs compared to centralized clouds, and ensuring immutability and integrity through ZK Proofs. Users benefit from competitive pricing and diverse options via an open marketplace. Continuous visibility into data status and history is provided through Akave’s integration with Filecoin’s InterPlanetary Consensus (IPC), enhancing transparency and trust.


Enhancing AI and Unstructured Data Storage with Filecoin’s L2 Solutions

In the realm of AI and unstructured data, specialized storage solutions are crucial for managing and processing large datasets efficiently. Filecoin’s Layer 2 solutions like Storacha Network are stepping up to provide high-performance storage tailored for these needs. 

Source: Storacha Network

Storacha Network is a cutting-edge storage solution designed to enhance the management of AI and unstructured data. Leveraging Filecoin’s robust infrastructure, Storacha Network offers high-performance decentralized storage tailored for advanced applications. Looking ahead, Storacha envisions evolving into a federated network with increased public participation, aiming to enhance global data access through a decentralized CDN and fostering broad community involvement.

  • Key Features and Innovations
    • High-Performance Storage: Storacha offers decentralized hot object storage, ensuring rapid access and retrieval of data, essential for AI applications that require quick processing and scaling.
    • Provenance and Ownership: Users maintain control over their data through UCANs (User-Controlled Authorization Networks), providing secure, cryptographic proofs of data ownership and access rights without frequent blockchain interactions.
    • Efficient Data Handling: Storacha handles large datasets by sharding files, facilitating quick retrieval and efficient management, crucial for large-scale AI operations.
Source: Storacha Network Revealed: Bringing the Heat to AI Data Storage – Alexander Kinstler

Storacha Network supports a range of AI use cases by providing fast, scalable object storage optimized for both structured and unstructured data. It addresses key needs such as verifiability and provenance for decentralized GPU networks, ensuring that training processes are executed as expected and checkpoints are maintained. 

Additionally, Storacha allows users to bring their own storage to training jobs, facilitating the sharing of hyperparameters and weights while ensuring ownership of training results.


In Conclusion

To wrap up, Filecoin’s Layer 2 solutions are paving the way for a new era in decentralized data management. Innovations like Basin, Akave, and Storacha Network are not only addressing the challenges of scalability and cost but also enhancing the efficiency and performance of data handling. As these technologies evolve, they promise to transform how data is stored, managed, and utilized, marking significant progress in the Web3 ecosystem.

To understand more about Basin, Akave or Storacha Network, watch their keynotes at the recent FIL Dev Summit.

Many thanks to Jonathan Victor for reviewing and providing valuable insights to this piece.

Disclaimer: This information is for informational purposes only and is not intended to constitute investment, financial, legal, or other advice. This information is not an endorsement, offer, or recommendation to use any particular service, product, or application.

At the latest Filecoin Developer Summit (FDS), Nicola Greco (of FilOz) introduced a vision to evolve Filecoin’s decentralized cloud services: Filecoin Web Services (FWS). FWS aims to provide a framework for deploying composable cloud services – allowing new protocols to bootstrap into a shared marketplace of offerings, all composable with each other.


Source: Filecoin Web Services: Ecosystem of verifiable cloud services – Nicola G

Expanding Beyond Proof of Replication: Extending the Functionality of Filecoin

To understand FWS, it’s useful to first recap the existing service offerings inside the Filecoin network.

The core storage offering, Proof of Replication (PoRep), allows storage providers to use proofs over uniquely encoded data to show that they are still in possession of specific pieces of data. Filecoin uses PoRep both for storage and for consensus – this requires higher security parameters, and therefore makes Filecoin’s base storage offering akin to cold storage: ideal for datasets that need strong guarantees around uniqueness and existence but can accept slower access times.

Furthermore, because Filecoin launched prior to the FVM, much of the onchain tooling (e.g. to set up and maintain storage deals, or to enable payments) exists as “system actors” – non-programmable functions on the network. These functions were built to support the original storage flows, and any evolution would require a full network upgrade to modify them or support new functionality.

However, as more storage on-ramps pushed into building storage solutions for customers, it became clear that there was a need for more storage offerings above and beyond Filecoin’s base offering. As a result, new types of proofs (such as Proof of Data Possession) have been proposed to run on the Filecoin network – allowing more use cases to be natively supported.

In designing these new offerings to sit on top of Filecoin, it became clear that many of these new proof offerings would need generalized versions of the onchain tooling that exists as “system actors” – such as payment rails. Rather than having many systems independently evolve their own architecture (and risk losing composability), a new proposal was put forward to focus on building modular, composable systems via FWS.


Source: Filecoin Web Services: Ecosystem of verifiable cloud services – Nicola G

Introducing Filecoin Web Services: A Modular Approach to Cloud Services

At the core of Greco’s vision are the concepts of modularity and reuse. If each new service had to build its entire protocol from scratch – developing everything from deal management to escrow and SLA enforcement – the barrier to entry for new services would be high. FWS proposes a unified protocol that standardizes these components, allowing developers to focus on building specific services rather than recreating the entire stack.

FWS would serve as a thin, opinionated layer that manages payments, collateral, deal structuring, and SLA enforcement across various services. This standardization would enable seamless integration of new services, whether they are storage-related like PDP and retrieval services or entirely new offerings like markets for zk-SNARK proofs or AI-based computations. By providing a common framework, FWS would reduce complexity, lower development costs, and increase the rate of development for building within the Filecoin ecosystem.


Source: Filecoin Web Services: Ecosystem of verifiable cloud services – Nicola G

The Power of a Unified Marketplace: Enhancing Efficiency and Accessibility

One of the key benefits of FWS is its potential to streamline the user experience. Without FWS, users would need to lock tokens in multiple smart contracts to access different services, leading to inefficiencies in collateral management and prepayment and increasing users’ costs. FWS envisions a single entry point where users can determine how they’d like to pay – prepaid or pay-as-you-go – with the same rails usable by multiple services. This model mirrors the convenience of traditional cloud services, where users simply provide a payment method and are periodically billed.

Moreover, by consolidating financial management into a single contract, FWS would improve collateral efficiency and reduce the overhead associated with managing multiple service contracts. This would also allow utilization of one service to enable credit in other services – allowing a credit history to be built up across disparate protocols. This approach not only simplifies the user experience but also enhances the overall liquidity and flexibility of the Filecoin ecosystem.
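
To make this concrete, below is a minimal sketch of the shared-payment-rail idea, where a single escrow balance funds usage across many services and a per-service usage history accumulates. The names (PaymentRail, deposit, charge) and the in-memory ledger are illustrative assumptions, not FWS APIs:

```python
from dataclasses import dataclass, field

# Hypothetical sketch: one escrow ledger shared by many services,
# instead of per-service token lockups. Not an FWS implementation.
@dataclass
class PaymentRail:
    balances: dict = field(default_factory=dict)  # user -> escrowed funds
    usage: dict = field(default_factory=dict)     # (user, service) -> total spend

    def deposit(self, user: str, amount: float) -> None:
        """Single entry point: the user funds one escrow for all services."""
        self.balances[user] = self.balances.get(user, 0.0) + amount

    def charge(self, user: str, service: str, amount: float) -> None:
        """Any registered service bills against the same escrow balance."""
        if self.balances.get(user, 0.0) < amount:
            raise ValueError("insufficient escrow")
        self.balances[user] -= amount
        key = (user, service)
        # The cross-service usage history is what could back credit later.
        self.usage[key] = self.usage.get(key, 0.0) + amount

rail = PaymentRail()
rail.deposit("alice", 10.0)
rail.charge("alice", "pdp-storage", 1.5)  # a storage service bills
rail.charge("alice", "retrieval", 0.25)   # a retrieval service bills the same escrow
```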


Source: Filecoin Web Services: Ecosystem of verifiable cloud services – Nicola G

A Vision for the Future: FWS as a Distribution Layer for Decentralized Services

Looking ahead, Greco envisions FWS not just as a tool for enhancing Filecoin’s storage capabilities but as a broader distribution layer for decentralized services. As the ecosystem grows, FWS could facilitate the integration of multiple networks and protocols, creating a cohesive marketplace for storage, compute, bandwidth, and other services. By hosting diverse offerings such as zero-knowledge proof generation and decentralized compute, FWS could position Filecoin at the center of a vibrant, interconnected ecosystem – supporting a wide array of applications beyond storage and driving innovation and adoption across the decentralized web.

To understand more about FWS, watch the full keynote by Nicola Greco on YouTube.

Many thanks to Jonathan Victor for reviewing and providing valuable insights to this piece.

Disclaimer: This information is for informational purposes only and is not intended to constitute investment, financial, legal, or other advice. This information is not an endorsement, offer, or recommendation to use any particular service, product, or application.

Editor’s Note: This blogpost is a repost of the original content published on 5 March 2024, by Bidhan Roy and Marcos Villagra from Bagel. Founded in 2023 by CEO Bidhan Roy, Bagel is a machine learning and cryptography research lab building a permissionless, privacy-preserving machine learning ecosystem. This blogpost represents the independent view of these authors, who have given their permission for this re-publication.


Trillion-dollar industries are unable to leverage their immensely valuable data for AI training and inference due to privacy concerns. The potential for AI-driven breakthroughs—genomic secrets that could cure diseases, predictive insights to eliminate supply chain waste, and chevrons of untapped energy sources—remains locked away. Privacy regulations also closely guard this valuable and sensitive information.

To propel human civilization forward in energy, healthcare, and collaboration, it is crucial to enable AI systems that train and generate inference on data while maintaining full end-to-end privacy. At Bagel, pioneering this capability is our mission. We believe accessing a fundamental resource like knowledge, for both human-driven and autonomous AI, should not entail a compromise on privacy.

We have applied and experimented with almost all the major privacy-preserving machine learning (PPML) mechanisms. Below, we share our insights, our approach, and some research breakthroughs.

And if you’re in a rush, we have a TLDR at the end.

Privacy-preserving Machine Learning (PPML)

Recent advances in academia and industry have focused on incorporating privacy mechanisms into machine learning models, highlighting a significant move towards privacy-preserving machine learning (PPML). At Bagel, we have experimented with all the major PPML techniques, particularly those post differential privacy. Our work, positioned at the intersection of AI and cryptography, draws from the cutting edge in both domains.

Our research covered a wide range of PPML techniques suitable for our platform. Among those, Differential Privacy (DP), Federated Learning (FL), Zero-knowledge Machine Learning (ZKML), and Fully Homomorphic Encryption Machine Learning (FHEML) stood out for their potential in PPML.

First, we will delve into each of these, examining their advantages and drawbacks. In subsequent posts, we will describe Bagel’s approach to data privacy, which addresses and resolves the challenges associated with the existing solutions.


Differential Privacy (DP)

One of the first and most important techniques with a mathematical guarantee for incorporating privacy into data is differential privacy or DP (Dwork et al. 2006), addressing the challenges faced by earlier methods with a quantifiable privacy definition.

DP ensures that a randomized algorithm, A, maintains privacy across datasets D1 and D2—which differ by a single record—by keeping the probability of A(D1) and A(D2) generating identical outcomes relatively unchanged. This principle implies that minor dataset modifications do not significantly alter outcome probabilities, marking a pivotal advancement in data privacy.
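
Formally, the standard guarantee can be written as follows: a randomized algorithm A is \(\varepsilon\)-differentially private if, for all datasets D1 and D2 differing by a single record and for every set of outcomes S,

\(\Pr[A(D_1) \in S] \le e^{\varepsilon} \cdot \Pr[A(D_2) \in S],\)

where a smaller \(\varepsilon\) means the two output distributions are closer and privacy is stronger.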

The application of DP in machine learning, particularly in neural network training and inference, demonstrates its versatility and effectiveness. Notable implementations include adapting DP for supervised learning algorithms by integrating random noise at various phases: directly onto the data, within the training process, or during inference, as highlighted by Ponomareva et al. (2023) and further references.

The balance between privacy and accuracy in DP is influenced by the noise level: greater noise enhances privacy at the cost of accuracy, affecting both inference and training stages. This relationship was explored by Abadi et al. (2016) through the introduction of Gaussian noise to the stochastic gradient descent (DP-SGD) algorithm, observing the noise’s impact on accuracy across the MNIST and CIFAR-10 datasets.
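
As a rough sketch of the DP-SGD idea (clip each example’s gradient, average, then add Gaussian noise), assuming illustrative values for the learning rate, clipping norm, and noise multiplier:

```python
import numpy as np

def dp_sgd_step(weights, per_example_grads, lr=0.1, clip_norm=1.0, noise_mult=1.1):
    """One DP-SGD update: clip per-example gradients, average, add noise."""
    clipped = []
    for g in per_example_grads:
        norm = np.linalg.norm(g)
        # Clipping bounds any single example's influence on the update.
        clipped.append(g * min(1.0, clip_norm / max(norm, 1e-12)))
    mean_grad = np.mean(clipped, axis=0)
    # Gaussian noise scaled to the clipping norm is what provides the DP guarantee.
    noise = np.random.normal(0.0, noise_mult * clip_norm / len(clipped),
                             size=mean_grad.shape)
    return weights - lr * (mean_grad + noise)

# Toy usage with random gradients for a 3-parameter model:
w = np.zeros(3)
grads = [np.random.randn(3) for _ in range(8)]
w = dp_sgd_step(w, grads)
```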

An innovative DP application, Private Aggregation of Teacher Ensembles (PATE) by Papernot et al. (2016), divides a dataset into disjoint subsets and trains a network on each without privacy; these networks are termed teachers. The teachers’ aggregated inferences, subjected to added noise for privacy, inform the training of a student model that emulates the teacher ensemble. This method also underscores the trade-off between privacy enhancement through noise addition and the resultant accuracy reduction.
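
A minimal sketch of PATE’s noisy aggregation step; the teacher count, number of classes, and Laplace noise scale below are illustrative:

```python
import numpy as np

# Each of 10 teachers (trained on disjoint data subsets) votes a label
# for one query record; 3 possible classes.
teacher_votes = np.array([0, 2, 1, 0, 0, 2, 0, 1, 0, 0])
counts = np.bincount(teacher_votes, minlength=3)              # votes per class
noisy_counts = counts + np.random.laplace(scale=2.0, size=3)  # privacy noise
student_label = int(np.argmax(noisy_counts))  # label used to train the student
print(student_label)
```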

Further studies affirm that while privacy can be secured with little impact on execution times (Li et al. 2015), stringent privacy measures can obscure discernible patterns essential for learning (Abadi et al. 2016). Consequently, a certain level of privacy must be relinquished in DP to facilitate effective machine learning model training, illustrating the nuanced balance between privacy preservation and learning efficiency.

Pros of Differential Privacy

The advantages of using DP are:

Effortless. Easy to implement into algorithms and code.

Algorithm independence. Schemes can be made independent of the training or inference algorithm.

Fast. Some DP mechanisms have been shown to have little impact on the execution times of algorithms.

Tunable privacy. The degree of desired privacy can be chosen by the algorithm designer.

Cons of Differential Privacy

Access to private data is still necessary. Teachers in the PATE scheme must have full access to the private data (Papernot et al. 2016) in order to train a neural network. Also, the stochastic gradient descent algorithm based on DP only adds noise to the weight updates and needs access to private data for training (Abadi et al. 2016).

Privacy-Accuracy-Speed trade-off on data. All implementations must sacrifice some privacy in order to get good results: if there is no discernible pattern in the input, then there is nothing to train (Feyisetan et al. 2020). The implementation of some noise mechanisms can also impact execution times, necessitating a balance between speed and the goals of privacy and accuracy.


Zero-Knowledge Machine Learning (ZKML)

A zero-knowledge proof system (ZKP) is a method allowing a prover P to convince a verifier V about the truth of a statement without disclosing any information apart from the statement’s veracity. To affirm the statement’s truth, P produces a proof π for V to review, enabling V to be convinced of the statement’s truthfulness.

Zero-Knowledge Machine Learning (ZKML) is an approach that combines the principles of zero-knowledge proofs (ZKPs) with machine learning. This integration allows machine learning models to be trained and to infer with verifiability.

For an in-depth examination of ZKML, refer to the work by Xing et al. (2023). Below we provide a brief explanation that focuses on the utilization of ZKPs for neural network training and inference.

ZKML Inference

Consider an unlabeled dataset A and a pretrained neural network N tasked with labeling each record in A. To generate a ZK proof of N‘s computation during labeling, an arithmetic circuit C representing N is required, including circuits for each neuron’s activation function. Assuming such a circuit C exists and is publicly accessible, the network’s weights and a dataset record become the private and public inputs, respectively. For any record a of A, N‘s output is denoted by a pair (l,π), where l is the label and π is a zero-knowledge argument asserting the existence of specific weights that facilitated the labeling.

This model illustrates how ZK proves the accurate execution of a neural network on data, concealing the network’s weights within a ZK proof. Consequently, any verifier can be assured that the executing agent possesses the necessary weights.
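
The flow can be sketched as below. This is purely illustrative: the `prove` function is a stand-in (a hash commitment, which is not zero-knowledge), and a real system would instead produce a ZK argument over the circuit C:

```python
import hashlib, json

def evaluate(weights, x):
    # Stand-in for the arithmetic circuit C: a one-layer "network".
    score = sum(w * xi for w, xi in zip(weights, x))
    return 1 if score > 0 else 0

def prove(weights, x, label):
    # Placeholder "proof": a hash commitment. A real ZK argument would
    # prove knowledge of weights consistent with (x, label) without
    # revealing them.
    return hashlib.sha256(json.dumps([weights, x, label]).encode()).hexdigest()

weights = [0.5, -0.2, 0.1]   # private input: the model's weights
record = [1.0, 2.0, 3.0]     # public input: one record a of A
label = evaluate(weights, record)
pi = prove(weights, record, label)
print(label, pi[:16])        # the pair (l, π) described above
```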

ZKML Training

ZKPs are applicable during training to validate N‘s correct execution on a labeled dataset A. Here, A serves as the public input, with an arithmetic circuit C depicting the neural network N. The training process requires an additional arithmetic circuit to implement the optimization function minimizing the loss function. For each training epoch i, a proof π_i is generated, confirming the algorithm’s accurate execution through epoch i, including the validity of the preceding epoch’s proof. The training culminates with a compressed proof π, proving the correct training over dataset A.

The explanation above illustrates that during training, the network’s weights are concealed to ensure that the training is correctly executed on the given dataset A. Additionally, all internal states of the network remain undisclosed throughout the training process.

Pros of ZKML

The advantages of using ZKPs with neural networks are:

Privacy of model weights. The weights of the neural network are never revealed during training or inference in any way. The weights and the internal states of the network algorithm are private inputs for the ZKP.

Verifiability. The proof certifies the proper execution of training or inference processes and guarantees the accurate computation of weights.

Trustlessness. The proof and its verification properties ensure that the data owner is not required to place trust in the agent operating the neural network. Instead, the data owner can rely on the proof to confirm the accuracy of both the computation and the existence of correct weights.

Cons of ZKML

The disadvantages of using ZKPs with neural networks are:

No data privacy. The agent running the neural network needs access to the data in order to train or do inference. Data is considered a parameter that is publicly known to the data owner and the prover running the neural network (Xing et al. 2023).

No privacy for the model’s algorithm. In order to create a ZK proof, the algorithm of the entire neural network should be publicly known. This includes the activation functions, the loss function, optimization algorithm used, etc (Xing et al. 2023).

Proof generation of an expensive computation. Presently, the process of generating a ZK proof is computationally demanding – see, for example, this report on the computation times of ZK provers. Creating a proof for each epoch within a training algorithm can exacerbate the computational burden of an already resource-intensive task.


Federated Learning (FL)

In Federated Learning (FL), we look to train a global model on a dataset distributed across multiple servers, each holding local data samples, without any server sharing its local data.

In FL there is a global objective function that is being optimized which is defined as

\(f(x_1,\dots,x_n)=\frac{1}{n}\sum_{i=1}^n f_i(x_i),\)

where n is the number of servers, each variable \(x_i\) is the set of parameters as viewed by server i, and each \(f_i\) is the local objective function of server i. FL tries to find the set of values that optimizes f.

The figure below shows the general process in FL.

  1. Initialization. An initial global model is created and distributed by a central server to all other servers.
  2. Local training. Each server trains the model using their local data. This ensures data privacy and security.
  3. Model update. After training, each server shares with the central server their local updates like gradients and parameters.
  4. Aggregation. The central server receives all local updates and aggregates them into the global model, for example, using averaging.
  5. Model distribution. The updated model is distributed again to the local servers, and the previous steps are repeated until the global model achieves the desired level of performance.
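
A compact sketch of one round of these steps using federated averaging on a linear model; the number of servers, model size, and synthetic data below are illustrative:

```python
import numpy as np

def local_update(global_w, X, y, lr=0.1):
    """Step 2: one local gradient step; the data (X, y) never leaves the server."""
    grad = X.T @ (X @ global_w - y) / len(y)
    return global_w - lr * grad

rng = np.random.default_rng(0)
global_w = np.zeros(4)  # step 1: initial global model
servers = [(rng.normal(size=(20, 4)), rng.normal(size=20)) for _ in range(5)]

for _ in range(10):  # step 5: repeat until performance is acceptable
    local_ws = [local_update(global_w, X, y) for X, y in servers]  # steps 2-3
    global_w = np.mean(local_ws, axis=0)                           # step 4: averaging
```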

Since local servers never share their local data, FL guarantees privacy over that data. However, the model being constructed is shared among all parties, and hence, its structure and set of parameters are not hidden.

Pros of FL

The advantages of using FL are:

Data privacy. Local data never leaves the local servers. All computations are done locally, and local servers never need to communicate with one another.

Distributed computing. The creation of the global model is distributed among local servers, thereby parallelizing a resource-intensive computation. Thus, FL is considered a distributed machine learning framework (Xu et al. 2021).

Cons of FL

The disadvantages of using FL are:

Model is not private. The global model is shared among each local server in order to do their computations locally. This includes the aggregated weights and gradients at each step of the FL process. Thus, each local server is aware of the entire architecture of the global model (Konečný et al. 2016).

Data leakage. Recent research indicates that data leakage remains a persistent issue, notably through mechanisms such as gradient sharing—see for example Jin et al. (2022). Consequently, FL cannot provide complete assurances of data privacy.

Trust. Since no proofs are generated in FL, every party involved in the process must be trusted to have computed their updates and parameters as expected (Gao et al. 2023).


Fully Homomorphic Encryption (FHE)

At its core, homomorphic encryption permits computations on encrypted data. By “homomorphic,” we refer to the capacity of an encryption scheme to allow specific operations on ciphertexts that, when decrypted, yield the same result as operations performed directly on the plaintexts.

Consider a scenario with a secret key k and a plaintext m. In an encryption scheme (E,D), where E and D represent the encryption and decryption algorithms respectively, the condition D(k,E(k,m))=m must hold. A scheme (E,D) is deemed fully homomorphic if for any key k and messages m and m’, the properties E(k,m+m’)=E(k,m)+E(k,m’) and E(k,m*m’)=E(k,m)*E(k,m’) are satisfied, with addition and multiplication defined over a finite field. If only one operation is supported, the scheme is partially homomorphic. This definition implies that operations on encrypted data mirror those on plaintext, which is crucial for maintaining data privacy during processing.

In plain words, if we have a fully homomorphic encryption scheme, then operating over the encrypted data is equivalent to operating over the plaintext. We will write FHE to refer to a fully homomorphic encryption scheme. The figure below shows how an arbitrary homomorphic operation works over a plaintext and ciphertext.

The homomorphic property of FHE makes it invaluable in situations where data must remain secure while still being used for computations. For instance, if we possess sensitive data and require a third party to perform data analysis on it, we can rely on FHE to encrypt the data. This allows the third party to conduct analysis on the encrypted data without the need for decryption. The mathematical properties of FHE guarantee the accuracy of the analysis results.
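
As a toy illustration of the homomorphic property, consider textbook RSA, which is only multiplicatively (i.e., partially) homomorphic and entirely insecure at this key size; it nonetheless shows how operating on ciphertexts corresponds to operating on plaintexts:

```python
# Textbook RSA with tiny demo parameters: n = 61 * 53, e*d ≡ 1 mod φ(n).
n, e, d = 3233, 17, 2753

def encrypt(m: int) -> int:
    return pow(m, e, n)

def decrypt(c: int) -> int:
    return pow(c, d, n)

m1, m2 = 7, 12
c1, c2 = encrypt(m1), encrypt(m2)

# Multiply the ciphertexts without ever decrypting them...
c_prod = (c1 * c2) % n

# ...and decryption yields the product of the plaintexts: E(m1)*E(m2) = E(m1*m2).
assert decrypt(c_prod) == (m1 * m2) % n
print(decrypt(c_prod))  # 84
```

A fully homomorphic scheme extends this idea so that both addition and multiplication (and hence arbitrary circuits) can be evaluated under encryption.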

FHE Inference

Fully Homomorphic Encryption (FHE) can be used to perform inference in neural networks while preserving data privacy. Let’s consider a scenario where N is a pretrained neural network, A is a dataset, and (E,D) is an asymmetric FHE scheme. The goal is to perform inference on a record a of A without revealing the sensitive information contained in a to the neural network.

The inference process using FHE begins with encryption. The data owner encrypts the record a using the encryption algorithm E with the public key public_key, obtaining the encrypted record a’ = E(public_key, a).

Next, the data owner sends the encrypted record a’ along with public_key to the neural network N. The neural network N must have knowledge of the encryption scheme (E,D) and its parameters to correctly apply homomorphic operations over the encrypted data a’. Any arithmetic operation performed by N can be safely applied to a’ due to the homomorphic properties of the encryption scheme.

One challenge in using FHE for neural network inference is handling non-linear activation functions, such as sigmoid and ReLU, which involve non-arithmetic computations. To compute these functions homomorphically, they need to be approximated by low-degree polynomials. The approximations allow the activation functions to be computed using homomorphic operations on the encrypted data a’.
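
A quick sketch of such an approximation, fitting a degree-3 polynomial to the sigmoid over an illustrative input range:

```python
import numpy as np

# Least-squares fit of a degree-3 polynomial to sigmoid on [-5, 5].
# FHE schemes evaluate additions and multiplications, so this polynomial
# stands in for the non-arithmetic sigmoid during encrypted inference.
x = np.linspace(-5, 5, 200)
sigmoid = 1.0 / (1.0 + np.exp(-x))
coeffs = np.polyfit(x, sigmoid, 3)
approx = np.polyval(coeffs, x)
print("max abs error on [-5, 5]:", np.abs(approx - sigmoid).max())
```

The residual error of such approximations is one source of the accuracy trade-off noted below.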

After applying the necessary homomorphic operations and approximated activation functions, the neural network N obtains the inference result. It’s important to note that the inference result is still in encrypted form, as all computations were performed on encrypted data.

Finally, the encrypted inference result is sent back to the data owner, who uses the private key associated with the FHE scheme to decrypt the result using the decryption algorithm D. The decrypted inference result is obtained, which can be interpreted and utilized by the data owner.

By following this inference process, the neural network N can perform computations on the encrypted data a’ without having access to the original sensitive information. The FHE scheme ensures that the data remains encrypted throughout the inference process, and only the data owner with the private key can decrypt the final result.

It’s important to note that the neural network N must be designed and trained to work with the specific FHE scheme and its parameters. Additionally, the approximation of non-linear activation functions by low-degree polynomials may introduce some level of approximation error, which should be considered and evaluated based on the specific application and accuracy requirements.

FHE Training

The process of training a neural network using Fully Homomorphic Encryption (FHE) is conceptually similar to performing inference, but with a few key differences. Let’s dive into the details.

Imagine we have an untrained neural network N and an encrypted dataset A’ = E(public_key, A), where E is the encryption function and public_key is the public key of an asymmetric FHE scheme. Our goal is to train N on the encrypted data A’ while preserving the privacy of the original dataset A.

The training process unfolds as follows. Each operation performed by the network and the training algorithm is executed on each encrypted record a’ of A'. This includes both the forward and backward passes of the network. As with inference, any non-arithmetic operations like activation functions need to be approximated using low-degree polynomials to be compatible with the homomorphic properties of FHE.

A fascinating aspect of this approach is that the weights obtained during training are themselves encrypted. They can only be decrypted using the private key of the FHE scheme, which is held exclusively by the data owner. This means that even the agent executing the neural network training never has access to the actual weight values, only their encrypted counterparts.

Think about the implications of this. The data owner can outsource the computational heavy lifting of training to a third party, like a cloud provider with powerful GPUs, without ever revealing their sensitive data. The training process operates on encrypted data and produces encrypted weights, ensuring end-to-end privacy.

Once training is complete, the neural network sends the collection of encrypted weights w’ back to the data owner. The data owner can then decrypt the weights using their private key, obtaining the final trained model. They are the sole party capable of accessing the unencrypted weights and using the model for inference on plaintext data.

A key caveat to keep in mind: FHE operations are computationally expensive, so training a neural network with FHE will generally be slower than training on unencrypted data.

Pros of FHE

The advantages of using FHE are:

Data privacy. Third parties never gain access to the underlying private data, a security guarantee upheld by the assurances of FHE and lattice-based cryptography (Gentry 2009).

Model privacy. Training and inference processes are carried out on encrypted data, eliminating the need to share or publicize the neural network’s parameters for accurate data analysis.

Effectiveness. Previous studies have demonstrated that neural networks operating on encrypted data using FHE maintain their accuracy—see for example Nandakumar et al. (2019) and Xu et al. (2019). Therefore, we can be assured that employing FHE for training and inference processes will achieve the anticipated outcomes.

Quantum resistance. The security of FHE, unlike other encryption schemes, is grounded in difficult problems derived from Lattice theory. These problems are considered to be hard even for quantum computers (Regev 2005), thus offering enhanced protection against potential quantum threats in the future.

Cons of FHE

The disadvantages of using FHE are:

Verifiability. FHE offers proofs of neither correct encryption nor correct computation. Hence, we must trust that the data intended for encryption is indeed the correct data (Viand et al. 2023).

Speed. Relative to conventional encryption schemes, FHE is still considered slow in parameter setup, encryption, and decryption (Gorantala et al. 2023).

Memory requirements. The number of weights that need to be encrypted is proportional to the size of the network. Even for small networks, the RAM requirements are on the order of gigabytes (Chen et al. 2018)(Nandakumar et al. 2019).

Usability. FHE schemes use many parameters that need to be carefully tuned, which requires extensive experience from users (Al Badawi et al. 2022)(Halevi & Shoup 2020).


TLDR

We examined the four most widely used privacy-preserving techniques in machine learning, focusing on neural network training and inference. We evaluated these techniques across four dimensions: data privacy, model algorithm privacy, model weights privacy, and verifiability.

Data privacy considers the model owner’s access to private data. Differential privacy (DP) and zero-knowledge machine learning (ZKML) require access to private data for training and proof generation, respectively. Federated learning (FL) enables training and inference without revealing data, while fully homomorphic encryption (FHE) allows computations on encrypted data.

Model algorithm privacy refers to the data owner’s access to the model’s algorithms. DP does not require algorithm disclosure, while ZKML necessitates it for proof generation. FL distributes algorithms among local servers, and FHE operates without accessing the model’s algorithms.

Model weights privacy concerns the data owner’s access to the model’s weights. DP and ZKML keep weights undisclosed or provide proofs of existence without revealing values. FL involves exchanging weights among servers for decentralized learning, contrasting with DP and ZKML’s privacy-preserving mechanisms. FHE enables training and inference on encrypted data, eliminating the need for model owners to know the weights.

Verifiability refers to the inherent capabilities for verifiable computation. ZKML inherently provides this capability; DP, FL, and FHE do not provide similar levels of integrity assurance.

The table below summarizes our findings:

| Technique | Data privacy | Model algorithm privacy | Model weights privacy | Verifiability |
|---|---|---|---|---|
| DP | No | Yes | Yes | No |
| ZKML | No | No | Yes | Yes |
| FL | Yes | No | No | No |
| FHE | Yes | Yes | Yes | No |

What’s Next 🥯

At Bagel, we recognize that existing privacy-preserving machine learning solutions fall short in providing end-to-end privacy, scalability, and strong trust assumptions. To address these limitations, our team has developed a novel approach based on a modified version of fully homomorphic encryption (FHE).

Our pilot results are extremely promising, indicating that our solution has the potential to revolutionize the field of privacy-preserving machine learning. By leveraging the strengths of homomorphic encryption and optimizing its performance, we aim to deliver a scalable, trustworthy, and truly private machine learning framework.

We believe that our work represents a paradigm shift in the way machine learning is conducted, ensuring that the benefits of AI can be harnessed without compromising user privacy or data security. As we continue to share more about our approach, we invite you to follow our progress by subscribing to the Bagel blog.


For more thought pieces from Bagel, check out their blog here.

To stay updated on the latest Filecoin happenings, follow the @Filecointldr handle.

Disclaimer: This information is for informational purposes only and is not intended to constitute investment, financial, legal, or other advice. This information is not an endorsement, offer, or recommendation to use any particular service, product, or application.

The Decentralized Storage space is rapidly evolving. Filecoin is at an important moment – and in this blog we propose both areas for the ecosystem to double down on and ways we can track that progress. It is by no means exhaustive, but written from the vantage point of having been embedded in the Filecoin ecosystem for many years, gathering feedback from users, builders and the community, and having thought deeply about what is needed as the network moves forward. 


The blog is organized in the following sections: 

  • What matters for Filecoin in 2024
  • Why these matter and how to measure progress

It is our hope that with the right north star, teams will be able to better coordinate and identify convergences between project-level interests & ecosystem interests. The proposed framework and metrics should make it easier for capital and resource allocators in the ecosystem to evaluate the level of impact each team is creating, and distribute capital and resources accordingly. For startups, this can help frame where broader ecosystem efforts may dovetail into your roadmap and releases.


WHAT MATTERS IN 2024

  1. Accelerating conversions to paid deals: Helping Filecoin providers increase their paid services (storage, retrieval, compute) is critical for driving cashflows into Filecoin and to support sustainable funding of its hardware outside of token incentives.

  2. Growing on-chain activity: Filecoin is not aiming to be just another L1 fighting over the same use cases. But it does have a unique value proposition as a base layer with “real world” services anchored into it. This enables new use cases (programmable services, DeFi around cash flows, etc.) that are unique to Filecoin. Building out and growing adoption of these services can help prove that Filecoin is not just “a storage layer”, but an economy with a stable set of cash flows.

  3. Making Filecoin indispensable to others: Bull cycles mean velocity is critical – as is making Filecoin an indispensable part of the stack for more teams. There are many emerging themes to capitalize on (Chain Archival, Compute, AI) – and Filecoin positioning itself matters. The ecosystem collectively wins when more participants leverage Filecoin as a core part of their story. For individual teams, this means that shipping to your users matters. At the ecosystem level, it means orienting efforts to unblock the teams closest to driving integrations and building services on Filecoin.

The verticals in our framework remain relatively high-level – and many of these objectives will have their own set of tasks. But first, it is critical for the ecosystem to align on this being the right set of verticals to progress against. Below, we dive into each vertical and some tangible metrics the ecosystem should start tracking.



WHY THESE MATTER AND HOW TO MEASURE PROGRESS

1) Accelerating conversion to paid deals

As a storage network, Filecoin should maximize the cashflows it can bring into its economy. Having incentives as an accelerant is fine – but without a steady (and growing) ramp of paid deals, Filecoin can’t achieve its maximum potential.

Paid deals (when settled on-chain) are a net capital inflow into the Filecoin economy that can be the substrate for use cases uniquely possible in our ecosystem. DeFi as an example has a real opportunity to provide actual services to businesses (e.g. converting currencies to pay for storage).

There are two main paths by which we can drive growth of paid services:

  • Drive growth in existing services (data archival)
  • Expand to new markets with additional services (hot storage, compute, indexing, etc.)


In both cases, there’s work to be done to reduce friction for paid on-ramps or ship new features that raise the floor (as informed by on-ramps and projects trying to bring Filecoin services to market). It is critical that the Filecoin ecosystem collectively prioritizes the right efforts to make Filecoin services sellable, and allocates resources accordingly.

There are already a number of teams making substantial progress on this front (CID.Gravity, Seal Storage, Holon, Banyan, Lighthouse.storage, Web3Mine, Basin, among others) – and we can best measure progress by helping reduce their friction and helping drive their success.


We propose measuring success for this vertical in two forms:

  1. Dollars and Total Data Stored for Paid Deals (self reported)
  2. Dollars and Total Data Stored for Paid Deals (on-chain)

There are a number of initiatives from public goods teams along these efforts for the quarter (Q2 2024) which include: 

  • FilOz: is working on a FIP for new proofs to reduce storage costs and dramatically improve retrieval speeds
  • DeStor: is helping drive enterprise adoption for business ready on-ramps
  • Ansa Research, Filecoin Foundation, etc.: Web3 BD support for ecosystem builders
  • Targeted grant funding for efforts that directly support growth of sustainable on-chain paid deal activity


2) Growing on-chain activity

Filecoin, as an L1, has more than just its storage service. Building a robust on-chain economy is critical for accelerating the services and tooling with which others can compose. In the Filecoin ecosystem, we have a unique opportunity in that there are real economic flows to enable via paid on-chain deals.

Centering our on-chain economy around supporting those flows – be it from automating renewals, designing incentives for retrievals, creating endowments for perpetual storage, or building economic efficiency for the operators of the network – can lead to compounding growth as it creates a flywheel.

As Filecoin owns more of its own economic activity on-chain, value will accrue for the token – enabling ecosystem users to use Filecoin in more productive ways, generating real demand for services inside the ecosystem. 

We propose the following metrics for us to collectively measure success: 

  1. Contract calls
  2. Active Filecoin addresses
  3. Volume of on-chain payments

There are notable builders already seeding the on-chain infrastructure to leverage some of these primitives (teams like GLIF working on liquid staking, Lighthouse on storage endowments, and teams like Fluence enabling compute).

There’s a set of improvements that can dramatically reduce friction for driving on-chain activity, and several efforts are prioritized against this for Q2 2024:

  • FilOz: F3 to bring fast finality to Filecoin can both improve the bridging experience, and enable more “trade” between Filecoin and other economies (e.g. native payments from other ecosystems for services in Filecoin). 
  • FilOz: Refactoring how deals work on Filecoin to enable more flexible payment (e.g. with stablecoins)
  • FilPonto, FilOz: Reducing EVM tech debt to substantially reduce friction for builders porting Solidity contracts onto Filecoin (and hardening the surrounding infrastructure for more stable services)



3) Making Filecoin indispensable to others

This vertical is broad, but we would argue that there are two key ways to consider the impact the Filecoin ecosystem is driving:

  1. The first is along high-profile integrations, where Filecoin is critical to the success of the customer and its proposition. It is especially critical for the ecosystem to provide the necessary support for these cross-chain integrations.
  2. The second is along specific verticals, where there is a large and growing trend in activity; Filecoin is uniquely positioned to provide value here, both in terms of the primitives it has, as well as in its cost profile and scale.
    • Opportunities are brimming in Web3 at the moment, and the ecosystem should rally workstreams around on-ramps that are making Filecoin integral to narratives such as Compute, DePIN (sensors), Social, Gaming, AI, and Chain Archival.


We propose the following metric to evaluate Filecoin’s indispensability:

  1. Number of partnerships and integrations

There are a number of efforts from ecosystem teams aimed at helping onramps succeed on this front in the quarter (Q2 2024): 

  • Ansa Research, Filecoin Foundation, DeStor and others: Forming a new working group to accelerate shared ecosystem BD and marketing resources
    • Shared BD resources for builders in the Filecoin ecosystem
    • Shared Marketing resources and amplification (#ecosystem-amplification-requests in the Filecoin slack) to help signal boost ecosystem wins
    • Community Discord to help expand accessibility, visibility, and drive community engagement



FINAL THOUGHTS

After reading the above, we hope that the direction of Filecoin in the coming year is clearer. Filecoin is at a pivotal moment where many of its pieces are coming together. Protocols and ecosystems naturally evolve and each stage calls for different priorities and strategies for the next leg of growth. By focusing efforts in the ecosystem, we believe that the Filecoin ecosystem can make its resources and support go that much further. 

We are excited for what is to come and how Filecoin can continue to expand the pie for what can be done on Web3 rails. Moving forward, Ansa Research will post periodic updates on the key metrics for Filecoin’s ecosystem progress.

To stay updated on the latest Filecoin happenings, follow the @Filecointldr handle.

Disclaimer: This information is for informational purposes only and is not intended to constitute investment, financial, legal, or other advice. This information is not an endorsement, offer, or recommendation to use any particular service, product, or application.

Editor’s Note: This blogpost is a repost of the original content published on 5 April 2024, by Turan Vural and Yuki Yuminaga from Fenbushi Capital. Established in 2015, Fenbushi Capital holds the distinction of being Asia’s pioneering blockchain-focused asset management firm, with an AUM of $1.6 billion. Through research and investment, the firm aims to play a vital role in shaping the future of blockchain tech across diverse sectors. This blogpost is an example of these efforts, and represents the independent view of these authors, who have given permission for this re-publication.

Data availability (DA) is a core technology in the scaling of Ethereum, allowing a node to efficiently verify that data is available to the network without having to host the data in question. This is essential for the efficient building of rollups and other forms of vertical scaling, allowing execution nodes to ensure that transaction data is available during the settlement period. This is also crucial for sharding and other forms of horizontal scaling, a planned future update to the Ethereum network, as nodes will need to prove that transaction data (or blobs) stored in network shards are indeed available to the network.

Several DA solutions have been discussed and released recently (e.g., Celestia, EigenDA, Avail), all with the intent of providing performant and secure infrastructure for applications to post DA.

The advantage of an external DA solution over an L1 such as Ethereum is that it provides an inexpensive and performant vehicle for on-chain data. DA solutions often consist of their own public chains built to enable cheap and permissionless storage. Even with modifications, the fact remains that hosting data natively from a blockchain is extremely inefficient.

Thus, we find that it is intuitive to explore a storage-optimized solution such as Filecoin for the basis of a DA layer. Filecoin uses its blockchain to coordinate storage deals between clients and storage providers but allows data to be stored off-chain.

In this post, we investigate the viability of a DA solution built on top of a Distributed Storage Network (DSN). We consider Filecoin specifically, as it is the most adopted DSN to date. We outline the opportunities that such a solution would offer, and the challenges that need to be overcome to build it.

A DA layer provides the following to services relying on it:

  1. Client Safety: No node can be convinced that unavailable data is available.
  2. Global Safety: The un/availability of data is agreed upon by all except at most a small minority of nodes.
  3. Efficient data retrievability.

All of this needs to be done efficiently to enable scaling. A DA layer provides higher performance at a lower cost across the three points above. For example, any node can request a full copy of the data to prove custody, but this is inefficient. By having a system that provides all three of these, we achieve a DA layer that provides the security required for L2s to coordinate with an L1, along with stronger lower bounds in the presence of a malicious majority.

Custody of Data

Data posted to a DA solution has a useful lifetime: long enough to settle disputes or verify a state transition. Transaction data needs to be available only long enough to verify a correct state transition or to give validators enough opportunity to construct fraud proofs. As of writing, Ethereum calldata is the most common solution used by projects (rollups) requiring data availability.

Efficient Verification of Data

Data Availability Sampling (DAS) is the standard method of answering the question of DA. It comes with additional security benefits, strengthening network actors’ ability to verify state information from their peers. However, it relies on nodes to perform sampling: DAS requests must be answered to ensure mined transactions won’t be rejected, yet nodes that request samples face no reward for doing so and no penalty for skipping DAS altogether. As an example, Celestia provides the first and only light client implementation to perform DAS, delivering stronger security assumptions to users and reducing the cost of data verification.
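
A back-of-the-envelope illustration of why sampling is effective when nodes do perform it, treating samples as independent and using illustrative numbers:

```python
# Chance that a single light client's random samples all land on available
# shares when an adversary withholds a fraction of them.
withheld = 0.5   # fraction of shares withheld by the adversary
samples = 20     # samples requested by one light client
p_fooled = (1 - withheld) ** samples
print(f"P(one client sees no missing share): {p_fooled:.2e}")  # ~9.5e-07
```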

Efficient Access

A DA needs to provide efficient access to data to the projects using it. A slow DA may become the bottleneck for the services relying on it, causing inefficiencies at best and system failures at worst.

Decentralized Storage Network

A Decentralized Storage Network (DSN, as formalized in the Filecoin whitepaper) is a permissionless network of storage providers that offer storage services to users of the network. Informally, it allows independent storage providers to coordinate storage deals with clients, providing cheap and resilient data storage. This is coordinated through a blockchain that records storage deals and enables the execution of smart contracts.

A DSN scheme is a tuple of three protocols: Put, Get, and Manage. This tuple comes with properties such as fault tolerance guarantees and participation incentives.

Put(data) → key
Clients execute Put to store data under a unique key. This is achieved by specifying the duration for which data will be stored on the network, the number of replicas of the data that are to be stored for redundancy, and a negotiated price with storage providers.

Get(key) → data
Clients execute Get to retrieve data that is being stored under a key.

Manage()
The Manage protocol is called by network participants to coordinate the storage space and services made available by providers and repair faults. In the case of Filecoin, this is managed via a blockchain. This blockchain records data deals being made between clients and data providers and proofs of correctly stored data to ensure that data deals are being upheld. Correctly stored data is proved via the posting of proofs generated by data providers in response to challenges from the network. A storage fault occurs when a storage provider fails to generate a Proof-of-Replication or Proof-of-Spacetime promptly when requested by the Manage protocol, which results in the slashing of the storage provider’s stake. Deals can self-heal in the case of a storage fault if more than one provider is hosting a copy of the data on the network by finding a new storage provider to honor the storage deal.
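
A minimal sketch of the Put/Get/Manage interface described above; the in-memory store, deal fields, and content-hash keys are illustrative assumptions, not Filecoin’s actual protocol:

```python
import hashlib
from dataclasses import dataclass

@dataclass
class Deal:
    key: str
    duration_epochs: int   # how long the data must be stored
    replicas: int          # redundancy: number of copies on the network
    price_fil: float       # negotiated price

class ToyDSN:
    def __init__(self):
        self.store = {}

    def put(self, data: bytes, duration_epochs: int, replicas: int, price_fil: float) -> str:
        """Put(data) -> key: store data under the negotiated deal terms."""
        key = hashlib.sha256(data).hexdigest()
        self.store[key] = (data, Deal(key, duration_epochs, replicas, price_fil))
        return key

    def get(self, key: str) -> bytes:
        """Get(key) -> data: retrieve data stored under a key."""
        return self.store[key][0]

    def manage(self) -> None:
        """Placeholder: verify storage proofs, slash faulty providers, repair deals."""
        pass
```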

DSN Opportunities

The work done thus far in DA projects has been to transform a blockchain into a platform for hot storage. Since a DSN is storage-optimized, rather than transforming a blockchain into a storage platform, we can simply transform a storage platform into one that provides data availability. The collateral of storage providers in the form of native FIL token can provide crypto-economic security that guarantees data is stored. Finally, the programmability of storage deals can provide flexibility around the terms of data availability.

The most compelling motivation to transform the capabilities of a DSN to solve DA is the cost reduction in data storage under the DA solution. As we discuss below, the cost of storing data on Filecoin is significantly lower than on Ethereum. Given current Ether/USD prices, it costs over 3 million USD to write 1 GB of calldata to Ethereum, only to be pruned after 21 days. This calldata expense can contribute over half of the transaction cost of an Ethereum-based rollup. However, 1 GB of storage on Filecoin costs less than 0.0002 USD per month. Securing DA at this or any similar price would bring transaction costs down for users and contribute to the performance and scalability of Web3.

Economic Security

In Filecoin, collateral is required to make storage space available. This collateral is slashed when a provider fails to honor its deals or uphold network guarantees. A storage provider that fails to provide services faces losing both its posted collateral and any profit that would have been earned from providing storage.

Incentive Alignment

Many of Filecoin’s protocol incentives align with the goals of DA. Filecoin provides disincentives for malicious or lazy behavior: storage providers must actively provide proofs of storage during consensus in the form of Proof-of-Replication and Proof-of-Spacetime, continuously proving that the storage exists without honest majority assumptions. Failure of a storage provider to provide proof results in stake slashing and removal from consensus, among other penalties. Current DA solutions lack incentives for nodes to perform DAS, relying on ad-hoc altruistic behavior for proof of DA.

Programmability

The ability to customize data deals also makes a DSN an attractive platform for DA. Data deals can have varying durations, allowing users of a DSN-based DA to pay for only the DA that they need. Fault tolerance can also be tuned by setting the number of copies that are to be stored throughout the network. Further customization is supported via smart contracts on Filecoin (called Actors), which are executed on the FEVM. This has led to Filecoin’s growing ecosystem of DApps, from compute-over-storage solutions such as Bacalhau to DeFi and liquid staking solutions such as Glif. Retriev makes use of Filecoin Actors to provide incentive-aligned retrieval with permissioned referees. Filecoin’s programmability can be used to tailor the DA requirements needed for different solutions, so that platforms relying on DA are not paying for more DA than they need.

Challenges to a DSN-Based DA Architecture

In our investigation, we have identified significant challenges that need to be overcome before a DA service can be built on a DSN. As we now talk about the feasibility of implementation, we will use Filecoin as our main focus of the discussion.

Proof Latency

The cryptographic proofs that ensure the integrity of deals and stored data on Filecoin take time to generate. When data is committed to the network, it is partitioned into 32 gigabyte sectors and “sealed.” The sealing of data is the foundation of both the Proof-of-Replication (PoRep), which proves that a storage provider is storing one or more unique copies of the data, and the Proof-of-Spacetime (PoST), which proves that a storage provider stored a unique copy continuously throughout the duration of the storage deal. Sealing has to be computationally expensive to ensure that storage providers aren’t sealing data on demand to undermine the required PoRep. When the protocol presents the periodic challenge to a storage provider to provide proof of unique and continuous storage, sealing has to safely take longer than the response window so that a storage provider can’t falsify proofs or replicas on the fly. For this reason, it can take providers approximately three hours to seal a sector of data.

Storage Threshold

Because of the computational expense of the sealing operation, the sector size of the data being sealed has to be economically worthwhile. The price of storage has to justify the cost of sealing to the storage provider, and likewise, the resulting cost of data being stored has to be low enough at scale (in this case, for an approximately 32GB chunk) for a client to want to store data on Filecoin. Although smaller sectors could be sealed, this would drive up the price of storage to compensate storage providers. To get around this, data aggregators collect smaller pieces of data from users to be committed to Filecoin as a chunk close to 32 GB. Data aggregators commit to users’ data via a Proof-of-Data-Segment-Inclusion (PoDSI), which guarantees the inclusion of a user’s data in a sector, and a sub-piece CID (pCID), which the user will be able to use to retrieve the data from the network.

Consensus Constraints

Filecoin’s consensus mechanism, Expected Consensus, has a block time of 30 seconds and finality within hours, which may improve in the near future (see FIP-0086 for fast finality on Filecoin). This is generally too slow to support the transaction throughput needed for a Layer 2 relying on DA for transaction data. Filecoin’s block time is lower-bounded by storage provider hardware; the lower the block time, the more difficult it is for storage providers to generate and provide proofs of storage, and the more storage providers will be falsely penalized for missing the proving window for the proper storage of data. To overcome this, InterPlanetary Consensus (IPC) subnets can be leveraged to take advantage of faster consensus times. IPC uses Tendermint-like consensus and DRAND for randomness: in the case that DRAND is the bottleneck, we would be able to achieve a 3-second block-time with an IPC subnet. In the case of a Tendermint bottleneck, PoCs such as Narwhal have achieved blocktimes in the hundreds of milliseconds.

Retrieval Speed

The final barrier to build is retrieval. From the constraints above, we can deduce that Filecoin is suitable for cold or lukewarm storage. However, DA data is hot and needs to support performant applications. Incentive-aligned retrieval is difficult in Filecoin: data needs to be unsealed before it is served to clients, which adds latency. Currently, rapid retrieval is done via SLAs or the storage of unsealed data alongside sealed sectors, neither of which can be relied on in the architecture of a secure and permissionless application on Filecoin. Even with Retriev demonstrating that retrieval guarantees can be enforced via the FVM, incentive-aligned rapid retrieval on Filecoin remains an area to be further explored.

Cost Analysis

In this section, we consider the cost that comes from these design considerations. We show the cost of storing 32GB as Ethereum calldata, Celestia blobdata, EigenDA blobdata, and as a sector on Filecoin using near-current market prices.

The analysis highlights the price of Ethereum calldata: 100 million USD for 32 GB of data. This price showcases the cost of the security behind Ethereum’s consensus, and is subject to the volatility of Ether and gas prices. The Dencun upgrade, which introduced Proto-Danksharding (EIP-4844), introduced blob transactions with a target of 3 blobs per block of approximately 125 KB each, and variable blob gas pricing to maintain the target number of blobs per block. This upgrade cut the cost of Ethereum DA to roughly one-fifth: 20 million USD for 32 GB of blob data.
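
As a rough reproduction of the calldata figure, using illustrative assumptions (16 gas per calldata byte applied to every byte, a 60 gwei gas price, and 3,300 USD per ETH, none of which come from the original analysis):

```python
GAS_PER_BYTE = 16        # calldata gas cost per (nonzero) byte
GAS_PRICE_GWEI = 60      # assumed gas price
ETH_USD = 3_300          # assumed ETH price

bytes_32gb = 32 * 10**9
gas = bytes_32gb * GAS_PER_BYTE
eth = gas * GAS_PRICE_GWEI * 1e-9   # convert gwei-denominated gas spend to ETH
print(f"{eth * ETH_USD / 1e6:.0f}M USD")  # ~101M USD for 32 GB of calldata
```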

Celestia and EigenDA provide significant improvements: 8,000 and 26,000 USD for 32 GB of data, respectively. Both are subject to the volatility of market prices and reflect to some extent the cost of consensus securing their data: Celestia with its native TIA token, and EigenDA with Ether.

In all of the above cases, the data stored is not permanent. Ethereum calldata is stored for 3 weeks, with blobs stored for 18 days. EigenDA stores blobs for a default of 14 days. As of the current Celestia implementation, blob data is stored indefinitely by archival nodes but only sampled by light nodes for a maximum of 30 days.

The final two tables are direct comparisons between Filecoin and current DA solutions. Cost equivalence first lists the cost of a single byte of data on the given platform. The amount of Filecoin bytes that can be stored for the same amount of time for the same cost is then shown.

This shows that Filecoin is orders of magnitude cheaper than current DA solutions, costing fractions of a cent to store the same amount of data for the same amount of time. Unlike the nodes of Ethereum and other DA solutions, Filecoin’s nodes are optimized to provide storage services, and its proof system allows nodes to prove storage rather than replicate it across every node in the network. Even without accounting for the economics of storage providers (such as the energy cost to seal data), this shows that the basic overhead of the storage process on Filecoin is negligible – and a market opportunity in the millions of USD per gigabyte compared to Ethereum for a system that can provide secure and performant DA services on Filecoin.

Throughput

Below, we consider the capacity of DA solutions and the demand that is generated by major layer 2 rollups.

Because Filecoin’s blockchain is organized in tipsets with multiple blocks at every block-height, the number of deals that can be done is not restricted by consensus or block size. The strict data constraint of Filecoin is that of its network-wide storage capacity, not what is allowed via consensus.

For daily DA demand, we pull data from Rollups DA and Execution from Terry Chung and Wei Dai, which includes a daily average across 30 days and a singular sampled day. This allows us to consider average demand while not overlooking aberrations from the average (for example, Optimism’s demand on 8/15/2023 of approximately 261,000,000 bytes was over 4x its 30 day average of 64,000,000 bytes).

From this selection, we see that despite the opportunity of lower DA cost, we would need a dramatic increase in DA demand to make efficient use of the 32 GB sector size of Filecoin. Although sealing 32 GB sectors with less than 32 GB of data would be a waste of resources, we could do so while still reaping a cost advantage.

Architecture

In this section, we consider the technical architecture that could be achieved if we were to build this today. We will consider this architecture in the context of arbitrary L2 applications and an L1 chain that the L2 is serving. Since this solution is an external DA solution, like that of Celestia and EigenDA, we do not consider Filecoin as the example L1.

Components

Even at a high-level, a DA on Filecoin will make use of many different features of the Filecoin ecosystem.

Transactions: Downstream users make transactions on a platform that requires DA. This could be an L2.

Platforms Using DA: These are the platforms that use DA as a service. This could be an L2 which posts transaction data to the Filecoin DA and commitments to an L1, such as Ethereum.

Layer 1: This is any L1 that contains commitments pointing to data on the DA solution. This could be Ethereum, supporting an L2 that leverages the Filecoin DA solution.

Aggregator: The frontend of a Filecoin-based DA solution is an aggregator, a centralized component which receives transaction data from L2s and other DA clients and aggregates it into 32 GB sectors suitable for sealing. Although a simple proof-of-concept would include a centralized aggregator, platforms using the DA solution could also run their own aggregator, for example as a sidecar to an L2 sequencer. The centralization of the aggregator can be seen as similar to that of an L2 sequencer or EigenDA’s disperser. Clients are given a guarantee that their data will be included in the sector in the form of a PoDSI (Proof of Data Segment Inclusion), and a pCID to identify their data once it is on the network. This pCID is what would be included in the state commitments on the L1 to reference supporting transaction data.

Verifiers: Verifiers request the data from the storage providers to ensure the integrity of state commitments and build fraud proofs, which are committed to the L1 in the case of provable fraud.

Storage Deal: Once the aggregator has compiled a payload near 32GB, the aggregator makes a storage deal with storage providers to store the data.

Posting blobs (Put): To initiate a put, a DA client submits their blob containing transaction data to the aggregator. This can be done off-chain, or on-chain via an on-chain aggregator oracle. To confirm receipt of the blob, the aggregator returns a PoDSI to the client, proving that their blob is included in the aggregated sector that will be committed to the network. A pCID (sub-piece Content IDentifier) is also returned; this is what the client and any other interested party will use to reference the blob once it is being served on Filecoin.
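The put flow can be sketched as a simple aggregator interface. Everything here (class and method names, the placeholder PoDSI and pCID) is a hypothetical illustration of the flow described above, not a real API:

```python
# Hypothetical sketch of the put flow: clients submit blobs, the aggregator
# returns a PoDSI and pCID, and a storage deal is made once ~32 GiB accumulates.
import hashlib
from dataclasses import dataclass, field

SECTOR_BYTES = 32 * 2**30  # target payload size before sealing

@dataclass
class PutReceipt:
    p_cid: str     # identifies the blob once it is served on Filecoin
    podsi: bytes   # proof that the blob is included in the aggregated sector

@dataclass
class Aggregator:
    pending: list = field(default_factory=list)
    pending_bytes: int = 0

    def put(self, blob: bytes) -> PutReceipt:
        """Accept a DA client's blob; confirm receipt with a PoDSI and pCID."""
        self.pending.append(blob)
        self.pending_bytes += len(blob)
        p_cid = hashlib.sha256(blob).hexdigest()  # stand-in for a real pCID
        podsi = b"podsi-placeholder"              # stand-in for a real PoDSI
        if self.pending_bytes >= SECTOR_BYTES:
            self._make_storage_deal()
        return PutReceipt(p_cid=p_cid, podsi=podsi)

    def _make_storage_deal(self) -> None:
        # In a real system: pad/aggregate blobs into a 32 GiB piece and
        # negotiate a storage deal with storage providers.
        self.pending.clear()
        self.pending_bytes = 0
```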

Data deals would appear on-chain within minutes of being made. The largest contributor to latency is sealing time, which can take 3 hours. This means that although the deal has been made, and the client can be confident that the data will appear on the network, the data cannot be guaranteed to be queryable until the sealing process is complete. The Lotus client has a fast-retrieval feature that stores an unsealed copy of the data alongside the sealed copy; this copy may be servable as soon as it has been transferred to the storage provider, provided the retrieval deal does not depend on the proof of sealed data appearing on the network. However, this functionality is at the discretion of the storage provider and is not cryptographically guaranteed as part of the protocol. Providing a fast-retrieval guarantee would require changes to consensus, with incentive and disincentive mechanisms in place to enforce it.

Retrieving blobs (Get): Retrieval is similar to a put operation. A retrieval deal needs to be made, which will appear on-chain within minutes. Retrieval latency will depend on the terms of the deal and on whether an unsealed copy of the data is stored for fast retrieval. In the fast-retrieval case, latency will depend on network conditions. Without fast retrieval, the data must be unsealed before being served to the client, which takes about as long as sealing, on the order of 3 hours. Thus, without optimizations, we have a maximum round trip of 6 hours; major improvements in data serving would be needed before this becomes a viable system for DA or fraud proofs.

Proof of DA: Proof of DA can be considered in two steps: first, the PoDSI given when the data is committed to the aggregator while the deal is being made; then, the continued PoRep and PoST commitments that storage providers post via Filecoin's consensus mechanism. As discussed above, PoRep and PoST give scheduled and provable guarantees of data custody and persistence.

This solution will make heavy use of bridging, as any client that relies on DA (regardless of the construction of proofs) will need to be able to interact with Filecoin. In the case of the pCID included in the state transition posted to the L1, a verifier can make an initial check to ensure that a bogus pCID wasn't committed. There are several ways this could be done: for example, via an oracle that posts Filecoin data on the L1, or via verifiers that check for the existence of a data deal or sector corresponding to the pCID. Likewise, the verification of validity or fraud proofs posted to the L1 may need to make use of a bridge to be convinced of a proof. Currently available bridges include Axelar and Celer.
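For example, the initial pCID sanity check might look like the following hypothetical sketch, where `filecoin_view` stands in for whichever mechanism gives the verifier visibility into Filecoin state (an L1 oracle, a bridge such as Axelar or Celer, or a self-hosted node):

```python
# Hypothetical verifier-side check that a pCID posted with an L2 state
# commitment actually corresponds to a live data deal on Filecoin.

def check_commitment(p_cid: str, filecoin_view) -> bool:
    """Reject state commitments that reference a bogus pCID."""
    deal = filecoin_view.find_deal_for_piece(p_cid)  # hypothetical lookup
    return deal is not None and deal.is_active()     # hypothetical deal API
```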

Security Analysis

Filecoin's integrity is enforced through the slashing of collateral. Collateral can be slashed in two cases: storage faults and consensus faults. A storage fault corresponds to a storage provider being unable to provide proof of stored data (either PoRep or PoST), which corresponds to a lack of data availability in our model. A consensus fault corresponds to malicious action in consensus, the protocol that manages the transaction ledger from which the FEVM is abstracted.

  • Sector Fault refers to the penalty incurred for failing to post proof of continuous storage. Storage providers are allowed a one-day grace period during which no penalty is incurred for faulty storage. Once a sector has been faulty for 42 days, the sector is terminated. Incurred fees are burnt. The penalty is denominated in the sector's estimated daily block reward BR(t):

BR(t) = ProjectedRewardFraction(t) * SectorQualityAdjustedPower

  • Sector Termination occurs after a sector has been faulty for 42 days or a storage provider purposefully terminates a deal. Termination fees are equivalent to the maximum amount that a sector has earned up to termination, with an upper bound of 90 days' worth of earnings (a code sketch of the fee formula follows this list). Unpaid deal fees are returned to the client. Incurred fees are burnt.

max(SP(t), BR(StartEpoch, 20d) + BR(StartEpoch, 1d) * terminationRewardFactor * min(SectorAgeInDays, 140))

  • Storage Market Actor Slashing occurs in the event of a terminated deal. This is the slashing of the collateral that the storage provider puts up behind the deal.
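To make the termination fee concrete, here is a minimal sketch of the formula above. BR values are treated as caller-supplied estimates of block rewards earned over the given window, and terminationRewardFactor is assumed to be 0.5 purely for illustration; the exact parameters live in the Filecoin actors code.

```python
# Sketch of the termination fee formula quoted above. br_20d and br_1d stand
# for BR(StartEpoch, 20d) and BR(StartEpoch, 1d); termination_reward_factor
# is assumed to be 0.5 here for illustration only.

def termination_fee(sp_t: float, br_20d: float, br_1d: float,
                    sector_age_days: float,
                    termination_reward_factor: float = 0.5) -> float:
    """max(SP(t), BR(StartEpoch, 20d)
              + BR(StartEpoch, 1d) * factor * min(SectorAgeInDays, 140))"""
    capped_age = min(sector_age_days, 140)
    return max(sp_t, br_20d + br_1d * termination_reward_factor * capped_age)

# Example: a 10-day-old sector with SP(t) = 5 FIL and assumed BR estimates.
print(termination_fee(sp_t=5.0, br_20d=4.0, br_1d=0.2, sector_age_days=10))
```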

The security provided by Filecoin is very different from that of other blockchains. Whereas blockchain data is typically secured via consensus, Filecoin's consensus secures only the transaction ledger, not the data referred to by the transactions. Data stored on Filecoin has only enough security to incentive-align storage providers to provide storage; it is secured via fault penalties and business incentives such as reputation with clients. In other words, a data fault on a typical blockchain is equivalent to a breach of consensus, breaking the safety of the chain or its notion of transaction validity. Filecoin, by contrast, is designed to be fault tolerant when it comes to data storage, and therefore uses its consensus only to secure its dealbook and deal-related activities. The cost to a storage miner of not fulfilling its data deal is capped at 90 days' worth of storage rewards in penalties, plus the loss of the collateral the miner put up to secure the deal.

Therefore, the cost of a data withholding attack launched by Filecoin providers is simply the opportunity cost of a retrieval deal. Data retrieval on Filecoin relies on the storage miner being incentivized by a fee paid by the client. However, there is no negative consequence for a miner that does not respond to a data retrieval request. To mitigate the risk of a single storage miner ignoring or refusing retrieval deals, data on Filecoin can be stored by multiple miners.

Since the economic security behind data stored on Filecoin is considerably less than that of blockchain-based solutions, the prevention of data manipulation must also be considered. Data manipulation is guarded against by Filecoin's proof system. Data is referred to via CIDs, through which corruption is immediately detectable: a provider cannot serve corrupt data, as it is easy to verify whether the fetched data matches the requested CID. Nor can providers store corrupted data in place of uncorrupted data: upon receipt of client data, providers must present proof of a correctly sealed data sector to initiate the data deal, so a storage deal cannot be started with corrupt data. During the lifetime of the storage deal, PoSTs are provided to prove custody (recall that this proves both custody of the sealed data sector and custody since the last PoST). Since the PoST relies on the sealed sector at the time of proof generation, a corrupt sector would produce a bogus PoST, resulting in a sector fault. A storage provider can therefore neither store nor serve corrupted data, cannot claim rewards without custody of the uncorrupted data, and cannot avoid being penalized for tampering with a client's data.
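The retrieval-side check is simple enough to sketch directly. A real pCID is a multihash over the (padded) piece rather than a bare SHA-256 digest, so the following is a simplification of the idea, not Filecoin's actual CID format:

```python
# Why corrupt data is immediately detectable: with content addressing, the
# identifier is a hash of the data itself. (SHA-256 hex is a simplification
# of a real CID.)
import hashlib

def cid_of(data: bytes) -> str:
    return hashlib.sha256(data).hexdigest()

def verify_retrieval(requested_cid: str, fetched: bytes) -> bool:
    """Accept fetched bytes only if they hash back to the requested CID."""
    return cid_of(fetched) == requested_cid

blob = b"rollup transaction batch"
cid = cid_of(blob)
assert verify_retrieval(cid, blob)                  # intact data passes
assert not verify_retrieval(cid, blob + b"tamper")  # corrupted data fails
```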

Security can be strengthened by increasing the collateral committed by the storage provider to the Storage Market Actor, which is currently negotiated between the storage provider and the client. If we assume this collateral is sufficiently high (for example, the same stake as an Ethereum validator) to incentivize a provider not to default, we can consider what remains to be secured (even though this would be extremely capital-inefficient, as this stake would be needed to secure each transaction blob or sector of aggregated blobs). Even then, a data provider could choose to make data unavailable for stretches of up to 41 days at a time before the storage deal is terminated by the Storage Market Actor; for a shorter data deal, the data could be made unavailable until the last day of the deal. In the absence of coordinated malicious actors, this can be mitigated via replication across multiple storage providers so that the data can continue being served.
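As a toy model (an assumption of this post, not a protocol guarantee): if each of k storage providers independently withholds data with probability p, all replicas are simultaneously unavailable with probability p^k, so availability improves exponentially with the number of replicas.

```python
# Toy replication model: independent withholding with probability p per
# provider. Real providers may fail in correlated ways, which this ignores.

def prob_all_withheld(p: float, k: int) -> float:
    return p ** k

for k in (1, 3, 5):
    print(k, prob_all_withheld(0.1, k))  # 0.1, then 0.001, then 1e-05
```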

We can also consider the cost of an attacker overriding consensus to either accept a bogus proof or rewrite ledger history to remove a deal from the orderbook without penalizing the responsible storage provider. It is worth noting that in the case of such a safety violation, an attacker would be able to manipulate Filecoin's ledger however they want. To commit such an attack, the attacker would need at least a majority of stake in the Filecoin chain. Stake is tied to storage provided to the network; with roughly 25 EiB (1 EiB = 2⁶⁰ ≈ 1.15 × 10¹⁸ bytes) of data currently securing the Filecoin chain, at least 12.5 EiB would be needed for a malicious actor to offer its own chain that would win the fork-choice rule. This is further mitigated by slashing for consensus faults, for which the penalty is the loss of all pledged collateral and block rewards, plus suspension from participation in consensus.

Aside: Withholding attacks on other DA solutions
Although the above shows that Filecoin's protection of data against withholding attacks is limited, it is not alone.

  • Ethereum: In general, the only way to guarantee that a request to the Ethereum network is answered is to run a full node. Full nodes have no obligation to fulfill data retrieval requests outside of consensus, and can therefore ignore them without penalty. Constructs such as PeerDAS introduce a peer-scoring system for a node's responses to data retrieval, in which a node with a low enough score (essentially a DA reputation) can be isolated from the network.
  • Celestia: Even though Celestia has much stronger per-byte security against withholding attacks than our Filecoin construction, the only way to take advantage of this security is to host your own full node. Requests to Celestia infrastructure that is not owned and operated in-house can be censored without penalty.
  • EigenDA: Similar to Celestia, any service can run an EigenDA Operator node to ensure retrieval of its own data. As such, any out-of-protocol data retrieval request can be censored. Also note that EigenDA has a centralized and trusted disperser in charge of data encoding, KZG commitments, and data dispersal, similar to our aggregator.

Retrieval Security

Retrievability is necessary for DA. Ideally, market forces motivate economically rational miners to accept retrieval deals and to compete with other miners to keep prices low for clients. It is assumed that this is enough for data providers to offer retrieval services; however, given the importance of DA, it is reasonable to require more security.

Retrieval is currently not guaranteed by the economic security described above. This is because it is cryptographically difficult to prove, in a trust-minimized manner, that data wasn't received by a client (in the case where a client needs to refute a storage miner's claim of having sent the data). A protocol-native retrieval guarantee would be required for retrieval to be secured through Filecoin's economic security. With minimal changes to the protocol, this means that retrieval failures would need to be associated with a sector fault or deal termination. Retriev is a proof-of-concept that was able to provide data retrieval guarantees by using trusted "referees" to mediate data retrieval disputes.

Aside: Retrieval on other DA solutions
As can be seen above, Filecoin lacks the protocol-native retrieval guarantees necessary to keep storage (or retrieval) providers from acting selfishly. In the case of Ethereum and Celestia, the only way to guarantee that data can be read from the protocol is to self-host a full node or trust an SLA from an infrastructure provider. The analogous setting in Filecoin is not trivial: one would have to become a storage provider (at significant infrastructure cost) and successfully win the very storage deal one posted as a user, at which point one would be paying oneself to provide storage to oneself.

Latency Analysis

Latency on Filecoin is determined by several factors, such as network topology, storage client configuration, and hardware capabilities. We provide a theoretical analysis that discusses these factors and the performance that can be expected of our construction.

Due to the design of Filecoin's proof system and its lack of retrieval incentives, Filecoin is not optimized for high-performance round-trip latency from the initial posting of data to its first retrieval. High-performance retrieval on Filecoin is an active area of research that is constantly changing as storage providers increase their capabilities and as Filecoin introduces new features. We define a "round trip" as the time from the submission of a data deal to the earliest moment the submitted data can be downloaded from Filecoin.

Block Time
In Filecoin's Expected Consensus, data deals can be included within the 30-second block time. One hour is the typical confirmation time for sensitive on-chain data (such as coin transfers).

Data Processing
Data processing time varies widely between storage providers and configurations. The sealing process is designed to take 3 hours with standard storage mining hardware. Miners often beat this 3-hour threshold via special client configurations, parallelization, and investment in more capable hardware. This variation also affects the duration of sector unsealing, which can be circumvented altogether by the fast-retrieval options in Filecoin client implementations such as Lotus. The fast-retrieval setting stores an unsealed copy of data alongside the sealed data, significantly speeding up retrieval. Based on this, we can assume a worst-case delay of three hours from the acceptance of a data deal to when the data is available on-chain.
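Putting the nominal figures in this section together gives the round-trip bounds quoted earlier:

```python
# Worst-case round-trip estimate from the nominal durations above
# (30 s block time, ~3 h sealing, ~3 h unsealing); these are not measurements.

BLOCK_TIME_H = 30 / 3600  # deal inclusion within one block
SEAL_H = 3                # nominal sealing time
UNSEAL_H = 3              # unsealing takes roughly as long as sealing

def round_trip_hours(fast_retrieval: bool) -> float:
    if fast_retrieval:
        # Unsealed copy kept alongside the sealed sector; retrieval is then
        # dominated by network transfer, treated as ~0 here.
        return BLOCK_TIME_H + SEAL_H
    return BLOCK_TIME_H + SEAL_H + UNSEAL_H

print(round_trip_hours(True))   # ~3 hours
print(round_trip_hours(False))  # ~6 hours
```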

Conclusion and Future Directions

This article explores building a DA solution by leveraging an existing DSN, Filecoin. We considered the requirements of DA with respect to its role as a critical element of Ethereum's scaling infrastructure, assessed the viability of DA on a DSN by building on top of Filecoin, and used this to weigh the opportunities such a solution would offer the Ethereum ecosystem, or any ecosystem that would benefit from a cost-effective DA layer.

Filecoin proves that a DSN can dramatically improve the efficiency of data storage in a distributed, blockchain-based system, with savings on the order of 100 million USD per 32 GB written at current market prices. Even though demand for DA is not yet high enough to fill 32 GB sectors, the cost advantage holds even if partially empty sectors are sealed. Although the current latency of storage and retrieval on Filecoin is not appropriate for hot-storage needs, storage-miner-specific implementations can provide reasonable performance, with data available in under 3 hours.

Trust in Filecoin storage providers can be tuned via variable collateral, as in EigenDA. Filecoin extends this tunable security by allowing a number of replicas to be stored across the network, adding tunable Byzantine tolerance. Guaranteed and performant data retrieval would need to be solved to robustly deter data withholding attacks; however, as with any other solution, the only way to truly guarantee retrievability is to self-host a node or trust infrastructure providers.

We see opportunities for DA in the further development of PoDSI, which could be used (alongside Filecoin's current proofs) in place of DAS to guarantee data inclusion in a larger sealed sector. Depending on how this develops, it may make a slow turnaround of data tolerable, as fraud proofs could be posted within a window of 1 day to 1 week, while DA could be guaranteed on demand. PoDSIs are still new and under heavy development, so we make no claims yet about what an efficient PoDSI could look like, or about the machinery needed to build a system around it. Given that solutions already exist for compute on top of Filecoin data, a system that computes a PoDSI over sealed or unsealed data may not be out of the realm of near-future possibilities.

As both the field of DA and Filecoin grow, new combinations of solutions and enabling technologies may give rise to new proofs of concept. As Solana's integration with the Filecoin network shows, DSNs hold potential as a scaling technology. The cost of data storage on Filecoin presents an open opportunity with a wide margin for optimization. Although the challenges discussed in this article are presented in the context of enabling DA, their eventual solutions will open up a plethora of new tools and systems to be built beyond DA.

¹ Although this isn’t the construction of Filecoin, it is useful for those who are unfamiliar with programmable decentralized storage.

Graph data from the Filecoin spec, EIP-4844, EigenDA, the Celestia implementation, Celenium, Starboard, file.app, Rollups DA and Execution, and current approximate market prices.

For more research pieces from Fenbushi Capital, check out their Medium page here.

To stay updated on the latest Filecoin happenings, follow the @Filecointldr handle.

Disclaimer: This information is for informational purposes only and is not intended to constitute investment, financial, legal, or other advice. This information is not an endorsement, offer, or recommendation to use any particular service, product, or application.