The Filecoin network launched in October 2020, introducing an incentive layer to the IPFS protocol and enabling open services for data. Today, the network primarily focuses on storage as an open service — however, an earlier blog post summarizing the direction of the network presents the larger vision for Filecoin to also “include the infrastructure to store, distribute, and transform data.”
The Filecoin “master plan” to execute this vision begins with accumulating a critical mass of hardware resources (storage capacity and compute power). This is important, as the only way web3 infrastructure will credibly subsume (not just compete with) the traditional cloud, is if it can operate at orders of magnitude and scale that exceed current offerings. While no web3 protocol has achieved this yet, Filecoin has made the most meaningful progress.
Furthermore, to drive long-term demand for network resources (capacity, retrieval, and compute power), it’s critical to bootstrap the network with useful data and to develop the software and tooling to enable compute and composable services on top of data. Ultimately, the demand for these services will be the basis for a robust economy on top of the Filecoin blockchain.
Filecoin’s tokenomics were designed with this long-term vision in mind and have helped incentivize the network’s rapid growth and development; to date, the network contains >16 EiB of committed storage capacity, with an explosion of ecosystem developments driving demand for network resources. This growth and continued success is due to the efforts of the Filecoin community, an interconnected network of Storage Clients, Developers, Storage Providers (SP’s), Ecosystem Partners, and Token Holders.
Filecoin the Token
This global digital economy also requires a singular valid currency for transactions. Since Filecoin is a permissionless market with cryptographically verifiable goods, design constraints could only be satisfied with a native utility token, the filecoin (see Engineering the Filecoin Economy for further color). The FIL token serves a number of functions:
Pays for messages on-chain.
Used as collateral creating economic incentives for reliable data storage over time and to secure the blockchain (and subnets or “shards” once Interplanetary Consensus goes live!).
Burned to regulate shared resources (blockspace).
Unlike other storage networks, the Filecoin token is primarily concerned with incentivizing reliable services and facilitating the on-chain economy. The storage market exists off-chain, but is anchored on-chain by messages containing cryptographic proofs of storage. Crucially, this means that the token can accrue value without creating undue pressure on users of the network’s various services (storage, retrieval, compute, etc), while Token Holders can still benefit from FIL consumption due to increased utilization (i.e. demand for blockspace).
Cryptoeconomics for Token Holders
Filecoin’s Token Allocation
Filecoin’s cryptoeconomic constructions help ensure that value accrual for participants aligns with the long-term utility of the protocol. As such, the initial allocation of Filecoin at network launch was designed to support a protocol that incentivizes sustainable value creation (see below).
Source: Filecoin Spec. For illustrative purposes only.
As of current protocol specifications, a maximum of 2 Billion FIL will ever be created; of this, 70% was allocated for storage and related services (i.e. minting tokens to reward Storage Providers), and 20% (vesting over 6 years, starting October 2020) was allocated to Protocol Labs and the Filecoin Foundation to support network development, adoption, and ecosystem growth. The remainder was FIL allocated to SAFT investors, which began vesting in October 2020 over time periods ranging between 6 months and 3 years.
2 Billion FIL is the maximum amount that could theoretically be minted and vested. However, it may not be the amount of FIL that enters the network’s circulating supply:
The 300 Million FIL held as a Mining Reserve would require a protocol upgrade to be tapped — meaning the community determines how much (if any) of this should be released.
The growth of the network requires utilization and consumption of the token, reducing the available token supply. As of September 2022 end, ~520 Million FIL has been minted or vested; of this, approximately 70% is in circulationsince significant amounts of FIL is burned (permanently removed from circulation) due to network transaction fees, or locked as collateral to secure the network and incentivize reliable storage. As the network matures, the rate of token emissions (vesting and minting) is expected (and was designed) to taper, since Filecoin has finite vesting schedules and a minting model in which emissions are indexed to network growth.
Token Supply: Sources and Sinks
Token Holders can benefit from an understanding of the sources and sinks that determine token supply over time because it potentially informs their relative purchasing power.
For informational purposes only and may be based on assumptions/third party info.
Minting, Vesting, Locking, and Burning contribute to the net inflow or outflow of tokens from the circulating supply. See below for the evolution of Filecoin’s token supply since mainnet launch.
Source: Starboard Ventures. Data as of September 30, 2022. This chart represents purely historical data. Filecoin circulating supply cannot be accurately predicted because it is subject to market inputs.
Currently, the network is expanding; tokens vesting for The Filecoin Foundation and Protocol Labs support ecosystem development and growth, while block rewards for Storage Providers subsidize the onboarding of deals and storage capacity. This presents a tremendous opportunity for Token Holders to participate in the growth of the ecosystem since they can lend to SP’s, facilitating SP’s much-needed access to FIL, and share in resulting network rewards. In this way, Token Holders can index themselves to token emissions whilst also supporting network growth and advancing Filecoin’s vision.
Ecosystem Roadmap: Implications for Token Holders
We are at an incredibly exciting point in the Filecoin Ecosystem, with continued improvements in User Programmability, Data Onboarding, Data Retrievability, Scalability, and Computation slated for implementation over the next 4 quarters. These initiatives should have positive effects for Token Holders through supporting client demand, unlocking various network use cases, and ultimately driving usage of the token. Some of these important improvements as well as their potential economic implications are discussed below.
Filecoin Virtual Machine (FVM)
The Filecoin Virtual Machine unlocks boundless possibilities, ranging from programmable storage primitives, to cross-chain interoperability bridges, to data-centric Decentralized Autonomous Organizations (DAOs), and Layer 2 solutions. In short, this means smart contracts and user programmability are coming to Filecoin! Slated for release in H1 2023, this upgrade to the network should positively impact Token Holders since it may increase FIL’s use cases, and will likely impact the “sinks” component of the circulating supply equation (FIL Locked and Burnt).
FVM and “Sinks” to Token Supply
As long as there is action or utility on the network, FIL tokens will be consumed (“burned”) to compensate for the computation and storage resources on-chain messages consume. Introducing smart contract capabilities may increase demand for block space, resulting in increased burnt FIL. This rate of token consumption is in the hands of the community as network participants compete for on-chain resources.
Furthermore, the FVM protocol upgrade may increase the amount of FIL put to use and/or locked. Currently, the majority of FIL locked on the network comes from Storage Provider collateral. FVM introduces the possibility of substantial funds locked to support various smart contract applications. Decentralized Finance (DeFi) protocols are just a subset of applications that may leverage Filecoin’s Proof of Useful Work chain, not only increasing token burn, but also generating new use cases for locking/staking FIL.
The FVM is unique in that it brings expressivity to the Filecoin network — while today you can contract a storage provider to store your data, with the FVM you can add additional rules, automation, and composability with other services (e.g. DeFi). As other components of the roadmap land (retrieval markets, compute over data, etc) — we expect the FVM to facilitate more sophisticated offerings leveraging Filecoin’s base primitives, promoting broader network adoption.
Note: For a flavor of what might be possible, this tweet thread might help elucidate how one might use smart contracts and the base primitives of Filecoin to build more sophisticated offerings.
Filecoin Plus and Data Onboarding
Filecoin Plus (FIL+) emerged as a pragmatic solution to incentivize productive use of the Filecoin Network. Since it is difficult to algorithmically distinguish real useful data from generated randomness, FIL+ introduced a layer of social trust to the network. In the FIL+ program, clients that undergo a verification process are granted a novel resource, DataCap, with which they can spend and make storage deals with SP’s. SP’s are incentivized to enter into storage deals with Clients who spend DataCap, because FIL+ deals boost the amount of block rewards they receive as compared to regular (non-FIL+) deals, or through committing storage capacity to the network. In order to receive these boosted rewards from FIL+, Storage Providers put up more collateral (roughly 10x the collateral for non-FIL+ storage sectors), and lock more tokens on the network.
Year to date, we have seen robust growth in data from FIL+ clients, indicating increasing amounts of useful data onboarded.
Source: Starboard Ventures. Data as of September 30, 2022. For illustrative purposes only.
From a tokenomics perspective, this trend persisting may increase tokens locked on the network and decrease the circulating token supply. More importantly, as the Filecoin network grows towards its mission of storing useful data, so too may the consumption and utilization of the token.
Interplanetary Consensus
Interplanetary Consensus (IPC) is an upcoming network upgrade designed to increase the use cases that Filecoin can support while also improving the network’s scalability, throughput, and finality. An exciting blockchain innovation that horizontally scales the chain with subnets (shards), IPC also has some potential cryptoeconomic implications, and may increase the:
amount of FIL burned while also decreasing gas fees for users due to scalability improvements coupled with FVM utilization.
amount of FIL locked (or staked) as collateral in order to secure subnets.
demand for FIL consumption with shards supporting customized on-chain use cases.
Conclusion
As a novel data storage and application network, Filecoin’s mission is to create a decentralized, efficient, and robust foundation for humanityʼs information. Developments in the Filecoin ecosystem to support this mission are numerous, ongoing, and can positively impact Token Holders. By design, the protocol’s cryptoeconomics intend to reward long-term participation and contribution to the network. Community participants can benefit from understanding the tokenomics, and, in particular the sources of token inflow and outflow that undergird the token economy. Looking forward, continuing improvements to the protocol may present new and exciting use cases for the network and the token, potentially enabling further Token Holder engagement and positively driving the value proposition of this fast-growing ecosystem.
Disclaimer: Personal views and not reflective of my employer nor should be treated as “official”. This information is intended to be used for informational purposes only. It is not investment advice. Although we try to ensure all information is accurate and up to date, occasionally unintended errors and misprints may occur.
Many thanks to Jon Victor (@jnthnvctr), as well as to the TLDR and CryptoEconLab teams for their help shaping this.
There is an overwhelming amount of work going on in the Filecoin ecosystem, and it can be difficult to see how all the pieces fit together. In this blog post, I’m going to explain the structure of Filecoin and various components of the roadmap to hopefully simplify navigating the ecosystem. This blog is organized into the following sections:
What is Filecoin?
Diving into the Major Components
Final Thoughts
This post is intended to be a primer on the major goings-on in Filecoin land; it is by no means exhaustive of everything happening! Hopefully, this post serves as a useful anchor and the embedded links are jumping-off points for the intrepid reader.
What is Filecoin?
My short answer: Filecoin is enabling open servicesfor data, built on top of the IPFS protocol.
IPFS allows data to be uncoupled from specific servers —reducing the siloing of data to specific machines. In IPFS land, the goal is to allow permanent references to data — and do things like compute, storage, and transfer — without relying on specific devices, cloud providers, or storage networks. Why content addressing is super powerful and what CIDs unlock is a separate topic — worthy of its own blog post — that I won’t get into here.
Filecoin is an incentivized network on top of IPFS — in that it allows you to contract out services around data on an open market.
Today, Filecoin focuses primarily on storage as an open service— but the vision includes the infrastructure to store, distribute, and transform data. Looking at Filecoin through this lens, the path the project is pursuing and the bets/tradeoffs that are being taken become clearer.
It’s easier to bucket Filecoin into a few major components:
There are 3 core pillars of Filecoin, enabled by 2 critical protocol upgrades
Storage Market(s): Exists today (cold storage), improvements in progress.
Retrieval Market(s): In progress
Compute over Data (Off-chain Compute): In progress
FVM (Programmable Applications): In progress
Interplanetary Consensus (Scaling): In progress
Diving into the Major Components
Storage Market(s)
Storage is the bread and butter of the Filecoin economy. Filecoin’s storage network is an open market of storage providers — all offering capacity on which storage clients can bid. To date, there are 4000+ storage providers around the world offering 17EiB (and growing) of storage capacity.
Filecoin is unique in that it uses two types of proofs (both related to storage space and data) for its consensus: Proof-of-replication (PoRep) and Proof-of-Spacetime (PoST).
PoRep allows a miner to prove both that they’ve allocated some amount of storage space AND that there is a unique encoding of some data (could be empty space, could be a user’s data) into that storage space. This proves that a specific replica of data is being stored on the network.
PoST allows a miner to prove to the network that data from sets of storage space are indeed still intact (the entire network is checked every 24 hrs). This proves that said data is being stored (space) over time.
These proofs are tied to economic incentives to reward miners who reliably store data (block rewards) and severely penalize those who lose data (slashing). One can think of these incentives like a cryptographically enforced service-level agreement, except rather than relying on the reputation of a service provider — we use cryptography and protocols to ensure proper operation.
In summary, the Filecoin blockchain is a verifiable ledger of attestations about what is happening to data and storage space on the network.
A few features of the architecture that make this unique:
The Filecoin Storage Network (total storage capacity) is 17EiB of data — yet the Filecoin blockchain is still verifiable on commodity hardware at home. This gives the Filecoin blockchain properties similar to that of an Ethereum or a Bitcoin, but with the ability to manage internet-scale capacity for the services anchoring into the blockchain.
This ability is uniquely enabled by the fact that Filecoin uses SNARKs for its proofs — rather than storing data on-chain. In the same way zk-rollups can use proofs to assert the validity of some batched transactions, Filecoin’s proofs can be used to verify the integrity of data off-chain.
Filecoin is able to repurpose the “work” that storage providers would normally do to secure our chain via consensus to also store data. As a result, storage users on the network are subsidized by block rewards and other fees (e.g. transaction fees for sending messages) on the network. The net result is Filecoin’s storage price is super cheap (best represented in scientific notation per TiB/year).
Filecoin gets regular “checks” via our proofs about data integrity on the network (the entire network is checked 24 hrs!). These verifiable statements are important primitives that can lead to unique applications and programs being built on Filecoin itself.
While this architecture has many advantages (scalability! verifiability!), it comes at the cost of additional complexity — the storage providing process is more involved and writing data into the network can take time. This complexity makes Filecoin (as it is today) best suited for cold storage. Many folks using Filecoin today are likely doing so through a developer on-ramp (Estuary.tech, NFT.Storage, Web3.Storage, Chainsafe’s SDKs, Textile’s Bidbot, etc) which couples hot caching in IPFS with cold archival in Filecoin. For those using just Filecoin alone, they’re typically storing large scale archives.
However, as improvements land both to the storage providing process and the proofs, expect more hot storage use cases to be enabled. Some major advancements to keep an eye on:
✅ SnapDeals — coupled with the below, storage providers can turn the mining process into a pipeline, injecting data into existing capacity on the network to dramatically lessen time to data landing on-chain.
🔄 Sealing-as-a-service / SNARKs-as-a-service — allowing storage providers to focus on data storage and outsource expensive computations to a market of specialized providers.
🔄 Proofs optimizations — tuning hardware to optimize for the generation of Filecoin proofs.
🔄 More efficient cryptographic primitives — reducing the footprint or complexity of proof generation.
Note: All of this is separate from the “read” flow — which techniques for faster reads exist today via unsealed copies. However, for Filecoin to get to web2 latency we will need Retrieval Market(s), discussed in the next section.
Retrieval Market(s)
The thesis with retrieval markets is straightforward: at scale, caching data at the edge via an open market can solve for the speed of light problem and result in performant delivery at lower costs than traditional infrastructure.
Why might this be the case? The argument is as follows:
The magic of content addressing (using fingerprints of content as the canonical reference) means data is verifiable.
This maps neatly to building a permissionless CDN — meaning anyone can supply infrastructure and serve content — as end users can always verify that the content they receive back is the content they requested (even from an untrusted computer).
If anyone can supply infrastructure into this permissionless network, a CDN can be created from a market of edge-caching nodes (rather than centrally planning where to put these nodes) and use incentive mechanisms to bootstrap hardware — leading to the optimal tradeoff on performance and cost.
The way retrieval markets are being designed on Filecoin, the aim is not to mandate a specific network to be used — rather to let an ecosystem evolve (e.g. Magmo, Ken Labs, Myel, Filecoin Saturn, and more) to solve the components involved with building a retrieval market.
This video is a good primer on the structure and approach of the working group and one can follow progress here.
Note: Given latency requirements, retrievals happen off-chain, but the settlement for payment for the services can happen on-chain.
Compute over Data (Off-chain Compute)
Compute over data is the third piece of the open services puzzle. When one thinks of what needs to be done with data, it’s typically not just storage and retrieval — users also want to be able to transform the data. The goal with these compute over data protocols are generally to perform computation over IPLD.
For the unfamiliar, IPLD aims to be the data layer for content-addressed systems. It can be used to describe a filesystem (like UnixFS which IPFS uses), Ethereum data, Git data, — really anything that is hash linked. This video might be a helpful primer.
The neat thing about IPLD being generic is that it can be an interface for all sorts of data — and by building computation tools that interact with IPLD, we reduce the complexity for teams building these tools to have their networks be compatible with a wide range of underlying types of data.
Note: This should be exciting for any network building on top of IPFS / IPLD (e.g. Celestia, Gala Games, Audius, Ceramic, etc)
Of course, not all compute is created equal — and for different use cases, different types of compute will be needed. For some use cases, there might be stricter requirements for verifiability — and one may want a zk proof along with the result to know the output was correctly calculated. For others, one might want to keep the data entirely private — and so instead might require fully homomorphic encryption. For others, one may want to just run batch processing like on a traditional cloud (and rely on economic collateral or reputational guarantees for correctness).
There are a bunch of teams working on different types of compute — from large scale parallel compute (e.g. Bacalhau), to cryptographically verifiable compute (e.g. Lurk), to everything in between.
One interesting feature of Filecoin is that the storage providers have compute resources (GPUs, CPUs — as a function of needing to run the proofs) colocated with their data. Critically, this feature sets up the network well to allow compute jobs to be moved to data — rather than moving the data to external compute nodes. Given that data has gravity, this is a necessary step to set the network up to support use cases for compute over large datasets.
Filecoin is set up well to have compute layers be deployed on top as L2s.
One can follow the compute over data working group here.
FVM (Programmable Applications)
Up until this point, I’ve talked about three services (storage, retrieval, and compute) that are related to the datastored on the Filecoin network. These services and their composability can lead to compounding demand for the services of the network — all of which ultimately anchor into the Filecoin blockchain and generate demand for block space.
But how can these services be enhanced?
Enter the FVM — Filecoin’s Virtual Machine.
The FVM will enable computation over Filecoin’s state. This service is critical — as it gives the network all the powers of smart contracts from other networks — but with the unique ability to interact with and trigger the open services mentioned above.
With the FVM, one can build bespoke incentive systems to make more sophisticated offerings on the network:
Undercollateralized lending markets for storage providers
ETL pipelines
… and so much more
Filecoin’s virtual machine is a WebAssembly (WASM) VM designed like a hypervisor. The vision with the FVM is to support many foreign runtimes, starting with the Ethereum Virtual Machine (EVM). This interoperability means Filecoin will support multiple VMs — on the same network contracts designed for the EVM, MoveVM, and more can be deployed.
By allowing for many VMs, Filecoin developers can deploy hardened contracts from other ecosystems to build up the on-chain infrastructure in the Filecoin economy, while also making it easier for other ecosystems to natively bridge into the services on the Filecoin network. Multiple VM support also allows for more native interactions between the Filecoin economy and other L1 economies.
Note the ipld-wasm module — the generalized version of this will be the IPVM work (which could be backported here). Source: https://fvm.filecoin.io
The FVM is critical as it provides the expressiveness for people to deploy and trigger custom data services from the Filecoin network (storage, retrieval, and compute). This feature allows for more sophisticated offerings to be built on Filecoin’s base primitives, and expand the surface area for broader adoption.
Note: For a flavor of what might be possible, this tweet thread might help elucidate how one might use smart contracts and the base primitives of Filecoin to build more sophisticated offerings.
Most importantly, the FVM also sets the stage for the last major pillar to be covered in this post: interplanetary consensus.
One can follow progress on the FVM here, and find more details on the FVM here.
Interplanetary Consensus (Scaling)
Before diving into what interplanetary consensus is, it’s worth restating what Filecoin is aiming to build: open services for data (storage, retrieval, compute) as credible alternatives to the centralized cloud.
To do this, the Filecoin network needs to operate at a scale orders of magnitude above what blockchains are currently delivering:
Product requirements for the Filecoin network.
Looking at the above requirements, it may seem contradictory for one chain to target all of these properties. And it is! Rather than trying to force all these properties at the base layer, Filecoin is aiming to deliver these properties across the network.
With interplanetary consensus, the network allows for recursive subnets to be created on the fly. This framework allows each subnet to tune its own trade off between security and scalability (and recursively spin up subnets of its own) — while still checkpointing information to their respective parent subnets.
This setup means that while Filecoin’s base layer can be highly secure (allowing many folks to verify at home on commodity hardware) — Filecoin can have subnets that are natively connected that can make different trade offs, allowing for more use cases to be unlocked.
A few interesting properties based on how interplanetary consensus is being designed:
Each subnet can spin up their own subnets (enabling recursive subnets)
Native messaging up, down, and across the tree — meaning any of these subnets can communicate with each other
Tunable tradeoffs between security and scalability (each subnet can choose their own consensus model and can choose to maintain their own state tree).
Firewall-esque security guarantees from children to parents (think of each subnet as being like a limited liability chain up to the tokens injected from the perspective of the parent chain).
To double click on some of the things interplanetary consensus sets Filecoin up for:
Because subnets can have different consensus mechanisms, interplanetary consensus opens the door for subnets that allow for native communication with other ecosystems (e.g. a Tendermint subnet for Cosmos).
Enabling subnets to tune between scalability and security (and enabling communications to subnets that make different trade offs) means Filecoin can have different regions of the network with different properties. Performant subnets can get hyper fast local consensus (to enable things like chat apps) — while allowing for results to checkpoint into the highly secure (and verifiable and slow) Filecoin base layer.
In a very high throughput subnet (a single data center, running a few nodes) — the FVM/IPVM work could be used to simply task schedule and execute computation directly “on-chain” — with native messaging and payment bubbling back up to more secure base layers.
Learn more by reading this blogpost and following the progress of ConsensusLab. This Github discussion may also be useful to contextualize IPC vs L2s.
Final Thoughts
So, after reading all the above, hopefully clearer what Filecoin is — and how it’s not exactly like any other protocol out there. Filecoin’s ambition is not just to be a storage network (as Tesla’s ambition was not to just ship the Roadster) — the goal is to facilitate a fully decentralized web powered by open services.
Compared to most other web3 infra plays, Filecoin is aiming to be substantially more than a single service. Compared to most L1s, Filecoin is targeting a set of use cases that are uniquely enabled through the architecture of the network. Excitingly, this means rather than competing for the same use cases, Filecoin can uniquely expand the pie for what can actually be done on crypto rails.
Disclaimer: Personal views, not reflective of my employer nor should be treated as “official”. This is my distillation of what Filecoin is and what makes it different based on my time in the ecosystem. Thanks to @duckie_han and @yoitsyoung for helping shape this.