This document outlines the end-to-end oracle mining process in the protocol, detailing how off-chain property data is validated, transformed, and anchored on-chain. The oracle mining process is the core mechanism by which decentralized property records are created and maintained.
The protocol relies on three types of oracles to ensure comprehensive coverage and data integrity:
Technical Oracles: Software programmers who participate in mining as a side opportunity, typically handling 500,000 to 1 million properties. These oracles:
- Possess technical expertise to run mining software and scripts
- Operate independently to earn MAHOUT tokens
- Often automate their mining operations for efficiency
- Contribute significant data volume to the network
Institutional Oracles: Professional entities hired by organizations or high-net-worth individuals to mine MAHOUT on their behalf, managing more than 1 million properties. These oracles:
- Operate at enterprise scale with dedicated resources
- Provide white-glove mining services for clients
- Often have teams dedicated to different jurisdictions
Owner/Provider Oracles: Non-technical individuals who benefit from the Elephant Protocol ecosystem, typically managing 1 to 100 properties. These oracles:
- Include property owners verifying their own assets
- Include service providers (agents, brokers) adding value for clients
- Include community members contributing local property knowledge
- Use simplified tools and interfaces for participation
Each oracle type plays a crucial role in maintaining the integrity, timeliness, and verifiability of decentralized property records. The diversity of oracle participants ensures both comprehensive coverage and resistance to centralization.
```mermaid
graph LR
    A[Learn] --> B[Set Up]
    B --> C[Property Identification]
    C --> D[Mining]
    D --> E[Minting]
```
Before beginning the oracle mining process, it's essential to understand the foundational concepts that power the Elephant Protocol.
The Lexicon is Elephant Protocol's domain-specific data format designed to create cross-jurisdictional consistency in property data. It serves as a universal translator between existing real estate data standards.
Structure:
- Property Data Classes: Core property attributes including ownership, assessments, and physical characteristics
- Relationship Classes: Define connections between properties, owners, and transactions - structured specifically for IPFS relationship management
- Groups Structure: Organized to optimize Merkle DAG construction for efficient on-chain verification
The Lexicon normalizes fragmented county data into a unified, queryable format while maintaining compatibility with existing systems. Each property's data is divided into distinct classes that can be independently verified and updated.
For detailed schema definitions and implementation guides, visit: Elephant Protocol Lexicon
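Purely as an illustration of how Lexicon-style classes separate independently verifiable data, the sketch below models a few hypothetical property data and relationship classes in TypeScript. The class names, fields, and group layout here are placeholders; the published Lexicon schema is authoritative.

```typescript
// Hypothetical sketch of Lexicon-style data classes; real field names and
// required attributes come from the published Lexicon schema.
interface PropertySeed {
  parcelIdentifier: string;    // county-assigned Parcel ID / Folio ID / APN
  standardizedAddress: string; // Google Maps formatted address
  sourceUrl: string;           // county property page
}

interface CountyAssessment {
  assessedValue: number;       // most recent county assessment
  landUseCode: string;         // jurisdiction-specific land-use classification
  yearBuilt?: number;
}

// Relationship classes link independently verifiable objects by reference,
// which maps naturally onto IPFS content identifiers when the Merkle DAG is built.
interface OwnershipRelationship {
  propertyRef: string;         // content hash (CID) of the property object
  ownerRef: string;            // content hash (CID) of the owner object
  recordedDate: string;        // ISO 8601 date
}
```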
Oracles receive MAHOUT and vMAHOUT tokens through crypto wallets, with two distinct approaches based on participant type:
Individual Wallets
- Standard crypto wallets like MetaMask for independent oracles
- Direct control over private keys
- Suitable for technical oracles and small-scale operations
- Simple setup with personal accountability
Organization Wallets
- Enterprise-grade wallet infrastructure leveraging Key Management Systems (KMS)
- Designed for institutional oracles managing large-scale operations
- Separates wallet control from individual employees
- Avoids the complicated processes required for traditional exchange-based corporate wallets
- Open-source framework ensures transparency and security
The wallet infrastructure ensures that both individual contributors and large organizations can participate in the oracle ecosystem while maintaining appropriate security and operational controls for their scale.
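For illustration, the snippet below shows how an independent oracle might load a standard wallet with ethers.js, and hints at how an organization could swap in a KMS-backed signer instead. The `KmsSigner` name and the environment variable names are hypothetical placeholders, not part of the protocol's tooling.

```typescript
import { Wallet, JsonRpcProvider } from "ethers";

// Individual oracle: a standard externally owned account (e.g., exported from MetaMask).
// Never hard-code private keys; load them from a secure environment variable.
const provider = new JsonRpcProvider(process.env.POLYGON_RPC_URL);
const individualSigner = new Wallet(process.env.ORACLE_PRIVATE_KEY!, provider);

// Organization oracle: wallet control lives in a Key Management System rather than
// with any single employee. "KmsSigner" is a hypothetical placeholder for whatever
// KMS-backed ethers Signer implementation the organization adopts.
// const orgSigner = new KmsSigner({ keyId: process.env.KMS_KEY_ID }, provider);

console.log("Oracle address:", await individualSigner.getAddress());
```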
Before you begin, ensure you have completed the following:
- Setup Guide: You must have a configured environment with a Google account, Pinata JWT, MetaMask wallet, and POL tokens.
- Tech Stack Review: Familiarize yourself with the core technologies, including IPFS, IPLD, and JSON Canonicalization.
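A minimal pre-flight check, assuming your Pinata JWT, wallet key, and RPC endpoint are exposed as environment variables (the variable names here are illustrative, not mandated by the setup guide):

```typescript
// Hypothetical environment variable names; adjust to match your own setup.
const required = ["PINATA_JWT", "ORACLE_PRIVATE_KEY", "POLYGON_RPC_URL"];
const missing = required.filter((name) => !process.env[name]);

if (missing.length > 0) {
  throw new Error(`Missing required configuration: ${missing.join(", ")}`);
}
console.log("Environment looks ready for mining.");
```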
The property identification process varies based on oracle type and scale of operations. This step establishes the foundational data needed to begin mining properties on the Elephant Protocol.
For oracles focusing on individual properties (Owner/Provider Oracles managing 1-100 properties):
1. Locate Property Information
   - Start with the property's street address
   - Google search: "[County Name] appraiser office" or "[County Name] assessor office"
   - Navigate to the county's property search portal
2. Extract Core Identifiers
   - Search for your target property using the address
   - Locate and record the Parcel Identifier (may also be called Folio ID, APN, or Parcel Number)
   - Verify the address matches county records
3. Standardize Property Address
   - Input the address into Google Maps
   - Use the formatted address provided by Google Maps as the standardized version
   - This ensures consistency across all data submissions
4. Capture Source Information
   - Copy the full URL of the property page from your browser
   - Open browser Developer Tools (F12)
   - Navigate to the Network tab
   - Refresh the property page
   - Identify the request method (typically GET)
   - Note any required headers or parameters for data access
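Once the request method and headers are captured, the county record can also be retrieved programmatically. A minimal sketch, assuming a GET request like the one observed in the Network tab (the URL and header values are examples only):

```typescript
// Example values observed in the browser's Network tab; substitute your own.
const sourceUrl = "https://example-county.gov/property/12345"; // hypothetical URL
const response = await fetch(sourceUrl, {
  method: "GET",
  headers: { Accept: "text/html" }, // include any headers the portal required
});

if (!response.ok) {
  throw new Error(`County portal returned ${response.status}`);
}
const html = await response.text(); // raw record, kept for the County group later
```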
For Technical and Institutional Oracles managing thousands to millions of properties:
The Elephant Protocol founding team has developed systematic approaches for collecting property identifiers across all 150 million properties in the United States.
Resources Available:
- Pre-collected county data structures and access patterns
- Automated scripts for bulk property identification
- Standardized extraction methodologies by jurisdiction
View examples and access tools at: US Properties
This repository contains:
- County-specific data schemas
- API endpoints and access methods
- Bulk extraction scripts
- Property identifier mapping tools
Regardless of scale, each property identification must include:
- Standardized Address: Google Maps formatted address
- County: The county in which the property is located (as returned by Google Maps)
- Parcel Identifier: The unique county-assigned property ID
- Source URL: Direct link to the county's property page
- Request Method: HTTP request method
- Headers: Any additional HTTP request headers required for access
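One way to capture these fields, sketched as a TypeScript type with an example record (the field names are illustrative; follow the Lexicon schema for the authoritative shape):

```typescript
// Illustrative shape for a property identification record.
interface PropertyIdentification {
  standardizedAddress: string; // Google Maps formatted address
  county: string;              // county in which the property is located
  parcelIdentifier: string;    // unique county-assigned property ID
  sourceUrl: string;           // direct link to the county's property page
  requestMethod: "GET" | "POST";
  headers?: Record<string, string>; // any additional HTTP request headers
}

const example: PropertyIdentification = {
  standardizedAddress: "123 Main St, Springfield, IL 62701, USA", // hypothetical
  county: "Sangamon County",
  parcelIdentifier: "14-27.0-351-007",                            // hypothetical
  sourceUrl: "https://example-county.gov/property/14-27-0-351-007",
  requestMethod: "GET",
};
```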
The mining process transforms raw property data into validated Lexicon format ready for on-chain submission. The process varies based on which data group you're mining.
```mermaid
graph LR
    A[Prepare] --> B[Enrich]
    B --> C[Transform]
    C --> D[Validate]
```
Input requirements vary by data group:
Seed Group
- Uses property identification fields prepared from the previous step
- Includes: Parcel ID, standardized address, source URL
- Minimal data requirements for initial property registration
County Group
- Start with property identification URL
- Download all HTML pages related to the property (for some counties, data may also be available as JSON or CSV files).
- Extract all county-specific data and attributes
- Capture complete property record from county sources
Photo Group
- Locate publicly accessible URLs for property images
- Download all available property photos
- Store images for subsequent IPFS upload
- Maintain original image quality and metadata
Photo Metadata Group
- Download photos from IPFS (previously uploaded)
- Prepare images for the Enrich step
- Extract existing metadata from image files
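Previously uploaded photos can be retrieved through any public or dedicated IPFS gateway. A minimal sketch, assuming Pinata's public gateway and an example CID (both placeholders):

```typescript
import { mkdir, writeFile } from "node:fs/promises";

// Hypothetical CID of a previously uploaded property photo.
const cid = "bafybeigdyrexamplecidexamplecidexamplecid";
const gateway = "https://gateway.pinata.cloud/ipfs"; // any IPFS gateway works

const response = await fetch(`${gateway}/${cid}`);
if (!response.ok) {
  throw new Error(`Gateway returned ${response.status} for ${cid}`);
}
await mkdir("./photos", { recursive: true });
await writeFile(`./photos/${cid}.jpg`, Buffer.from(await response.arrayBuffer()));
```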
AI-powered enrichment varies by data group:
Not Required For:
- Seed Group
- County Group
- Photo Group
Required For:
- Photo Metadata Group
The AI Agent processes photos to gather detailed property information:
- Architectural features and building characteristics
- Property condition assessment
- Geolocation verification
- Environmental context and surroundings
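The exact enrichment output is defined by the AI Agent and the Lexicon, but conceptually it resembles a structured record like the hypothetical sketch below (all field names are placeholders):

```typescript
// Hypothetical structure for AI-derived photo insights; the real Photo Metadata
// schema is defined by the Lexicon.
interface PhotoEnrichment {
  architecturalStyle?: string;   // e.g., "ranch", "colonial"
  exteriorMaterials?: string[];  // e.g., ["brick", "vinyl siding"]
  conditionAssessment?: "poor" | "fair" | "good" | "excellent";
  geolocationVerified?: boolean; // photo location consistent with the parcel
  surroundings?: string[];       // e.g., ["tree cover", "attached garage"]
}
```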
Converting raw data to Elephant Protocol's standardized schema:
Simple Transformations:
- Seed: Straightforward field mapping
- Photo: Basic metadata extraction
- Method: Manual mapping or simple scripts
Complex Transformations:
- County: Requires custom transformation code for each jurisdiction
- Photo Metadata: Converts AI Agent generated insights into Lexicon format
- Method: AI-aided transformation using the AI Agent
The transformation complexity depends on:
- Data source structure
- Jurisdiction-specific formats
- Required field mappings
- Data validation rules
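As a simplified illustration of a jurisdiction-specific mapping, the sketch below converts a raw county record into a Lexicon-shaped object. Both the county field names and the target fields are hypothetical; each jurisdiction requires its own mapping against the real Lexicon schema.

```typescript
// Raw record scraped from a hypothetical county portal.
interface RawCountyRecord {
  FOLIO: string;
  SITE_ADDR: string;
  JUST_VALUE: string; // counties often return numbers as strings
}

// Hypothetical Lexicon-shaped target; consult the Lexicon schema for real fields.
interface LexiconCountyData {
  parcelIdentifier: string;
  standardizedAddress: string;
  assessedValue: number;
}

function transformCountyRecord(raw: RawCountyRecord): LexiconCountyData {
  return {
    parcelIdentifier: raw.FOLIO.trim(),
    standardizedAddress: raw.SITE_ADDR.trim(),
    assessedValue: Number(raw.JUST_VALUE.replace(/[$,]/g, "")),
  };
}
```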
The final validation step using the Elephant CLI:
- Validates data against Lexicon schema requirements
- Ensures all required fields are properly formatted
- Enables off-chain validation before consensus submission
The validation process:
- Schema compliance check
- Data integrity verification
This step is critical for ensuring data quality and preventing failed consensus attempts.
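The Elephant CLI performs this check for you. Purely to illustrate what schema-compliance validation involves, here is a generic sketch using the Ajv JSON Schema validator against a stand-in schema; it is not the actual Lexicon schema or the CLI's internals.

```typescript
import Ajv from "ajv";

// Stand-in schema for illustration; the real Lexicon schemas are published by the protocol.
const seedSchema = {
  type: "object",
  required: ["parcelIdentifier", "standardizedAddress", "sourceUrl"],
  properties: {
    parcelIdentifier: { type: "string", minLength: 1 },
    standardizedAddress: { type: "string", minLength: 1 },
    sourceUrl: { type: "string", minLength: 1 },
  },
  additionalProperties: false,
};

const ajv = new Ajv({ allErrors: true });
const validateSeed = ajv.compile(seedSchema);

const candidate = {
  parcelIdentifier: "14-27.0-351-007",
  standardizedAddress: "123 Main St, Springfield, IL 62701, USA",
  sourceUrl: "https://example-county.gov/property/14-27-0-351-007",
};

if (!validateSeed(candidate)) {
  // Resolve every reported error before attempting consensus submission.
  console.error(validateSeed.errors);
}
```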
The minting process pushes all validated data to IPFS and commits the final hash to the blockchain, establishing an immutable property record.
```mermaid
graph LR
    A[Hash] --> B[Upload]
    B --> C[Submit]
```
Objective: Anchor the property's full dataset to the blockchain via a Merkle DAG root hash.
Procedure:
- For each data object, first canonicalize its contents to ensure a standard, deterministic representation before hashing.
- Compute a unique cryptographic hash for each canonicalized object.
- Construct a Merkle DAG where relationships between objects are represented by their corresponding content hashes.
- Compute the final Merkle root of the DAG. This single hash serves as a verifiable commitment to the entire dataset's state.
- Use the Merkle root to update the smart contract with the current state commitment.
- Leverage the Merkle root in consensus protocols to efficiently prove that the off-chain state matches the on-chain commitment.
Learn more about Merkle Hash Commitment
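A minimal sketch of the canonicalize-hash-link pattern using Node's crypto module. The canonicalization shown (recursive key sorting) only approximates JSON Canonicalization, and the two-leaf root is deliberately simplified compared to a full IPLD Merkle DAG.

```typescript
import { createHash } from "node:crypto";

// Approximate canonicalization: recursively sort object keys so the same data
// always serializes to the same bytes. (JCS, RFC 8785, is the rigorous version.)
function canonicalize(value: unknown): string {
  if (Array.isArray(value)) {
    return `[${value.map(canonicalize).join(",")}]`;
  }
  if (value !== null && typeof value === "object") {
    const entries = Object.entries(value as Record<string, unknown>)
      .sort(([a], [b]) => a.localeCompare(b))
      .map(([k, v]) => `${JSON.stringify(k)}:${canonicalize(v)}`);
    return `{${entries.join(",")}}`;
  }
  return JSON.stringify(value);
}

const sha256 = (data: string) => createHash("sha256").update(data).digest("hex");

// Hash each canonicalized data object individually.
const seedHash = sha256(canonicalize({ parcelIdentifier: "14-27.0-351-007" }));
const countyHash = sha256(canonicalize({ assessedValue: 185000 }));

// Link the object hashes into a (simplified) root commitment over the dataset.
const merkleRoot = sha256(seedHash + countyHash);
console.log("Merkle root to commit on-chain:", merkleRoot);
```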
Objective: Upload the full off-chain state (all validated data objects) to IPFS.
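One way to pin a data object, assuming you use Pinata's pinJSONToIPFS endpoint with the JWT from the setup guide; other pinning services or a local IPFS node work equally well.

```typescript
// Pin one property data object to IPFS via Pinata; repeat for each object in the DAG.
const pinResponse = await fetch("https://api.pinata.cloud/pinning/pinJSONToIPFS", {
  method: "POST",
  headers: {
    Authorization: `Bearer ${process.env.PINATA_JWT}`,
    "Content-Type": "application/json",
  },
  body: JSON.stringify({
    pinataContent: { parcelIdentifier: "14-27.0-351-007" }, // example payload
  }),
});

if (!pinResponse.ok) {
  throw new Error(`Pinata upload failed: ${pinResponse.status}`);
}
const { IpfsHash } = await pinResponse.json(); // CID of the pinned object
console.log("Pinned CID:", IpfsHash);
```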
Objective: The computed Merkle root is submitted to the smart contract, acting as a cryptographic commitment to the full off-chain state.
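A sketch of what the on-chain submission might look like with ethers.js. The contract address, ABI fragment, and the function name `submitMerkleRoot` are hypothetical placeholders, not the protocol's actual interface.

```typescript
import { Contract, JsonRpcProvider, Wallet } from "ethers";

// All identifiers below are placeholders for illustration only.
const provider = new JsonRpcProvider(process.env.POLYGON_RPC_URL);
const signer = new Wallet(process.env.ORACLE_PRIVATE_KEY!, provider);

const oracleContract = new Contract(
  "0x0000000000000000000000000000000000000000", // hypothetical contract address
  ["function submitMerkleRoot(bytes32 parcelKey, bytes32 merkleRoot)"], // hypothetical ABI
  signer,
);

const parcelKey = "0x" + "11".repeat(32);  // hypothetical bytes32 key for the parcel
const merkleRoot = "0x" + "22".repeat(32); // Merkle root computed in the Hash step

// Commit the Merkle root as the on-chain state commitment.
const tx = await oracleContract.submitMerkleRoot(parcelKey, merkleRoot);
await tx.wait(); // wait for the transaction to be mined before relying on it
```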
Objective: After the submitted Merkle root is verified on-chain and the required consensus is reached, a token is issued to represent the newly committed state.
- Seed and County groups require consensus from 3 oracles.
- Photo and Photo Metadata require consensus from 1 oracle.
Related Documentation:
| Stage | Token Output | Description |
|---|---|---|
| Mint Success | MAHOUT | Economic token used for staking and utility |
| Mint Success | vMAHOUT | Non-transferable proof-of-contribution token |
- Merkle DAG generation ensures data immutability and versioning.
- Multi-oracle validation mitigates bad data injection and single-point failure.
- Token rewards align oracle incentives with protocol data integrity.
- Integration of ZK proofs for ingestion verification.
- DAO-based oracle slashing for repeated invalid submissions.
- Automated jurisdictional prioritization via market demand signals.
Copyright 2025 Elephant.xyz Licensed under the Apache License, Version 2.0 (the “License”); you may not use this file except in compliance with the License. You may obtain a copy of the License at http://www.apache.org/licenses/LICENSE-2.0