aptos-labs / aptos-indexer-processors
Set of core processors that index data on the Aptos blockchain
Home Page: https://aptos.dev/indexer/indexer-landing
Using docker compose and the coin_flip example processor, I believe I am streaming events and detecting coin flip events, but nothing is getting written to the Postgres database. I have verified the connection strings and the database and table names, and I am able to connect through a SQL tool, but the tables are always empty.
Does this log output mean that it should be writing an event?
index-processor-1 | {"timestamp": "2024-01-19 22:08:10,486", "level": "INFO", "fields": {"message": "[Parser] DB insertion time of one batch of transactions", "processor_name": "coin_flip", "start_version": "628091000", "end_version": "628091999", "service_type": "processor", "num_of_transactions": "1000", "duration_in_secs": "0.00018042", "size_in_bytes": "2939880"}, "module": "worker", "func_name": "run", "path_name": "/app/utils/worker.py", "line_no": 538}
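One way to sanity-check that log line (a sketch; the JSON fields are copied, abridged, from the output above): divide the reported insertion duration by the batch size. A sub-microsecond per-transaction "insertion" time would suggest the batch produced no rows to write, rather than a failed write.

```python
import json

# Log fields copied (abridged) from the processor output above.
log_line = (
    '{"fields": {"message": "[Parser] DB insertion time of one batch of transactions", '
    '"processor_name": "coin_flip", "start_version": "628091000", "end_version": "628091999", '
    '"num_of_transactions": "1000", "duration_in_secs": "0.00018042"}}'
)

fields = json.loads(log_line)["fields"]
duration = float(fields["duration_in_secs"])
count = int(fields["num_of_transactions"])
per_txn_us = duration / count * 1e6

# ~0.18 microseconds per transaction: far below a real Postgres round trip,
# which hints the batch matched no coin_flip events and wrote zero rows.
print(f"{per_txn_us:.3f} us per transaction")
```

If that holds, the processor is likely filtering out every transaction (for example a module address mismatch in config.yaml) rather than failing on the write path.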
config.yaml
# Connecting to testnet
starting_version: 626589722
# Optional. Stop processor after ending_version.
ending_version: 646589730
docker-compose.yaml
version: '3'
services:
  index-processor:
    build: .
    environment:
      DB_CONNECTION_URI: postgresql://coin_flip:postgres@db:5432/coin_flip
    depends_on:
      - db
    volumes:
      - ./config.yaml:/app/config/config.yaml
  db:
    image: postgres:15.2
    environment:
      POSTGRES_USER: coin_flip
      POSTGRES_PASSWORD: postgres
    ports:
      - "5432:5432"
    volumes:
      - db-data:/var/lib/postgresql/data
volumes:
  db-data:
From the same experiment as above, there are a few other errors that were consistent (all with the TS processors):
A lot of this error: 13 INTERNAL: Received RST_STREAM with code 0. This sometimes kept happening even with repeated retries, and only went away after switching back and forth among a couple of different API keys.
The gRPC service also returns duplicate transaction entries from time to time, up to a few times for the same transaction, for example transaction version 3014600. This seems to start around transaction versions ~200K.
In general, it'd be great if we could reduce this kind of error, or at least have better error messages and maybe better out-of-the-box error handling/retry.
https://aptos-org.slack.com/archives/C03MN5F7WUV/p1701542240169189
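Until the duplicates are fixed server-side, a client can guard against them by tracking the next expected version and dropping anything already seen (a sketch using plain dicts in place of the real transaction protos):

```python
def dedup_stream(transactions):
    """Yield transactions in order, skipping duplicate versions.

    `transactions` is any iterable of records carrying a version
    (here simulated with plain dicts).
    """
    next_expected = None
    for txn in transactions:
        version = txn["version"]
        if next_expected is not None and version < next_expected:
            continue  # duplicate or replayed transaction, drop it
        next_expected = version + 1
        yield txn

# Simulated stream where version 3014600 arrives three times.
stream = [{"version": v} for v in [3014599, 3014600, 3014600, 3014600, 3014601]]
print([t["version"] for t in dedup_stream(stream)])  # [3014599, 3014600, 3014601]
```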
The gRPC service seems to cut my connection every 5 minutes. It reconnects to the stream successfully (I'm using the Python version), but every 5 minutes, I get the following message:
2024-05-14 11:48:16 | WARNING | utils.worker:producer:221 - [Parser] RpcError receiving datastream response. | {'processor_name': 'staker', 'stream_address': 'grpc.testnet.aptoslabs.com:443', 'error': '<_MultiThreadedRendezvous of RPC that terminated with:\n\tstatus = StatusCode.UNKNOWN\n\tdetails = "Stream removed"\n\tdebug_error_string = "UNKNOWN:Error received from peer ipv4:34.110.202.98:443 {grpc_message:"Stream removed", grpc_status:2, created_time:"2024-05-14T11:48:16.622325+01:00"}"\n>', 'next_version_to_fetch': 979680402, 'ending_version': None, 'service_type': 'processor'}
I'd like to know if it's a configuration issue on the test server.
To reproduce: just run the Python indexer and wait 5 minutes.
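As a workaround for the periodic disconnects, the consumer loop can treat "Stream removed" as a resumable error and reconnect from next_version_to_fetch with exponential backoff (a sketch; fetch_stream is a hypothetical stand-in for the gRPC call, not the processor's real API):

```python
import time

def consume_with_reconnect(fetch_stream, start_version, max_retries=5, base_delay=0.01):
    """Consume a versioned stream, reconnecting from the last version on error.

    `fetch_stream(version)` is a hypothetical generator of (version, payload)
    pairs that may raise RuntimeError mid-stream (e.g. "Stream removed").
    """
    next_version = start_version
    retries = 0
    results = []
    while retries <= max_retries:
        try:
            for version, payload in fetch_stream(next_version):
                results.append((version, payload))
                next_version = version + 1
                retries = 0  # progress made, reset the backoff
            return results  # stream ended cleanly
        except RuntimeError:
            retries += 1
            time.sleep(base_delay * (2 ** retries))  # exponential backoff
    raise RuntimeError(f"gave up after {max_retries} retries at version {next_version}")

# Simulated stream that dies once after two items, then completes.
calls = {"n": 0}
def fetch_stream(version):
    calls["n"] += 1
    for v in range(version, version + 4):
        if calls["n"] == 1 and v == version + 2:
            raise RuntimeError("Stream removed")
        yield v, f"txn-{v}"

print([v for v, _ in consume_with_reconnect(fetch_stream, 100)])  # [100, 101, 102, 103, 104, 105]
```

Resetting the retry counter on progress distinguishes the routine 5-minute cuts (which resume fine) from a genuinely dead stream.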
Currently we have only one starting_version field. This field applies unconditionally, even if there is a version in the DB. This means that if the user wants to start from the version in the DB if present, but otherwise from a given version, there is no way to do that without a tricky sequence of starts and stops, adding and removing starting_version from the config.
We should have two fields (more concise naming TBD):
starting_version_if_nothing_in_db: If given, start from this version if nothing is in the DB.
starting_version_no_matter_what: Start from this version even if there is something in the DB.
How to build the GraphQL API server in Python or in Rust?
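The resolution order for the two proposed starting-version fields could be sketched like this (field names as proposed; db_version stands for whatever checkpoint the processor finds in the DB):

```python
def resolve_starting_version(db_version, starting_version_no_matter_what=None,
                             starting_version_if_nothing_in_db=None):
    """Pick the version a processor should start from.

    Precedence: the unconditional override wins, then the DB checkpoint,
    then the fallback for an empty DB, then genesis (0).
    """
    if starting_version_no_matter_what is not None:
        return starting_version_no_matter_what
    if db_version is not None:
        return db_version
    if starting_version_if_nothing_in_db is not None:
        return starting_version_if_nothing_in_db
    return 0

# Fresh DB: fall back to the configured version.
print(resolve_starting_version(None, starting_version_if_nothing_in_db=626589722))
# DB has a checkpoint: resume from it.
print(resolve_starting_version(628091999, starting_version_if_nothing_in_db=626589722))
# Unconditional override: restart from scratch regardless of the DB.
print(resolve_starting_version(628091999, starting_version_no_matter_what=0))
```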
While using
const client = new IndexerClient(
"https://indexer.mainnet.aptoslabs.com/v1/graphql"
);
const txnDetails = await client.getAccountTransactionsData(
"0x62818ab1a3567b03bdb19078e42b774fbdab279a2bf5dfae886d29461feb1fcd"
);
console.log("txnDetails: ", txnDetails);
This is the error I get:
{
"errors": [
{
"message": "Connection template evaluation failed: 'Object' has no attritubte 'operation_name'.",
"extensions": {
"path": "$",
"code": "template-resolution-failed"
}
}
]
}
Transaction 1023992588 on testnet is failing deserialization, causing whole batches to be skipped.
The batch skipping bit has been added in #352.
This approach doesn't actually skip batches; it moves the start of the next batch forward by 1 and tries again.
For the sake of argument, let's say we have a bad transaction at version 3456 and the batch size is consistently 5000.
If we're processing a batch that goes from txn 2000 to txn 6999, the process will fail and restart, this time trying to process txn 2001 to txn 7000. It will then fail again and again until we get past txn 3456, at which point the process will resume without errors.
The problem with that, continuing the example above, is that if I'm interested in a transaction at version 3000, I'm never going to see it, because it will always be in a bad batch.
Ideally, the bad transaction would not fail: it would either be deserialised properly, or its bad fields would be ignored.
We've had to add a "slow mode" in our code: if we see a deserialisation failure, we restart the stream asking for 1 transaction at a time until we fail again, at which point we know we've processed the actual bad transaction and can restart the stream in full-speed mode.
Run the Python indexer starting a bit before transaction 1023992588.
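The "slow mode" workaround described above can be sketched as follows (the list of versions and the set of bad versions are a simulation; the real code fetches from the gRPC stream and fails on deserialisation):

```python
def process_with_slow_mode(versions, bad_versions, batch_size=5000):
    """Process versions in batches; on a failure, fall back to one-at-a-time.

    `bad_versions` simulates transactions that fail deserialisation. In slow
    mode every transaction is fetched alone, so only the bad one is skipped
    and every good transaction around it is still processed.
    """
    processed, skipped = [], []
    i = 0
    while i < len(versions):
        batch = versions[i:i + batch_size]
        if not any(v in bad_versions for v in batch):
            processed.extend(batch)          # full-speed mode
        else:
            for v in batch:                  # slow mode: one transaction per fetch
                if v in bad_versions:
                    skipped.append(v)        # the single bad transaction
                else:
                    processed.append(v)
        i += len(batch)
    return processed, skipped

processed, skipped = process_with_slow_mode(list(range(2000, 7000)), {3456}, batch_size=5000)
print(len(processed), skipped)  # 4999 [3456]
```

Unlike moving the batch start forward by 1, the transaction at version 3000 is still processed here; only the genuinely bad version 3456 is lost.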
@banool @bowenyang007 Per offline discussion
None of the READMEs are updated. This is the only place we actually updated: https://aptos.dev/indexer/legacy/migration#2-migrate-processors-to-transaction-stream-service
I'm using the indexer GraphQL API account_transactions for an NFT trade transaction.
It appears some data is missing in both token_activities and token_activities_v2: the previous owner (the seller) is null, while the value should be 0xd6e3ad94ed9d1f628d6b4e1a287378158beb0930f4d0be0f89de683386746f53
Sharing the response here:
"token_activities_v2": [
  {
    "aptos_names_from": [],
    "aptos_names_to": [],
    "before_value": null,
    "from_address": null,
    "is_fungible_v2": null,
    "to_address": "0x629ed8449b71c464d253159b5f8b26a5c26bce40dfdb7420a279041c380e4464",
    "token_amount": 1,
    "token_data_id": "0xab34d7afd9fb00e0008a6181dfa485ac3c6ae98b43b74b6de0fd4be1c58f9e5b",
    "token_standard": "v1",
    "transaction_timestamp": "2023-12-13T09:25:33.089239",
    "transaction_version": 359534234,
    "type": "0x3::token::DepositEvent",
    "property_version_v1": 0,
    "event_account_address": "0x629ed8449b71c464d253159b5f8b26a5c26bce40dfdb7420a279041c380e4464",
    "event_index": 0,
    "entry_function_id_str": "0x2c7bccf7b31baf770fdbcc768d9e9cb3d87805e255355df5db32ac9a669010a2::marketplace_v2::buy"
  }
]
Is it a bug or intentional?
Query link:
Query variables:
{
  "transaction_version": 359534234,
  "account_address": "0x629ed8449b71c464d253159b5f8b26a5c26bce40dfdb7420a279041c380e4464"
}
There appears to be a memory leak, as evidenced by the steadily increasing memory consumption observed during indexing. Valgrind was used to analyze memory allocations, revealing that a significant portion of memory is allocated within the transaction vector. After each batch is processed, the memory is not released and consumption continues to grow. After two minutes of indexing, the indexer's RAM usage exceeds 5 GB and keeps increasing.
massif report:
->12.31% (76,360,304B) 0x2285542: alloc (alloc.rs:98)
| ->12.31% (76,360,304B) 0x2285542: alloc::alloc::Global::alloc_impl (alloc.rs:181)
| ->12.31% (76,360,304B) 0x2286318: <alloc::alloc::Global as core::alloc::Allocator>::allocate (alloc.rs:241)
| ->12.31% (76,360,304B) 0x228607E: alloc::raw_vec::finish_grow (raw_vec.rs:521)
| ->05.87% (36,373,376B) 0x6E834A: alloc::raw_vec::RawVec<T,A>::grow_amortized (raw_vec.rs:433)
| | ->05.87% (36,373,376B) 0x70A8B8: alloc::raw_vec::RawVec<T,A>::reserve_for_push (raw_vec.rs:318)
| | ->05.87% (36,373,376B) 0xAF5A77: alloc::vec::Vec<T,A>::push (mod.rs:1922)
| | ->05.87% (36,373,376B) 0xB91356: prost::encoding::message::merge_repeated (encoding.rs:1114)
| | ->04.03% (24,965,408B) 0xC78005: <aptos_protos::pb::aptos::transaction::v1::MoveStructTag as prost::message::Message>::merge_field (aptos.transaction.v1.rs:739)
| | | ->04.03% (24,965,408B) 0xB9D7F6: prost::encoding::message::merge::{{closure}} (encoding.rs:1086)
| | | ->04.03% (24,965,408B) 0x139AC98: prost::encoding::merge_loop (encoding.rs:374)
| | | ->04.03% (24,965,408B) 0xB96C34: prost::encoding::message::merge (encoding.rs:1080)
| | | ->04.02% (24,932,544B) 0xC7857A: <aptos_protos::pb::aptos::transaction::v1::WriteResource as prost::message::Message>::merge_field (aptos.transaction.v1.rs:398)
| | | | ->04.02% (24,932,544B) 0xBA2456: prost::encoding::message::merge::{{closure}} (encoding.rs:1086)
| | | | ->04.02% (24,932,544B) 0x139E8F8: prost::encoding::merge_loop (encoding.rs:374)
| | | | ->04.02% (24,932,544B) 0xB95734: prost::encoding::message::merge (encoding.rs:1080)
| | | | ->04.02% (24,932,544B) 0x8861EA: aptos_protos::pb::aptos::transaction::v1::write_set_change::Change::merge (aptos.transaction.v1.rs:329)
| | | | ->04.02% (24,932,544B) 0xC688AA: <aptos_protos::pb::aptos::transaction::v1::WriteSetChange as prost::message::Message>::merge_field (aptos.transaction.v1.rs:278)
| | | | ->04.02% (24,932,544B) 0xBA06F6: prost::encoding::message::merge::{{closure}} (encoding.rs:1086)
| | | | ->04.02% (24,932,544B) 0x13A2838: prost::encoding::merge_loop (encoding.rs:374)
| | | | ->04.02% (24,932,544B) 0xB94234: prost::encoding::message::merge (encoding.rs:1080)
| | | | ->04.02% (24,932,544B) 0xB909BA: prost::encoding::message::merge_repeated (encoding.rs:1113)
| | | | ->04.02% (24,932,544B) 0xC69C4E: <aptos_protos::pb::aptos::transaction::v1::TransactionInfo as prost::message::Message>::merge_field (aptos.transaction.v1.rs:166)
| | | | ->04.02% (24,932,544B) 0xB9B7A6: prost::encoding::message::merge::{{closure}} (encoding.rs:1086)
| | | | ->04.02% (24,932,544B) 0x139B818: prost::encoding::merge_loop (encoding.rs:374)
| | | | ->04.02% (24,932,544B) 0xB96B34: prost::encoding::message::merge (encoding.rs:1080)
| | | | ->04.02% (24,932,544B) 0xC75E54: <aptos_protos::pb::aptos::transaction::v1::Transaction as prost::message::Message>::merge_field (aptos.transaction.v1.rs:37)
| | | | ->04.02% (24,932,544B) 0xB9F846: prost::encoding::message::merge::{{closure}} (encoding.rs:1086)
| | | | ->04.02% (24,932,544B) 0x139EBD8: prost::encoding::merge_loop (encoding.rs:374)
| | | | ->04.02% (24,932,544B) 0xB95A34: prost::encoding::message::merge (encoding.rs:1080)
| | | | ->04.02% (24,932,544B) 0xB91B0A: prost::encoding::message::merge_repeated (encoding.rs:1113)
| | | ->00.01% (32,864B) in 1+ places, all below ms_print's threshold (01.00%)
Run indexer (commit 48d7794) with config:
health_check_port: 8084
server_config:
  processor_config:
    type: coin_processor
  indexer_grpc_data_service_address: https://grpc.testnet.aptoslabs.com:443
  postgres_connection_string: ***********
  auth_token: **************
  number_concurrent_processing_tasks: 1
  starting_version: 951262066
Presently, processor insertions are nondeterministic when concurrent processing tasks are enabled, such that the processor must be pinned to single-threading to enforce total ordering of transactions.
In practice, this slows down processing, in particular doubling or even tripling the time to sync to chain tip, for example with the Econia Data Service Stack (https://econia.dev/off-chain/dss/data-service-stack).
Offline notes and suggestions:
@larry-aptos, perhaps we could devise some kind of multi worker sequential execution scheme.
One relatively simple implementation is to allow for a reduce step. Currently we create multiple threads (map) and these write directly to the DB, but if we could instead collect the results of the threads and run them through a reduce against data already in the DB, we could achieve ordering.
Myself:
- Cache insertions in a Postgres table and only insert into the main tables once colliding threads are complete
- Execute each insertion as a subtransaction of an overall Postgres transaction, committing once colliding threads are complete
cc @CRBl69
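The map/reduce idea above could be sketched like this: batches are processed concurrently, but results are committed strictly in submission order by a single reducer (process_batch is a hypothetical stand-in; the real reduce would run against the DB):

```python
from concurrent.futures import ThreadPoolExecutor

def process_batch(batch):
    # Hypothetical stand-in for parsing one batch of transactions.
    return [v * 2 for v in batch]

def run_ordered(batches, workers=4):
    """Map: process batches concurrently. Reduce: commit in submission order.

    Executor.map yields results in the order the batches were submitted,
    regardless of which worker finishes first, so the commit log is totally
    ordered even though the processing itself is parallel.
    """
    commit_log = []
    with ThreadPoolExecutor(max_workers=workers) as pool:
        for result in pool.map(process_batch, batches):
            commit_log.append(result)  # single-threaded, ordered "reduce"
    return commit_log

print(run_ordered([[1, 2], [3, 4], [5, 6]]))  # [[2, 4], [6, 8], [10, 12]]
```

The trade-off is that the reducer buffers out-of-order completions, so throughput is bounded by the slowest in-flight batch rather than collapsing to full single-threading.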
The GraphQL API has no support for the validator_transaction transaction type.
Using the list of available GraphQL tables (https://cloud.hasura.io/public/graphiql?endpoint=https://api.mainnet.aptoslabs.com/v1/graphql) I can find only the following transaction-related tables:
block_metadata_transactions
user_transactions
account_transactions
But no validator_transaction table. This type was added and used only recently, but it is important to have.
Example of a transaction of this type: https://explorer.aptoslabs.com/txn/975859175?network=mainnet
fn insert_current_coin_balances (https://github.com/aptos-labs/aptos-indexer-processors/blob/main/rust/processor/src/processors/coin_processor.rs#L216): if two or more records with the same (owner_address, coin_type_hash) end up in one chunk, the insertion will be impossible (ON CONFLICT DO UPDATE). For example:
INSERT INTO current_coin_balances (owner_address, coin_type_hash, coin_type,
amount, last_transaction_version, last_transaction_timestamp)
VALUES
('0x1', '0x1', 'test', 1, 1, now()),
('0x1', '0x1', 'test', 1, 2, now())
ON CONFLICT (owner_address, coin_type_hash) DO UPDATE
SET
amount = excluded.amount,
last_transaction_version = excluded.last_transaction_version
WHERE current_coin_balances.last_transaction_version <= excluded.last_transaction_version;
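A common fix is to deduplicate the chunk before the INSERT, keeping only the row with the highest last_transaction_version per (owner_address, coin_type_hash), since Postgres rejects a multi-row ON CONFLICT DO UPDATE that would touch the same row twice ("ON CONFLICT DO UPDATE command cannot affect row a second time"). A sketch with tuples in place of the real structs:

```python
def dedup_coin_balances(rows):
    """Keep one row per (owner_address, coin_type_hash) conflict key.

    Rows are (owner_address, coin_type_hash, coin_type, amount, version)
    tuples; for duplicates we keep the highest last_transaction_version,
    matching the WHERE clause of the upsert above.
    """
    latest = {}
    for row in rows:
        key = (row[0], row[1])
        if key not in latest or row[4] > latest[key][4]:
            latest[key] = row
    return list(latest.values())

rows = [
    ("0x1", "0x1", "test", 1, 1),
    ("0x1", "0x1", "test", 1, 2),  # same conflict key, newer version: wins
]
print(dedup_coin_balances(rows))  # [('0x1', '0x1', 'test', 1, 2)]
```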
The focus contract address alpha feature is presumably intended to simplify client filtering behavior: only events corresponding to a particular Move package are allowed through by the top-level worker process.
However, the underlying implementation appears to filter based on the public entry function rather than the event address. E.g. if a project composes on top of another package and invokes the underlying package via a wrapped function call, then the wrapping package's events (which are also the wrapped package's events) get removed.
Instead, it is suggested that filtration rely on the address of the emitted event types, or alternatively, that focus_contract_addresses be renamed to something like entry_function_address, with an additional event_type_address option.
server_config:
  transaction_filter:
    focus_contract_addresses:
      - $ECONIA_ADDRESS
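Filtering by the address of the emitted event type, as suggested, only needs the address prefix of the fully qualified type name (a sketch; event type strings follow the address::module::Name convention, and ECONIA_ADDRESS is a made-up placeholder for $ECONIA_ADDRESS):

```python
def event_type_address(event_type):
    """Return the account address prefix of a Move event type string."""
    return event_type.split("::", 1)[0]

def keep_event(event_type, focus_addresses):
    """Keep an event if its type was declared at a focused address,
    regardless of which entry function (wrapper or not) was invoked."""
    return event_type_address(event_type) in focus_addresses

ECONIA_ADDRESS = "0xc0de"  # hypothetical placeholder
print(keep_event(f"{ECONIA_ADDRESS}::market::PlaceLimitOrderEvent", {ECONIA_ADDRESS}))  # True
print(keep_event("0xdead::wrapper::WrapperEvent", {ECONIA_ADDRESS}))  # False
```

Because the check inspects each event rather than the entry function, events from the focus package survive even when invoked through another package's wrapper.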
Execute the following query in https://cloud.hasura.io/public/graphiql?endpoint=https://indexer.mainnet.aptoslabs.com/v1/graphql
query MyQuery {
current_token_ownerships_v2(where: {token_data_id: {
_eq: "0x5a7b9686aeb01a38bf2450385fd07a9202c5a0f7d7c41495a4f58e8d40984f4e"
}}
) {
amount
owner_address
last_transaction_version
}
}
The result is
{
"data": {
"current_token_ownerships_v2": [
{
"amount": 0,
"owner_address": "0xadeb45c274f9f4f535afe8957a8cf9ffecbd2b79026fba6c207111136d963f14",
"last_transaction_version": 371523890
},
{
"amount": 0,
"owner_address": "0x82117cc55459b4de3ffb371d014d164273e9f2795545ece7c55e8c76bced1e7a",
"last_transaction_version": 377597523
},
{
"amount": 0,
"owner_address": "0x24eea652ba98ed744267eee683805a6a53091a842ea914d4bc43785d7de90c6b",
"last_transaction_version": 371958547
},
{
"amount": 0,
"owner_address": "0x91d8b03c217aea6f2b36bc182dc34021ad6c81fa3dded74cb5240252c27e9657",
"last_transaction_version": 372285477
},
{
"amount": 0,
"owner_address": "0x7f6fd0671110708302d00c2c1549dec9e05183588eed95fdd4f3864e9524da9e",
"last_transaction_version": 374455789
},
{
"amount": 1,
"owner_address": "0xe18dec131fa7165f807d451c468028b1768d04ec52764cbd0234295d4ab8a08d",
"last_transaction_version": 374455789
}
]
}
}
According to the logic in the indexer, this token is currently owned by 0xe18dec131fa7165f807d451c468028b1768d04ec52764cbd0234295d4ab8a08d, but this result is wrong because the token was burned at version 377597523.
It seems the indexer doesn't process DeleteListingEvent, so the amount corresponding to 0xe18dec131fa7165f807d451c468028b1768d04ec52764cbd0234295d4ab8a08d was never updated.
more cases:
token id:
0xa72923d8863c8370d0673cca437262fcfbfa805844003fdb17d344f56e426e8b
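The stale amount = 1 row above can be reproduced by folding activity events into ownerships: if the burn/delete event is never applied, the last holder keeps a nonzero balance (a sketch with simplified event dicts and a truncated owner label, not the indexer's actual data model):

```python
def fold_ownerships(events, handle_burn=True):
    """Fold token activity events into an owner -> amount map, in version order.

    Each event is a dict with type, owner, delta, and version. If burn
    handling is skipped (as the issue suggests happens with
    DeleteListingEvent), the last holder's amount is never zeroed out.
    """
    amounts = {}
    for event in sorted(events, key=lambda e: e["version"]):
        if event["type"] == "burn" and not handle_burn:
            continue  # modelled bug: burn ignored, ownership row goes stale
        amounts[event["owner"]] = amounts.get(event["owner"], 0) + event["delta"]
    return amounts

events = [
    {"type": "deposit", "owner": "0xe18d...", "delta": 1, "version": 374455789},
    {"type": "burn", "owner": "0xe18d...", "delta": -1, "version": 377597523},
]
print(fold_ownerships(events, handle_burn=False))  # {'0xe18d...': 1} - stale
print(fold_ownerships(events, handle_burn=True))   # {'0xe18d...': 0} - correct
```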
Where can the value of indexer_api_key in the config.yaml configuration file be obtained from? Thanks!
Are there any updates on the self-hosted indexer API?
https://aptos.dev/indexer/api/self-hosted
I'm running processors which save data to a Postgres DB, and I would like to open an endpoint on top of it.
It seems no guide on this is provided yet.
If the NFT mint and burn actions are not too close in time, the burn event will be missing from the token_activities_v2 table for this NFT. The possible reason is that the hashmap token_v2_metadata_helper is constructed from write resources, but a burn event like this has no corresponding write resource, which breaks the processing.
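The failure mode can be seen with a small model of the helper: the metadata map is keyed by addresses seen in write resources of the current batch, so a burn whose token never got a write resource in that batch misses the lookup. A defensive fallback could consult previously indexed state instead of dropping the event (a sketch; all names are made up, not the processor's real API):

```python
def build_metadata_helper(write_resources):
    """Model of token_v2_metadata_helper: keyed by write-resource address."""
    return {wr["address"]: wr["data"] for wr in write_resources}

def process_burn(burn_event, helper, fetch_previous=None):
    """Look up token metadata for a burn event, with a fallback.

    When the burn arrives in a batch with no matching write resource
    (mint and burn far apart in time), the plain lookup fails; the
    hypothetical fetch_previous callback stands in for reading earlier
    indexed state rather than silently dropping the activity row.
    """
    metadata = helper.get(burn_event["token_address"])
    if metadata is None and fetch_previous is not None:
        metadata = fetch_previous(burn_event["token_address"])
    return metadata

helper = build_metadata_helper([])  # batch has the burn but no write resource
previously_indexed = {"0xtoken": {"name": "token"}}
print(process_burn({"token_address": "0xtoken"}, helper, previously_indexed.get))
```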