#bigquery


🚀 DataTalksClub's Data Engineering Zoomcamp Week 3 - BigQuery as a data warehousing solution.

🎯 For this week's module, we used Google BigQuery to read Parquet files from a GCS bucket and compared querying across native, external, and partitioned/clustered tables (a rough sketch of the setup is below).

🔗 My answers to this module: github.com/goosethedev/de-zoom

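A minimal sketch of that kind of comparison, assuming hypothetical names throughout (dataset `zoomcamp`, bucket path `gs://my-bucket/yellow/*.parquet`, and illustrative NYC-taxi-style columns `tpep_pickup_datetime` / `VendorID`); the actual homework lives in the repo linked above:

```ts
import {BigQuery} from '@google-cloud/bigquery';

const bq = new BigQuery();
const ds = bq.dataset('zoomcamp'); // hypothetical dataset name

async function main() {
  // External table: BigQuery reads the Parquet files in place from GCS.
  await ds.createTable('trips_ext', {
    externalDataConfiguration: {
      sourceFormat: 'PARQUET',
      sourceUris: ['gs://my-bucket/yellow/*.parquet'], // hypothetical path
    },
  });

  // Native table materialised from the external one.
  await bq.query(`
    CREATE OR REPLACE TABLE zoomcamp.trips AS
    SELECT * FROM zoomcamp.trips_ext
  `);

  // Partitioned + clustered copy: partition pruning and clustering mean
  // selective queries scan far less data than a full-table scan.
  await bq.query(`
    CREATE OR REPLACE TABLE zoomcamp.trips_part
    PARTITION BY DATE(tpep_pickup_datetime)
    CLUSTER BY VendorID AS
    SELECT * FROM zoomcamp.trips
  `);

  // Compare estimated bytes processed with a dry run against each table.
  for (const table of ['trips_ext', 'trips', 'trips_part']) {
    const [job] = await bq.createQueryJob({
      query: `SELECT COUNT(DISTINCT VendorID) FROM zoomcamp.${table}
              WHERE DATE(tpep_pickup_datetime) = '2024-03-01'`,
      dryRun: true, // estimate only, nothing is billed
    });
    console.log(table, job.metadata.statistics.totalBytesProcessed, 'bytes');
  }
}

main().catch(console.error);
```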

FFS. Turns out (after I built a feature) that you can't supply a schema for BigQuery Materialised Views.

> Error: googleapi: Error 400: Schema field shouldn't be used as input with a materialized view, invalid

So it's impossible to have column descriptions for MVs? That sucks.
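For reference, a minimal sketch of what triggers that 400, using the Node.js client (`@google-cloud/bigquery`) and hypothetical dataset/table/column names; the MV must be defined purely by its query, with no schema attached:

```ts
import {BigQuery} from '@google-cloud/bigquery';

const bq = new BigQuery();

async function createMv() {
  // Works: a materialised view is defined only by its query.
  await bq.dataset('logs').createTable('daily_counts_mv', {
    materializedView: {
      query: `SELECT DATE(ts) AS day, COUNT(*) AS n
              FROM logs.events GROUP BY day`, // hypothetical source table
    },
  });

  // Fails with "Error 400: Schema field shouldn't be used as input with
  // a materialized view, invalid" -- a schema (and with it, column
  // descriptions) can't be supplied alongside the MV definition.
  await bq.dataset('logs').createTable('daily_counts_mv2', {
    materializedView: {
      query: `SELECT DATE(ts) AS day, COUNT(*) AS n
              FROM logs.events GROUP BY day`,
    },
    schema: [
      {name: 'day', type: 'DATE', description: 'UTC day bucket'},
      {name: 'n', type: 'INTEGER', description: 'Events per day'},
    ],
  });
}

createMv().catch(console.error);
```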

Whilst migrating our log pipeline to the BigQuery Storage API, and thus to end-to-end streaming of data from Cloud Storage (GCS) via Eventarc and Cloud Run (read, transform, enrich; Node.js) into BigQuery, I tested some big files, many times the size of the largest we've ever seen in the wild.

It runs at just over 3 log lines/rows per millisecond end-to-end (i.e. including the write to BigQuery) over 3.2M log lines. That's roughly 3,000 rows/second, so on the order of 18 minutes for the full file.

Would be interested to know how that compares with similar systems.
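For a sense of the shape of that pipeline, here's a heavily simplified sketch of such a Cloud Run handler, with hypothetical names throughout. For brevity it streams via the legacy `table.insert()` (insertAll) path rather than the Storage Write API the post actually uses, whose Node client (`@google-cloud/bigquery-storage`) is considerably more involved:

```ts
import express from 'express';
import {Storage} from '@google-cloud/storage';
import {BigQuery} from '@google-cloud/bigquery';
import * as readline from 'node:readline';

const app = express();
app.use(express.json());

const storage = new Storage();
const table = new BigQuery().dataset('logs').table('events'); // hypothetical

// Eventarc delivers the GCS object-finalized event as a JSON body
// containing the object's bucket and name.
app.post('/', async (req, res) => {
  const {bucket, name} = req.body;
  const rl = readline.createInterface({
    input: storage.bucket(bucket).file(name).createReadStream(),
  });

  let batch: object[] = [];
  for await (const line of rl) {
    // Transform/enrich each log line into a BigQuery row (stub).
    const row = {raw: line, ingested_at: new Date().toISOString()};
    batch.push(row);
    if (batch.length >= 500) {
      await table.insert(batch); // legacy streaming insert, for brevity
      batch = [];
    }
  }
  if (batch.length) await table.insert(batch);
  res.sendStatus(204);
});

app.listen(Number(process.env.PORT ?? 8080));
```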