With Great Humor Comes Great Developer Engagement

faker-ruby/faker 4 Dec 2023

The worldwide collaborative effort for the creation of software is technically and socially demanding.

Software Engineering

TwHIN: Embedding the Twitter Heterogeneous Information Network for Personalized Recommendation

twitter/the-algorithm-ml 11 Feb 2022

Social networks, such as Twitter, form a heterogeneous information network (HIN) where nodes represent domain entities (e. g., user, content, advertiser, etc.)

Social and Information Networks

TF-GridNet: Integrating Full- and Sub-Band Modeling for Speech Separation

aigc-audio/audiogpt 22 Nov 2022

To show its robustness to noise and reverberation, we evaluate it on monaural reverberant speaker separation using the SMS-WSJ dataset and on noisy-reverberant speaker separation using WHAMR!, and obtain state-of-the-art performance on both datasets.

Sound Audio and Speech Processing

Towards Scalable Dataframe Systems

modin-project/modin 3 Jan 2020

Dataframes are a popular abstraction to represent, prepare, and analyze data.

Databases

WhisperX: Time-Accurate Speech Transcription of Long-Form Audio

m-bain/whisperx 1 Mar 2023

Large-scale, weakly-supervised speech recognition models, such as Whisper, have demonstrated impressive results on speech recognition across domains and languages.

Sound Audio and Speech Processing

NIPS4Bplus: a richly annotated birdsong audio dataset

benawad/dogehouse 6 Nov 2018

Recent advances in birdsong detection and classification have approached a limit due to the lack of fully annotated recordings.

Sound Digital Libraries Audio and Speech Processing

Engineering Record And Replay For Deployability: Extended Technical Report

mozilla/rr 16 May 2017

With some novel optimizations, our system 'rr' records and replays real-world low-parallelism workloads with low overhead, with an entirely user-space implementation, using stock hardware, compilers, runtimes and operating systems.

Programming Languages

Pulsar: Efficient Sphere-based Neural Rendering

facebookresearch/pytorch3d CVPR 2021

To alleviate these problems, Pulsar employs: 1) a sphere-based scene representation, 2) an efficient differentiable rendering engine, and 3) neural shading.

Graphics

Merkle-CRDTs: Merkle-DAGs meet CRDTs

orbitdb/orbitdb 27 Apr 2020

We study Merkle-DAGs as a transport and persistence layer for Conflict-Free Replicated Data Types (CRDTs), coining the term Merkle-CRDTs and providing an overview of the different concepts, properties, advantages and limitations involved.

Networking and Internet Architecture

A lightweight design for serverless Function-as-a-Service

second-state/SSVM 13 Oct 2020

FaaS (Function as a Service) allows developers to upload and execute code in the cloud without managing servers.

Distributed, Parallel, and Cluster Computing