achou (AikoChou)
Machine Learning Engineer

User Details

User Since
Feb 15 2022, 2:51 PM (126 w, 3 d)
Availability
Available
IRC Nick
aiko
LDAP User
Unknown
MediaWiki User
AChou-WMF

Recent Activity

Tue, Jul 9

achou added a comment to T356102: Allow calling revertrisk language agnostic and revert risk multilingual APIs in a pre-save context.

@achou thanks for the documentation!

I made a patch to clarify using -1 for revision ID if the edit has not yet been saved.

Tue, Jul 9, 1:49 PM · Machine-Learning-Team

Thu, Jul 4

achou committed rMLIS1a5e2dfca6b5: revertrisk: Clarify to use -1 if revision doesn't exist (authored by kostajh).
revertrisk: Clarify to use -1 if revision doesn't exist
Thu, Jul 4, 3:31 PM

Fri, Jun 28

achou added a comment to T356102: Allow calling revertrisk language agnostic and revert risk multilingual APIs in a pre-save context.

The feature has been deployed to the revert risk models on ml-staging.

Fri, Jun 28, 4:59 PM · Machine-Learning-Team
achou committed rMLIS0a3b919e00df: revertrisk: accept revision data as input and bypass MW API.
revertrisk: accept revision data as input and bypass MW API
Fri, Jun 28, 2:12 PM

Wed, Jun 26

achou added a comment to T356102: Allow calling revertrisk language agnostic and revert risk multilingual APIs in a pre-save context.

@kostajh yeah we can keep it very simple - all fields need to be provided by the caller, and missing any field will result in an invalid JSON error. As long as the caller can ensure that all required values are correctly calculated, it should work fine, and the new endpoint should be kept for internal use only.

Wed, Jun 26, 12:31 PM · Machine-Learning-Team
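The "all fields required, otherwise an invalid JSON error" rule described in the comment above could look roughly like the following. This is a minimal sketch, not the deployed model-server code: the field names in REQUIRED_FIELDS, the InvalidInputError class, and the parse_revision_payload function are all hypothetical and chosen only for illustration; the real schema is defined by the model.

```python
import json

# Hypothetical field names, used only for illustration.
REQUIRED_FIELDS = ("id", "bytes", "timestamp", "lang")


class InvalidInputError(ValueError):
    """Raised when the request body does not satisfy this sketch's rules."""


def parse_revision_payload(raw: str) -> dict:
    """Parse a JSON body and reject it if any required field is absent."""
    try:
        payload = json.loads(raw)
    except json.JSONDecodeError as exc:
        raise InvalidInputError(f"invalid json: {exc.msg}") from exc
    if not isinstance(payload, dict):
        raise InvalidInputError("invalid json: expected an object")
    # No defaults are filled in: the caller must supply every field.
    missing = [name for name in REQUIRED_FIELDS if name not in payload]
    if missing:
        raise InvalidInputError("invalid json: missing fields " + ", ".join(missing))
    return payload
```

Rejecting incomplete input outright, rather than guessing defaults, keeps the responsibility for computing correct values with the caller, which matches the intent stated in the comment.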

Tue, Jun 25

achou added a comment to T356102: Allow calling revertrisk language agnostic and revert risk multilingual APIs in a pre-save context.

I resumed work on this task this week. Upon reviewing the schema required by the model, I think that some fields should not require user input. Instead, these values should be assigned or calculated by the system. For example, fields such as revision.bytes and revision.timestamp should be system-generated since the model uses these values for prediction. For the revision.id field, we will simply assign -1, representing a pre-save edit.

Tue, Jun 25, 1:45 PM · Machine-Learning-Team
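The idea in the comment above, that the system rather than the user assigns revision.id, revision.bytes, and revision.timestamp, might be sketched as follows. Only the -1 value for a pre-save edit comes from the comment; the function name, the dictionary keys, the UTF-8 byte count, and the timestamp format are assumptions made for this example.

```python
from datetime import datetime, timezone
from typing import Optional

PRE_SAVE_REVISION_ID = -1  # from the comment: -1 represents a pre-save edit


def fill_system_fields(revision: dict, text: str, now: Optional[datetime] = None) -> dict:
    """Return a copy of `revision` with system-generated fields filled in."""
    now = now or datetime.now(timezone.utc)
    filled = dict(revision)
    filled["id"] = PRE_SAVE_REVISION_ID
    # Assumption: "bytes" means the UTF-8 encoded size of the edited text.
    filled["bytes"] = len(text.encode("utf-8"))
    # Assumption: timestamps are ISO 8601 strings in UTC.
    filled["timestamp"] = now.strftime("%Y-%m-%dT%H:%M:%SZ")
    return filled
```

Passing `now` explicitly makes the function deterministic in tests; in normal use it falls back to the current UTC time.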

Fri, Jun 21

achou committed rMLISf05a8fbaf423: articlequality: add predict.
articlequality: add predict
Fri, Jun 21, 7:37 PM
achou created P65320 error.
Fri, Jun 21, 3:11 PM

Thu, Jun 20

achou committed rMLIS65dac0639f7a: articlequality: add feature preprocess.
articlequality: add feature preprocess
Thu, Jun 20, 2:33 PM

Jun 19 2024

achou closed T358744: Deploy RR-language-agnostic batch version to prod, a subtask of T348153: Q3 2024 Goal: Lift Wing users can request multiple predictions using a single request., as Resolved.
Jun 19 2024, 11:55 AM · Goal, Machine-Learning-Team
achou closed T358744: Deploy RR-language-agnostic batch version to prod as Resolved.
Jun 19 2024, 11:55 AM · Machine-Learning-Team
achou closed T366250: Test Revert Risk model with the transparent config as Resolved.
Jun 19 2024, 11:54 AM · Machine-Learning-Team
achou moved T363725: Patch Location headers of HTTP redirects coming from the MW API in Lift Wing services from In Progress to Ready To Go on the Machine-Learning-Team board.
Jun 19 2024, 11:53 AM · Machine-Learning-Team

Jun 10 2024

achou created P64529 Extracting features.
Jun 10 2024, 11:55 AM

Jun 5 2024

achou added a comment to T366250: Test Revert Risk model with the transparent config.

I tested the Revert Risk models with the transparent config in staging, and they worked without any issues. Notably, the transparent config appears to improve the performance of the revertrisk multilingual isvc: requests per second (RPS) increased from 2.9 to 6.79 in the load test results.

Jun 5 2024, 4:41 PM · Machine-Learning-Team
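To put the two RPS figures from the comment above in relative terms, here is a small arithmetic sketch. The helper name relative_change is made up for this example; the inputs 2.9 and 6.79 are the values reported in the comment.

```python
def relative_change(before: float, after: float) -> tuple:
    """Return (speedup ratio, percentage increase) between two measurements."""
    ratio = after / before
    percent = (after - before) / before * 100
    return ratio, percent


ratio, percent = relative_change(2.9, 6.79)
print(f"{ratio:.2f}x throughput, {percent:.0f}% increase")
# prints "2.34x throughput, 134% increase"
```

This only restates the reported numbers; it says nothing about why the transparent config was faster.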
achou added a comment to T358744: Deploy RR-language-agnostic batch version to prod.

The new revertrisk images have been deployed to production.

Jun 5 2024, 4:26 PM · Machine-Learning-Team
achou created P64095 [load test] revertrisk multilingual (codfw).
Jun 5 2024, 10:40 AM
achou created P64092 [load test] revertrisk multilingual (transparent proxy).
Jun 5 2024, 10:30 AM
achou updated the title for P64089 [load test] revertrisk language agnostic (KI v0.8) (transparent proxy) from [load test] revertrisk language agnostic (KI v0.8) to [load test] revertrisk language agnostic (KI v0.8) (transparent proxy).
Jun 5 2024, 10:28 AM
achou created P64089 [load test] revertrisk language agnostic (KI v0.8) (transparent proxy).
Jun 5 2024, 10:21 AM

Jun 3 2024

achou committed rMLIS80b4f3197810: revertrisk: modify the response type in batch model.
revertrisk: modify the response type in batch model
Jun 3 2024, 1:34 PM
achou committed rMLIS259081897e51: outlink: move test_transformer to unit test directory.
outlink: move test_transformer to unit test directory
Jun 3 2024, 8:49 AM

May 30 2024

achou created T366298: Move all isvcs to the transparent config.
May 30 2024, 3:48 PM · Machine-Learning-Team
achou created T366250: Test Revert Risk model with the transparent config.
May 30 2024, 9:21 AM · Machine-Learning-Team

May 29 2024

achou claimed T363725: Patch Location headers of HTTP redirects coming from the MW API in Lift Wing services.
May 29 2024, 9:22 AM · Machine-Learning-Team
achou moved T363725: Patch Location headers of HTTP redirects coming from the MW API in Lift Wing services from Ready To Go to In Progress on the Machine-Learning-Team board.
May 29 2024, 9:21 AM · Machine-Learning-Team

May 24 2024

achou updated subscribers of T361483: Selectively disable changeprop functionality that is no longer used.

Hi @dcausse @EBernhardson, I just wanted to check with you whether it is acceptable to lose some events in the eqiad.mediawiki_page_outlink_topic_prediction_change_v1 and eqiad.mediawiki_revision_score_drafttopic streams when we transition from changeprop to cp-jobqueue. If I recall correctly, Search uses these streams to update Elasticsearch. I checked the consumer groups on the dashboards (outlink, drafttopic), and cirrus-streaming-updater-producer-eqiad was there. :)

May 24 2024, 1:00 PM · Patch-For-Review, Machine-Learning-Team, Lift-Wing, ORES, RESTBase Sunsetting, Content-Transform-Team, serviceops, ChangeProp, API Platform (RESTBase Deprecation Roadmap)

May 17 2024

achou created T365226: Investigate a way to return other 2xx status code from predict in kserve.
May 17 2024, 9:01 AM · Machine-Learning-Team

May 16 2024

achou created P62477 (An Untitled Masterwork).
May 16 2024, 9:45 AM

May 15 2024

achou claimed T356102: Allow calling revertrisk language agnostic and revert risk multilingual APIs in a pre-save context.
May 15 2024, 12:24 PM · Machine-Learning-Team
achou moved T356045: Test revertrisk-multilingual with GPU from Ready To Go to Blocked on the Machine-Learning-Team board.
May 15 2024, 12:21 PM · Machine-Learning-Team

May 14 2024

achou renamed T363554: Airflow training pipeline from Sprint: Airflow training pipeline to Airflow training pipeline.
May 14 2024, 1:54 PM · Machine-Learning-Team
achou moved T363554: Airflow training pipeline from Unsorted to Ready To Go on the Machine-Learning-Team board.
May 14 2024, 11:52 AM · Machine-Learning-Team
achou moved T356102: Allow calling revertrisk language agnostic and revert risk multilingual APIs in a pre-save context from Backlog/Lift Wing to In Progress on the Machine-Learning-Team board.
May 14 2024, 11:51 AM · Machine-Learning-Team
achou added a comment to T356102: Allow calling revertrisk language agnostic and revert risk multilingual APIs in a pre-save context.

Thanks for sharing the use case!

Potentially called on all edit attempts by not-yet-logged-in users.

One thing to note is that for edits by not-yet-logged-in users, the revert risk multilingual (RRML) model might be more suitable than the revertrisk language agnostic (RRLA) model, as it handles bias better. However, RRML requires more resources and is much slower, with prediction latency ranging from hundreds of ms to a few seconds.

May 14 2024, 11:46 AM · Machine-Learning-Team

May 6 2024

achou added a comment to T363870: [Session] Cuteness association meetup.

Can't believe I missed this :(

May 6 2024, 6:54 PM · Wikimedia-Hackathon-2024
achou awarded T364239: Add fox icon / badge to Phabricator a Fox token.
May 6 2024, 6:22 PM · Phabricator (2024-05-05), Release-Engineering-Team, User-brennen, Upstream, Wikimedia-Hackathon-2024

Apr 26 2024

achou committed rMLIS176666cfbaa5: revertrisk: support all wikis and upgrade KI to v0.7 (authored by isarantopoulos).
revertrisk: support all wikis and upgrade KI to v0.7
Apr 26 2024, 2:28 PM
achou closed T355656: Investigate how to implement batch inference for revertrisk-multilingual as Declined.
Apr 26 2024, 2:14 PM · Patch-For-Review, Machine-Learning-Team
achou closed T355656: Investigate how to implement batch inference for revertrisk-multilingual, a subtask of T348153: Q3 2024 Goal: Lift Wing users can request multiple predictions using a single request., as Declined.
Apr 26 2024, 2:14 PM · Goal, Machine-Learning-Team
achou added a comment to T358744: Deploy RR-language-agnostic batch version to prod.

I got an error when testing the batch model after deploying the new kserve 0.12.1 image for the revert risk models:

aikochou@deploy1002:~$ curl "https://inference-staging.svc.codfw.wmnet:30443/v1/models/revertrisk-language-agnostic:predict" -d@./input_some_succeed.json -H "Host: revertrisk-language-agnostic-batcher.revertrisk.wikimedia.org" --http1.1 -k | jq '.'
{
  "error": "AttributeError : 'JSONResponse' object has no attribute 'encode'"
}

It worked before. There may be a change in kserve 0.12.1 that's causing the problem. I'll debug this.

Apr 26 2024, 1:40 PM · Machine-Learning-Team
achou moved T356102: Allow calling revertrisk language agnostic and revert risk multilingual APIs in a pre-save context from Watching to Backlog/Lift Wing on the Machine-Learning-Team board.
Apr 26 2024, 12:51 PM · Machine-Learning-Team
achou added a comment to T356102: Allow calling revertrisk language agnostic and revert risk multilingual APIs in a pre-save context.

Hi @kostajh, yes, this is something we can work on this quarter. I am wondering if there's an ongoing project or product in development that needs this feature. If so, could you provide the links? Also, do you have an estimate of the expected traffic for this feature? I'm assuming it will be requested via the external endpoint, correct?

Apr 26 2024, 12:44 PM · Machine-Learning-Team
achou created T363554: Airflow training pipeline.
Apr 26 2024, 9:38 AM · Machine-Learning-Team

Apr 23 2024

achou added a comment to T356102: Allow calling revertrisk language agnostic and revert risk multilingual APIs in a pre-save context.

Thanks, that is what I am proposing as well. @achou, how feasible do you think this is from your side? It would involve accepting a POST with all the features (https://gitlab.wikimedia.org/repos/research/knowledge_integrity/-/blob/main/knowledge_integrity/featureset.py?ref_type=heads) needed.

Apr 23 2024, 3:13 PM · Machine-Learning-Team
achou moved T356045: Test revertrisk-multilingual with GPU from In Progress to Ready To Go on the Machine-Learning-Team board.
Apr 23 2024, 12:04 PM · Machine-Learning-Team

Apr 19 2024

achou committed rMLIS0706f1a55693: revertrisk: add support for base model's payloads in batch model.
revertrisk: add support for base model's payloads in batch model
Apr 19 2024, 9:29 AM

Apr 16 2024

achou created M335: Trainings-Workflow.
Apr 16 2024, 2:15 PM
achou added a comment to T356102: Allow calling revertrisk language agnostic and revert risk multilingual APIs in a pre-save context.

@kostajh @XiaoXiao-WMF thanks for tagging. Sorry I was unaware of the discussion here. The ML team is currently in the middle of quarterly planning. I will bring up the proposal during our planning and get back to you shortly!

Apr 16 2024, 1:31 PM · Machine-Learning-Team

Apr 12 2024

achou created P60462 test.
Apr 12 2024, 10:15 AM

Apr 11 2024

achou committed rMLIS901a1b20990b: revertrisk: use the Pytorch base image for RRML GPU inference.
revertrisk: use the Pytorch base image for RRML GPU inference
Apr 11 2024, 3:35 PM

Apr 9 2024

achou added a comment to T356045: Test revertrisk-multilingual with GPU.

I built an RRML image locally using the Pytorch 2.2.x base image from T360638.

Apr 9 2024, 10:03 AM · Machine-Learning-Team
achou moved T356045: Test revertrisk-multilingual with GPU from Ready To Go to In Progress on the Machine-Learning-Team board.
Apr 9 2024, 8:24 AM · Machine-Learning-Team

Apr 8 2024

achou committed rMLIS25333d8fb60c: revertrisk: update KI to v0.6 for RRML and RR-wikidata.
revertrisk: update KI to v0.6 for RRML and RR-wikidata
Apr 8 2024, 3:47 PM

Apr 5 2024

achou closed T360423: Deploy RevertRisk language-agnostic with knowledge integrity v0.6.0 as Resolved.

We have deployed the new RRLA model server to production.

Apr 5 2024, 7:00 PM · Machine-Learning-Team
achou moved T360423: Deploy RevertRisk language-agnostic with knowledge integrity v0.6.0 from In Progress to 2023-2024 Q4 Done on the Machine-Learning-Team board.
Apr 5 2024, 7:00 PM · Machine-Learning-Team
achou updated the task description for T360423: Deploy RevertRisk language-agnostic with knowledge integrity v0.6.0.
Apr 5 2024, 6:56 PM · Machine-Learning-Team

Apr 4 2024

achou closed T361234: Fix locust load testing for Revert Risk models as Resolved.

This task is complete. I've created T361881 to follow up on the above test results issue.

Apr 4 2024, 8:13 PM · Patch-For-Review, Machine-Learning-Team
achou moved T361234: Fix locust load testing for Revert Risk models from Unsorted to 2023-2024 Q4 Done on the Machine-Learning-Team board.
Apr 4 2024, 8:11 PM · Patch-For-Review, Machine-Learning-Team
achou created T361881: Investigate the inconsistent load test results (locust) for revertrisk.
Apr 4 2024, 8:08 PM · Machine-Learning-Team
achou moved T358744: Deploy RR-language-agnostic batch version to prod from Ready To Go to In Progress on the Machine-Learning-Team board.
Apr 4 2024, 7:54 PM · Machine-Learning-Team
achou moved T356045: Test revertrisk-multilingual with GPU from Blocked to Ready To Go on the Machine-Learning-Team board.
Apr 4 2024, 7:54 PM · Machine-Learning-Team
achou moved T360406: Error handling in Batch Predictions for RevertRisk Models from In Progress to 2023-2024 Q4 Done on the Machine-Learning-Team board.
Apr 4 2024, 7:53 PM · Patch-For-Review, Machine-Learning-Team
achou moved T351278: Improving error message for Revertrisk models from In Progress to 2023-2024 Q4 Done on the Machine-Learning-Team board.
Apr 4 2024, 7:53 PM · Patch-For-Review, Machine-Learning-Team
achou moved T358748: Prep work for (re)training workflow sprint from Ready To Go to 2023-2024 Q4 Done on the Machine-Learning-Team board.
Apr 4 2024, 7:52 PM · Machine-Learning-Team
achou closed T358748: Prep work for (re)training workflow sprint as Resolved.
Apr 4 2024, 7:52 PM · Machine-Learning-Team
achou added a comment to T355742: Assess runtime performance impact of pydantic data models in the RRLA model-server.

FYI @MunizaA :)

The new RRLA model server featuring KI v0.6 has been deployed to ML-staging. I used wrk to run load tests and compare the performance of the old and new versions. The results for the previous version are under P59447, and the results for the new version are under P59464. These results show that the new KI version does not affect performance metrics such as average latency and RPS.

Apr 4 2024, 7:50 PM · Patch-For-Review, Machine-Learning-Team
achou added a comment to T360423: Deploy RevertRisk language-agnostic with knowledge integrity v0.6.0.

The new RRLA model server featuring KI v0.6 has been deployed to ML-staging. I used wrk to run load tests and compare the performance of the old and new versions. The results for the previous version are under P59447, and the results for the new version are under P59464. These results show that the new KI version does not affect performance metrics such as average latency and RPS.

Apr 4 2024, 7:45 PM · Machine-Learning-Team
achou updated the task description for T360423: Deploy RevertRisk language-agnostic with knowledge integrity v0.6.0.
Apr 4 2024, 7:25 PM · Machine-Learning-Team
achou updated subscribers of T358744: Deploy RR-language-agnostic batch version to prod.

I'm reposting here what I previously wrote, as the issue is more related to deployment.

Apr 4 2024, 7:25 PM · Machine-Learning-Team
achou closed T360406: Error handling in Batch Predictions for RevertRisk Models as Resolved.

This task is complete. Check out these examples:

Apr 4 2024, 7:15 PM · Patch-For-Review, Machine-Learning-Team
achou closed T360406: Error handling in Batch Predictions for RevertRisk Models, a subtask of T358744: Deploy RR-language-agnostic batch version to prod, as Resolved.
Apr 4 2024, 7:15 PM · Machine-Learning-Team
achou updated the task description for T360406: Error handling in Batch Predictions for RevertRisk Models.
Apr 4 2024, 6:58 PM · Patch-For-Review, Machine-Learning-Team
achou edited P59464 [load test] revertrisk language agnostic (KI v0.6).
Apr 4 2024, 12:25 PM
achou created P59464 [load test] revertrisk language agnostic (KI v0.6).
Apr 4 2024, 12:24 PM
achou closed T351278: Improving error message for Revertrisk models as Resolved.

This task is complete. Check out these examples of new error messages:

$ curl "https://inference-staging.svc.codfw.wmnet:30443/v1/models/revertrisk-language-agnostic:predict" -d '{"rev_id": 15925124, "lang": "ro"}' -H "Host: revertrisk-language-agnostic.revertrisk.wikimedia.org" --http1.1 -k | jq '.'
{
  "detail": "Could not make prediction for revision 15925124 (ro). Reason: revision_missing"
}
Apr 4 2024, 12:19 PM · Patch-For-Review, Machine-Learning-Team
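For callers that prefer Python over curl, the request shown above could be assembled roughly as follows. This is a sketch that only builds the request object and does not send it, since the staging endpoint is internal. The URL, Host header, and payload keys are copied from the curl example; the function name build_predict_request is hypothetical.

```python
import json
import urllib.request

# Both values are taken verbatim from the curl example.
URL = "https://inference-staging.svc.codfw.wmnet:30443/v1/models/revertrisk-language-agnostic:predict"
HOST = "revertrisk-language-agnostic.revertrisk.wikimedia.org"


def build_predict_request(rev_id: int, lang: str) -> urllib.request.Request:
    """Build (but do not send) a POST request equivalent to the curl call."""
    body = json.dumps({"rev_id": rev_id, "lang": lang}).encode("utf-8")
    # No Content-Type is set here, mirroring the curl command, which does
    # not pass one explicitly either.
    return urllib.request.Request(URL, data=body, headers={"Host": HOST}, method="POST")


request = build_predict_request(15925124, "ro")
print(request.get_method(), request.full_url)
```

Actually sending the request would additionally need network access to the internal endpoint and a decision on TLS verification, which curl's -k flag disables; that part is deliberately left out of the sketch.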
achou committed rMLIS891bacff86f4: revertrisk: error handling for batch requests.
revertrisk: error handling for batch requests
Apr 4 2024, 9:40 AM
achou created P59447 [load test] revertrisk language agnostic.
Apr 4 2024, 9:33 AM

Apr 3 2024

achou updated subscribers of T360406: Error handling in Batch Predictions for RevertRisk Models.

@kevinbazira posed a question - how can end users switch between batch and non-batch requests?

Apr 3 2024, 4:20 PM · Patch-For-Review, Machine-Learning-Team
achou committed rMLIS7af63a06e2ee: locust: fix missing host header for revertrisk load tests.
locust: fix missing host header for revertrisk load tests
Apr 3 2024, 11:13 AM

Apr 2 2024

achou moved T355656: Investigate how to implement batch inference for revertrisk-multilingual from Ready To Go to Backlog/Lift Wing on the Machine-Learning-Team board.
Apr 2 2024, 9:40 AM · Patch-For-Review, Machine-Learning-Team

Mar 28 2024

achou updated subscribers of T361234: Fix locust load testing for Revert Risk models.

@isarantopoulos do you remember the config values in locust.conf when you ran the revertrisk tests? I can't reproduce the result in revertrisk_stats.csv. I haven't deployed RRLA to staging yet, so it's the same model you tested.

Mar 28 2024, 4:59 PM · Patch-For-Review, Machine-Learning-Team
achou created P58996 load test #2.
Mar 28 2024, 4:07 PM
achou created P58995 load test #1.
Mar 28 2024, 4:06 PM
achou created T361238: Update and fix locust load testing for revscoring models.
Mar 28 2024, 2:38 PM · Machine-Learning-Team
achou created T361234: Fix locust load testing for Revert Risk models.
Mar 28 2024, 2:23 PM · Patch-For-Review, Machine-Learning-Team

Mar 27 2024

achou created P58959 load testing.
Mar 27 2024, 2:46 PM

Mar 26 2024

achou moved T355656: Investigate how to implement batch inference for revertrisk-multilingual from In Progress to Ready To Go on the Machine-Learning-Team board.
Mar 26 2024, 4:07 PM · Patch-For-Review, Machine-Learning-Team
achou claimed T360423: Deploy RevertRisk language-agnostic with knowledge integrity v0.6.0.
Mar 26 2024, 4:06 PM · Machine-Learning-Team
achou moved T360423: Deploy RevertRisk language-agnostic with knowledge integrity v0.6.0 from Ready To Go to In Progress on the Machine-Learning-Team board.
Mar 26 2024, 4:06 PM · Machine-Learning-Team
achou set the point value for T360423: Deploy RevertRisk language-agnostic with knowledge integrity v0.6.0 to 2.
Mar 26 2024, 4:02 PM · Machine-Learning-Team
achou moved T360423: Deploy RevertRisk language-agnostic with knowledge integrity v0.6.0 from Backlog/Lift Wing to Ready To Go on the Machine-Learning-Team board.
Mar 26 2024, 4:00 PM · Machine-Learning-Team
achou created P58922 dep.
Mar 26 2024, 1:57 PM
achou committed rMLIS050e347821c5: revertrisk: improve error messages.
revertrisk: improve error messages
Mar 26 2024, 11:12 AM

Mar 25 2024

achou created P58905 rr-ml-gpu.
Mar 25 2024, 12:48 PM
achou created P58904 docker history.
Mar 25 2024, 12:47 PM

Mar 22 2024

achou created P58900 debug.
Mar 22 2024, 4:58 PM
achou created P58899 docker-pkg-build.log.
Mar 22 2024, 4:41 PM
achou created P58898 docker-pkg build.
Mar 22 2024, 3:51 PM

Mar 20 2024

achou created P58825 HF image.
Mar 20 2024, 3:45 PM