Planet maemo: category "feed:707464dc098c2bcec8595a67d1be151e"

feng gao

[GSoC09] Report #3

2009-07-11 00:43 UTC  by  feng gao
Hi, all:

In the past two weeks, the major work was mining user interest by observing user behavior in gPodder. For each episode the user chooses to download, I record its top-scoring keywords in the behavior model. A mining algorithm then picks the top-n keywords from this model to populate the user profile.
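
As a rough sketch of this idea (not the actual gPodder code), assume each episode comes with a dict that maps keywords to scores. The class name BehaviorModel, its method names and the top_k cutoff are hypothetical and only for illustration; the "mining algorithm" here is simply a sum of scores per keyword.

```python
from collections import Counter


class BehaviorModel:
    """Accumulates keyword scores from episodes the user chose to download."""

    def __init__(self):
        # keyword -> accumulated score over all recorded downloads
        self.scores = Counter()

    def record_download(self, episode_keywords, top_k=5):
        """Record only the top_k highest-scoring keywords of one episode."""
        ranked = sorted(episode_keywords.items(),
                        key=lambda pair: pair[1], reverse=True)
        for keyword, score in ranked[:top_k]:
            self.scores[keyword] += score

    def top_keywords(self, n):
        """Return the n keywords with the highest accumulated score."""
        return [keyword for keyword, _ in self.scores.most_common(n)]


model = BehaviorModel()
model.record_download({"linux": 0.9, "maemo": 0.8, "weather": 0.1}, top_k=2)
model.record_download({"linux": 0.7, "python": 0.6, "sports": 0.2}, top_k=2)
user_profile_keywords = model.top_keywords(2)
```

Summing scores favors keywords that show up in many downloaded episodes; a decay factor for old downloads would be one possible refinement.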

I also tried to redesign the recommendation score algorithm. I want to use the new social tag feature in OpenCalais to score each episode.

I think the basic recommendation work is now done. In the next phase, I will focus on the context-aware module, which will handle not only user awareness but also device and environment context. I will try to design a framework that implements the basic functions, so that other developers can easily use it to add context-aware functions to their applications.

In the next two weeks, I will do some paperwork: I will document the new score algorithm and the framework. At the same time, I will fix some bugs in the current code, release a new version of gPodder, and get some feedback from the community.

If you are interested in my project, you can find much more information at http://garage.maemo.org/projects/newssprite. You can also check out the code from the SVN repository.

Any comments and suggestions are welcome. Thanks.

Report #2

2009-06-23 11:49 UTC  by  feng gao

Hello, all:

In the past two weeks, I have done the following work:

1. Wrote a test script for the semantic module, so that other developers can test it

2. Wrote a simple XML format for the user profile file, so that users can import and export their interests later

3. Revised the semantic module, adding a new class named profile as the base class of the item profile and the user profile

4. Revised the recommendation algorithm

5. Integrated the semantic module into gPodder
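
To give an idea of item 2, here is a minimal sketch of exporting and importing a user profile as XML with Python's standard xml.etree.ElementTree module. The element names (userprofile, interest) and the weight attribute are assumptions for illustration; the real file format in the project may look different.

```python
import xml.etree.ElementTree as ET


def profile_to_xml(interests):
    """Serialize a {keyword: weight} dict to an XML string."""
    root = ET.Element("userprofile")
    for keyword, weight in sorted(interests.items()):
        node = ET.SubElement(root, "interest", weight=repr(weight))
        node.text = keyword
    return ET.tostring(root, encoding="unicode")


def profile_from_xml(xml_text):
    """Parse the XML string produced by profile_to_xml back into a dict."""
    root = ET.fromstring(xml_text)
    return {node.text: float(node.get("weight"))
            for node in root.findall("interest")}


interests = {"linux": 0.9, "maemo": 0.75}
xml_text = profile_to_xml(interests)
restored = profile_from_xml(xml_text)
```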

Problems:

1. The algorithm still needs to be redesigned. The current algorithm does not seem accurate: most items are given a high score (>0.8), because the semantic similarity measure is not very good. I will read some papers and adopt some new ideas, then revise this part to fix the problem.

2. Because I spent much more time on the integration than expected, I have to postpone the interest mining work to the next two weeks.

3. OpenCalais may categorize one word into different types. For example, the word "Linux" may have the type "Technology" or the type "OperatingSystem". So how to generate the user profile correctly is a problem, and how to let the user edit the user profile is another important problem.
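
To make problems 1 and 3 more concrete, here is a small sketch using cosine similarity, a common baseline measure. It is not the algorithm currently in the semantic module. Profiles are assumed to be dicts keyed by (name, type) pairs; the helper merge_types is hypothetical and shows one possible way to treat "Linux" the same regardless of its type.

```python
import math


def cosine_similarity(user_profile, item_profile):
    """Cosine similarity between two {key: weight} dicts, in [0, 1] for non-negative weights."""
    shared = set(user_profile) & set(item_profile)
    dot = sum(user_profile[k] * item_profile[k] for k in shared)
    norm_user = math.sqrt(sum(w * w for w in user_profile.values()))
    norm_item = math.sqrt(sum(w * w for w in item_profile.values()))
    if norm_user == 0 or norm_item == 0:
        return 0.0
    return dot / (norm_user * norm_item)


def merge_types(profile):
    """Collapse {(name, type): weight} into {name: weight}, summing over types."""
    merged = {}
    for (name, _etype), weight in profile.items():
        merged[name] = merged.get(name, 0.0) + weight
    return merged


user = {("Linux", "Technology"): 1.0}
item = {("Linux", "OperatingSystem"): 1.0}

typed_score = cosine_similarity(user, item)
merged_score = cosine_similarity(merge_types(user), merge_types(item))
```

Keyed by type, the two profiles share nothing; after merging, they match completely. Which behavior is right for the user profile is exactly the open question in problem 3.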

Future Plans:

1. If possible, I want to take the relation between two words, the topic category, and the social tags (a new feature introduced in OpenCalais 4.1) into account when calculating the semantic similarity measure.

2. I will work on gPodder to add the essential UI for users to edit the user profile and to import and export it.

3. Start to consider which context information should be taken into account. For example, location or city information may be useful for filtering some news. Any other useful context? Please tell me if you have good ideas about it!

If you are interested in my project, you can find much more information at http://garage.maemo.org/projects/newssprite. You can also check out the code from the SVN repository.

Any comments and suggestions are welcome. Thanks.


[GSoC] Newssprite report #1

2009-06-06 19:14 UTC  by  feng gao

Some problems I met

2009-05-03 04:17 UTC  by  feng gao
This week I tried to implement a semantic annotation and resolution module, but some problems confused me.

1. The first problem is where I should do the semantic work.

My previous work is a system with a server and a client. The server aggregates RSS feeds and then classifies the news items into different topics. On the client side, users cannot subscribe to news items by adding an RSS feed. Instead, they subscribe by selecting the topics they like. The server then broadcasts all information, and the client receives information based on that selection.

In this system, I can easily implement the semantic module, because all the work is done on the server side. I wrote a Python program that uses the python-calais API to annotate each news item, then resolves the response and generates a news item package with metadata.
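
A simplified sketch of the "package with metadata" step might look like the following. The annotation call is deliberately replaced by a stand-in function (fake_annotate), so nothing here is the real python-calais API; the package layout and all function names are assumptions for illustration only.

```python
def build_package(item, annotate):
    """Combine a news item with the entities returned by an annotator.

    item: dict with "title", "link" and "description" keys.
    annotate: callable taking text and returning (name, type, relevance) tuples.
    """
    text = item["title"] + "\n" + item["description"]
    metadata = [
        {"name": name, "type": etype, "relevance": relevance}
        for name, etype, relevance in annotate(text)
    ]
    return {
        "title": item["title"],
        "link": item["link"],
        "content": item["description"],
        "metadata": metadata,
    }


def fake_annotate(text):
    """Stand-in for a real annotation service: matches a fixed entity list."""
    known = {"Linux": "OperatingSystem", "Nokia": "Company"}
    return [(name, etype, 1.0) for name, etype in known.items() if name in text]


item = {
    "title": "Nokia ships a new Linux tablet",
    "link": "http://example.org/news/1",
    "description": "The Internet Tablet runs maemo.",
}
package = build_package(item, fake_annotate)
```

On the client, resolution then only has to read package["metadata"] and never needs to contact the annotation service itself.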

However, in a traditional RSS reader system, this solution cannot work. I am considering two different ways to do the semantic work:

The first is to use Yahoo Pipes, which is similar to my previous solution. The RSS feeds are imported into the pipe, and the pipe annotates the news items using the Calais pipe. On the client side, the resolution is done by separating the metadata from the news content. But this solution has the same problem as my previous one: the user cannot determine which RSS feeds he or she wants to use, and the RSS feed list is fixed. (In fact, I can allow users to import RSS lists from their OPML files, but the list would still be fixed, and a new RSS feed could not easily be added to it.)

The second is to do all the semantic work on the client side. Once a news item is downloaded, it is annotated by Calais and resolved. But I wonder whether the device can perform such complex computation quickly and efficiently.

2. The second problem is what I should finally do.

I think my original idea for this project during GSoC was to implement a smart filter, not a complete RSS reader system. (In fact, I will continue my work after GSoC and finally implement a new RSS reader system.) I plan to test the semantic and context-aware modules on my previous system. But if I do so, no one else can test the work I have done. If I choose to implement a complete RSS reader, I don't think I will have enough time to finish my work. I want to know whether the maemo community or Google will test my work at the mid-term or final evaluation. If they will, I should quickly choose an open source RSS reader in garage and try to integrate my work with it, so that anyone in the community can test it during GSoC.


Do you have any suggestions or solutions for the above two problems? Please tell me.

The first post for NewsSprite

2009-04-24 01:31 UTC  by  feng gao
Hi, all:

I'm so glad that I have been accepted into GSoC 2009. My project is NewsSprite, which aims to add some intelligent features to the reader system so that it can recommend or filter news according to the user's interests and the user's environment. The interests are represented as keywords (or tags). The environment describes the user's physical and abstract conditions, and can be used to answer questions like: What is the user doing now? What is the user's status? Where is the user? And whom is the user with? Moreover, it can even intelligently store and download information according to the network situation and the storage of the Internet Tablet.

I have registered my project on maemo garage; you can find it by searching for newssprite, or simply click here. A mailing list for discussing project-related issues is also available; you can subscribe to it here.

I also maintain a web site for this project on Google Sites, containing my project proposal, paper and other interesting information; you can visit it here.

Finally, you can find me on PlanetSoC (derkaiser). My name is Feng Gao, and my nickname is derkaiser (D.K.). You can also find me on Freenode, Twitter (gaofeng860918), and MSN (mhzl860918@gmail.com). Feel free to contact me if you are interested in my project.